Collinearity Diagnostics for Complex Survey Data

dc.contributor.advisorValliant, Richarden_US
dc.contributor.authorLiao, Danen_US
dc.contributor.departmentSurvey Methodologyen_US
dc.contributor.publisherDigital Repository at the University of Marylanden_US
dc.contributor.publisherUniversity of Maryland (College Park, Md.)en_US
dc.date.accessioned2010-10-07T05:56:50Z
dc.date.available2010-10-07T05:56:50Z
dc.date.issued2010en_US
dc.description.abstractSurvey data are often used to fit models. The values of covariates used in modeling are not controlled as they might be in an experiment. Thus, collinearity among the covariates is an inevitable problem in the analysis of survey data. Although many books and articles have described the collinearity problem and proposed strategies to understand, assess and handle its presence, the survey literature has not provided appropriate diagnostic tools to evaluate its impact on the regression estimation when the survey complexities are considered. The goal of this research is to extend and adapt the conventional ordinary least squares collinearity diagnostics to complex survey data when a linear model or generalized linear model is used. In this dissertation we have developed methods that generally have either a model-based or design-based interpretation. We assume that an analyst uses survey-weighted regression estimators to estimate both underlying model parameters (assuming a correctly specified model) and census-fit parameters in the finite population. Diagnostics statistics, variance inflation factors (VIFs), condition indexes and variance decomposition proportions are constructed to evaluate the impact of collinearity and determine which variables are involved. Survey weights are components of the diagnostic statistics and the estimated variances of the coefficients are obtained from design-consistent estimators which account for complex design features, e.g. clustering and stratification. Illustrations of these methods are given using data from a survey of mental health organizations and a household survey of health and nutrition. We demonstrate that specialized collinearity diagnostic statistics are needed to account for survey weights and complex finite population features that are reflected in the sample design and considered in the regression analysis.en_US
dc.identifier.urihttp://hdl.handle.net/1903/10881
dc.subject.pqcontrolledStatisticsen_US
dc.subject.pquncontrolledCollinearity diagnosticsen_US
dc.subject.pquncontrolledCondition indexen_US
dc.subject.pquncontrolledGeneralized linear modelsen_US
dc.subject.pquncontrolledSurvey weighted least squaresen_US
dc.subject.pquncontrolledVariance decomposition proportionen_US
dc.subject.pquncontrolledVariance inflation factoren_US
dc.titleCollinearity Diagnostics for Complex Survey Dataen_US
dc.typeDissertationen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Liao_umd_0117E_11537.pdf
Size:
1.2 MB
Format:
Adobe Portable Document Format