A different way to solve the missing value problem: the case of equal employment opportunity 
data.

zhou, jing

A different way to solve the missing value problem: the case of equal employment opportunity data.

Files

umi-umd-3675.pdf (636.09 KB)

No. of downloads: 1126

Date

2006-07-31

Authors

zhou, jing

Advisor

Smith, Paul

Abstract

The purpose of this thesis is to review methods of imputation and apply them to data collected by Equal Employment Opportunity Commission (EEOC). First, I discuss several imputation methods and review theory of multiple imputation (MI). Next, I review aspects of missing data and outline an artificial data simulation. I describe simulation based on EEOC dataset listing numbers of employees by ethnicity in large establishments. Mean imputation and MI are applied to simulated datasets. In the first scenario, we impute data for nonresponding establishments. The more we impute, the higher our resulting population means. In the second scenario, we simulate item nonresponse. I find mean imputation and MI generate similar means. The means are not affected by percentage of missingness regardless of imputation methods. The results suggest MI produces larger standard error than mean imputation. Last the percentage of missingness has no effect on standard error in case of MI.

URI (handle)

http://hdl.handle.net/1903/3830

Collections

UMD Theses and Dissertations
Mathematics Theses and Dissertations

Full item page