THE USE OF RANDOM FORESTS IN PROPENSITY SCORE WEIGHTING

Zheng, Yating

THE USE OF RANDOM FORESTS IN PROPENSITY SCORE WEIGHTING

dc.contributor.advisor	Stapleton, Laura	en_US
dc.contributor.author	Zheng, Yating	en_US
dc.contributor.department	Measurement, Statistics and Evaluation	en_US
dc.contributor.publisher	Digital Repository at the University of Maryland	en_US
dc.contributor.publisher	University of Maryland (College Park, Md.)	en_US
dc.date.accessioned	2024-06-26T05:47:20Z
dc.date.available	2024-06-26T05:47:20Z
dc.date.issued	2023	en_US
dc.description.abstract	An important problem of social science research is the estimate of causal effects in observationalstudies. Propensity score methods, as effective ways to remove selection bias, have been widely used in estimating causal effects in observational studies. An important step of propensity score methods is to estimate the propensity score. Recently, a machine learning method, random forests, has been proposed as an alternative to the conventional method of logistic regression to estimate the propensity score as it requires less stringent assumptions and provides less biased and more reliable estimate of the treatment effect. However, previous studies only covered limited conditions with a small number of covariates and medium sample sizes, leaving the generalizability of the results in doubt. In addition, previous studies have seldom explored how to choose the hyper-parameters in random forests in the context of propensity score methods. This dissertation, via a simulation study, aims to 1) make a more comprehensive comparison between the use of random forests and logistic regression to determine which model performs better under what conditions, 2) explore the effects of the hyperparameters on the estimate of the treatment effect in propensity score weighting. An empirical study is also used as an illustration about how to choose the hyperparameters in random forests using propensity score weighting in practical settings.	en_US
dc.identifier	https://doi.org/10.13016/g5bl-rpbj
dc.identifier.uri	http://hdl.handle.net/1903/32728
dc.language.iso	en	en_US
dc.subject.pqcontrolled	Educational tests & measurements	en_US
dc.subject.pqcontrolled	Statistics	en_US
dc.subject.pquncontrolled	hyper-parameters	en_US
dc.subject.pquncontrolled	propensity score weighting	en_US
dc.subject.pquncontrolled	random forests	en_US
dc.title	THE USE OF RANDOM FORESTS IN PROPENSITY SCORE WEIGHTING	en_US
dc.type	Dissertation	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Zheng_umd_0117E_23996.pdf
Size:: 3.43 MB
Format:: Adobe Portable Document Format

Download

Collections

UMD Theses and Dissertations
Human Development & Quantitative Methodology Theses and Dissertations