A Comparison Of Multiple Imputation Methods For Categorical Data

dc.contributor.advisor

Connolly, Michelle P

dc.contributor.advisor

Reiter, Jerome P

dc.contributor.author

Akande, Olanrewaju Michael

dc.date.accessioned

2015-05-12T20:50:55Z

dc.date.available

2015-05-12T20:50:55Z

dc.date.issued

2015

dc.department

Statistical and Economic Modeling

dc.description.abstract

This thesis evaluates the performance of several multiple imputation methods for categorical data, including multiple imputation by chained equations using generalized linear models, multiple imputation by chained equations using classification and regression trees and non-parametric Bayesian multiple imputation for categorical data (using the Dirichlet process mixture of products of multinomial distributions model). The performance of each method is evaluated with repeated sampling studies using housing unit data from the American Community Survey 2012. These data afford exploration of practical problems such as multicollinearity and large dimensions. This thesis highlights some advantages and limitations of each method compared to others. Finally, it provides suggestions on which method should be preferred, and conditions under which the suggestions hold.

dc.identifier.uri

https://hdl.handle.net/10161/10028

dc.subject

Statistics

dc.subject

Item Nonresponse

dc.subject

Missing data

dc.subject

Multiple imputation

dc.title

A Comparison Of Multiple Imputation Methods For Categorical Data

dc.type

Master's thesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Akande_duke_0066N_12966.pdf
Size:
620.15 KB
Format:
Adobe Portable Document Format

Collections