Multiple Imputation Inferences for Count Data

dc.contributor.advisor

Reiter, Jerome P

dc.contributor.author

Liu, Bo

dc.date.accessioned

2021-05-20T14:12:15Z

dc.date.available

2021-05-20T14:12:15Z

dc.date.issued

2021

dc.department

Statistical Science

dc.description.abstract

Multiple imputation is frequently used for inference with missing data. In cases when the population quantity of interest is desired to be an integer, the original methods for inference need to be modified, as the point estimates based on the average are generally not integers.In this thesis, I propose a modification to the original combining rules, which provides the point estimate as the median of quantities from imputed datasets. Thus, the point estimate of the population quantity of interest is integer-valued when the number of imputed datasets is odd. I derive an estimator of the variance of this modified estimator, as well as a method for obtaining confidence intervals. I compare this method to other ad-hoc methods, such as rounding the original point estimate. Simulations show that these two methods provide similar results, although the novel method has slightly larger mean absolute error. The coverage rate of both methods are close to the nominal coverage of 95%. The correct derivation of variance is important, and simulations show that if one uses the median as point estimate without correcting the variance, the coverage rate is systematically lower.

dc.identifier.uri

https://hdl.handle.net/10161/23152

dc.subject

Statistics

dc.subject

count data

dc.subject

Missing data

dc.subject

Multiple imputation

dc.title

Multiple Imputation Inferences for Count Data

dc.type

Master's thesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Liu_duke_0066N_16127.pdf
Size:
417.23 KB
Format:
Adobe Portable Document Format

Collections