Multiple Imputation Inferences for Count Data

Loading...
Thumbnail Image

Date

2021

Authors

Journal Title

Journal ISSN

Volume Title

Repository Usage Stats

535
views
680
downloads

Abstract

Multiple imputation is frequently used for inference with missing data. In cases when the population quantity of interest is desired to be an integer, the original methods for inference need to be modified, as the point estimates based on the average are generally not integers.In this thesis, I propose a modification to the original combining rules, which provides the point estimate as the median of quantities from imputed datasets. Thus, the point estimate of the population quantity of interest is integer-valued when the number of imputed datasets is odd. I derive an estimator of the variance of this modified estimator, as well as a method for obtaining confidence intervals. I compare this method to other ad-hoc methods, such as rounding the original point estimate. Simulations show that these two methods provide similar results, although the novel method has slightly larger mean absolute error. The coverage rate of both methods are close to the nominal coverage of 95%. The correct derivation of variance is important, and simulations show that if one uses the median as point estimate without correcting the variance, the coverage rate is systematically lower.

Description

Provenance

Citation

Citation

Liu, Bo (2021). Multiple Imputation Inferences for Count Data. Master's thesis, Duke University. Retrieved from https://hdl.handle.net/10161/23152.

Collections


Except where otherwise noted, student scholarship that was shared on DukeSpace after 2009 is made available to the public under a Creative Commons Attribution / Non-commercial / No derivatives (CC-BY-NC-ND) license. All rights in student work shared on DukeSpace before 2009 remain with the author and/or their designee, whose permission may be required for reuse.