Simultaneous Edit and Imputation For Household Data with Structural Zeros

dc.contributor.author

Akande, O

dc.contributor.author

Barrientos, Andres

dc.contributor.author

Reiter, Jerome

dc.date.accessioned

2019-01-30T16:49:42Z

dc.date.available

2019-01-30T16:49:42Z

dc.date.updated

2019-01-30T16:49:41Z

dc.description.abstract

Multivariate categorical data nested within households often include reported values that fail edit constraints---for example, a participating household reports a child's age as older than his biological parent's age---as well as missing values. Generally, agencies prefer datasets to be free from erroneous or missing values before analyzing them or disseminating them to secondary data users. We present a model-based engine for editing and imputation of household data based on a Bayesian hierarchical model that includes (i) a nested data Dirichlet process mixture of products of multinomial distributions as the model for the true latent values of the data, truncated to allow only households that satisfy all edit constraints, (ii) a model for the location of errors, and (iii) a reporting model for the observed responses in error. The approach propagates uncertainty due to unknown locations of errors and missing values, generates plausible datasets that satisfy all edit constraints, and can preserve multivariate relationships within and across individuals in the same household. We illustrate the approach using data from the 2012 American Community Survey.

dc.identifier.issn

2325-0984

dc.identifier.issn

2325-0992

dc.identifier.uri

https://hdl.handle.net/10161/17928

dc.language

en

dc.publisher

Oxford University Press (OUP)

dc.relation.ispartof

Journal of Survey Statistics and Methodology

dc.relation.isversionof

10.1093/jssam/smy022

dc.subject

stat.ME

dc.subject

stat.ME

dc.title

Simultaneous Edit and Imputation For Household Data with Structural Zeros

dc.type

Journal article

duke.contributor.orcid

Barrientos, Andres|0000-0001-8196-7229

duke.contributor.orcid

Reiter, Jerome|0000-0002-8374-3832

pubs.organisational-group

Staff

pubs.organisational-group

Duke

pubs.organisational-group

Statistical Science

pubs.organisational-group

Trinity College of Arts & Sciences

pubs.organisational-group

Student

pubs.publication-status

Published online

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
1804.05144v2.pdf
Size:
485.86 KB
Format:
Adobe Portable Document Format