DukeSpace

Bayesian Variable Selection in Structured High-Dimensional Covariate Spaces With Applications in Genomics

DukeSpace

Show simple item record

dc.contributor.author Li, Fan en_US
dc.date.accessioned 2011-06-21T17:30:31Z
dc.date.available 2011-06-21T17:30:31Z
dc.date.issued 2010 en_US
dc.identifier.citation Li,Fan;Zhang,Nancy R.. 2010. Bayesian Variable Selection in Structured High-Dimensional Covariate Spaces With Applications in Genomics. Journal of the American Statistical Association 105(491): 1202-1214. en_US
dc.identifier.issn 0162-1459 en_US
dc.identifier.uri http://hdl.handle.net/10161/4400
dc.description.abstract We consider the problem of variable selection in regression modeling in high-dimensional spaces where there is known structure among the covariates. This is an unconventional variable selection problem for two reasons: (1) The dimension of the covariate space is comparable, and often much larger, than the number of subjects in the study. and (2) the covariate space is highly structured, and in some cases it is desirable to incorporate this structural information in to the model building process. We approach this problem through the Bayesian variable selection framework, where we assume that the covariates lie on an undirected graph and formulate an Ising prior on the model space for incorporating structural information. Certain computational and statistical problems arise that are unique to such high-dimensional, structured settings, the most interesting being the phenomenon of phase transitions. We propose theoretical and computational schemes to mitigate these problems. We illustrate our methods on two different graph structures: the linear chain and the regular graph of degree k. Finally, we use our methods to study a specific application in genomics: the modeling of transcription factor binding sites in DNA sequences. en_US
dc.language.iso en_US en_US
dc.publisher AMER STATISTICAL ASSOC en_US
dc.relation.isversionof doi:10.1198/jasa.2010.tm08177 en_US
dc.subject ising model en_US
dc.subject markov chain monte carlo en_US
dc.subject motif analysis en_US
dc.subject phase transition en_US
dc.subject undirected graph en_US
dc.subject dna regulatory motifs en_US
dc.subject model selection en_US
dc.subject information-content en_US
dc.subject gene-expression en_US
dc.subject microarray data en_US
dc.subject binding-sites en_US
dc.subject identification en_US
dc.subject prediction en_US
dc.subject regression en_US
dc.subject yeast en_US
dc.subject statistics & probability en_US
dc.title Bayesian Variable Selection in Structured High-Dimensional Covariate Spaces With Applications in Genomics en_US
dc.title.alternative en_US
dc.description.version Version of Record en_US
duke.date.pubdate 2010-9-0 en_US
duke.description.endpage 1214 en_US
duke.description.issue 491 en_US
duke.description.startpage 1202 en_US
duke.description.volume 105 en_US
dc.relation.journal Journal of the American Statistical Association en_US

Files in this item

This item appears in the following Collection(s)

Show simple item record