Browsing by Author "Iversen, Edwin S"
Results Per Page
Sort Options
Item Open Access A Bayesian Hierarchical Model with SNP-level Functional Priors Applied to a Pathway-wide Association Study.(2010) Huang, WeiziTremendous effort has been put into study of the etiology of complex
diseases including the breast cancer, type 2 diabetes,
cardiovascular diseases, and prostate cancers. Despite large numbers of reported disease-associated loci,
few associated loci have been replicated, and some true associations
does not belong to the group of the most significant loci
reported to be associated. We built a Bayesian hierarchical model incorporated
with SNP-level functional data that can help identify associated SNPs in pathway-wide association studies.
We applied the model to an association study for the serous invasive ovarian cancer based on the DNA repair and apoptosis pathways. We found that using our model, blocks of SNPs located in regions enriched for missense SNPs or gene inversions were more likely to be identified as candidates of the association.
Item Open Access A Bayesian Model for Nucleosome Positioning Using DNase-seq Data(2015) Zhong, JianlingAs fundamental structural units of the chromatin, nucleosomes are involved in virtually all aspects of genome function. Different methods have been developed to map genome-wide nucleosome positions, including MNase-seq and a recent chemical method requiring genetically engineered cells. However, these methods are either low resolution and prone to enzymatic sequence bias or require genetically modified cells. The DNase I enzyme has been used to probe nucleosome structure since the 1960s, but in the current high throughput sequencing era, DNase-seq has mainly been used to study regulatory sequences known as DNase hypersensitive sites. This thesis shows that DNase-seq data is also very informative about nucleosome positioning. The distinctive oscillatory DNase I cutting patterns on nucleosomal DNA are shown and discussed. Based on these patterns, a Bayes factor is proposed to be used for distinguishing nucleosomal and non-nucleosomal genome positions. The results show that this approach is highly sensitive and specific. A Bayesian method that simulates the data generation process and can provide more interpretable results is further developed based on the Bayes factor investigations. Preliminary results on a test genomic region show that the Bayesian model works well in identifying nucleosome positioning. Estimated posterior distributions also agree with some known biological observations from external data. Taken together, methods developed in this thesis show that DNase-seq can be used to identify nucleosome positioning, adding great value to this widely utilized protocol.
Item Open Access A branching process model for flow cytometry and budding index measurements in cell synchrony experiments.(Ann Appl Stat, 2009) Orlando, David A; Iversen, Edwin S; Hartemink, Alexander J; Haase, Steven BWe present a flexible branching process model for cell population dynamics in synchrony/time-series experiments used to study important cellular processes. Its formulation is constructive, based on an accounting of the unique cohorts in the population as they arise and evolve over time, allowing it to be written in closed form. The model can attribute effects to subsets of the population, providing flexibility not available using the models historically applied to these populations. It provides a tool for in silico synchronization of the population and can be used to deconvolve population-level experimental measurements, such as temporal expression profiles. It also allows for the direct comparison of assay measurements made from multiple experiments. The model can be fit either to budding index or DNA content measurements, or both, and is easily adaptable to new forms of data. The ability to use DNA content data makes the model applicable to almost any organism. We describe the model and illustrate its utility and flexibility in a study of cell cycle progression in the yeast Saccharomyces cerevisiae.Item Restricted Association between DNA damage response and repair genes and risk of invasive serous ovarian cancer.(PLoS One, 2010-04-08) Schildkraut, Joellen M; Iversen, Edwin S; Wilson, Melanie A; Clyde, Merlise A; Moorman, Patricia G; Palmieri, Rachel T; Whitaker, Regina; Bentley, Rex C; Marks, Jeffrey R; Berchuck, AndrewBACKGROUND: We analyzed the association between 53 genes related to DNA repair and p53-mediated damage response and serous ovarian cancer risk using case-control data from the North Carolina Ovarian Cancer Study (NCOCS), a population-based, case-control study. METHODS/PRINCIPAL FINDINGS: The analysis was restricted to 364 invasive serous ovarian cancer cases and 761 controls of white, non-Hispanic race. Statistical analysis was two staged: a screen using marginal Bayes factors (BFs) for 484 SNPs and a modeling stage in which we calculated multivariate adjusted posterior probabilities of association for 77 SNPs that passed the screen. These probabilities were conditional on subject age at diagnosis/interview, batch, a DNA quality metric and genotypes of other SNPs and allowed for uncertainty in the genetic parameterizations of the SNPs and number of associated SNPs. Six SNPs had Bayes factors greater than 10 in favor of an association with invasive serous ovarian cancer. These included rs5762746 (median OR(odds ratio)(per allele) = 0.66; 95% credible interval (CI) = 0.44-1.00) and rs6005835 (median OR(per allele) = 0.69; 95% CI = 0.53-0.91) in CHEK2, rs2078486 (median OR(per allele) = 1.65; 95% CI = 1.21-2.25) and rs12951053 (median OR(per allele) = 1.65; 95% CI = 1.20-2.26) in TP53, rs411697 (median OR (rare homozygote) = 0.53; 95% CI = 0.35 - 0.79) in BACH1 and rs10131 (median OR( rare homozygote) = not estimable) in LIG4. The six most highly associated SNPs are either predicted to be functionally significant or are in LD with such a variant. The variants in TP53 were confirmed to be associated in a large follow-up study. CONCLUSIONS/SIGNIFICANCE: Based on our findings, further follow-up of the DNA repair and response pathways in a larger dataset is warranted to confirm these results.Item Open Access Bayesian Model Averaging in the M-Open Framework(Bayesian Theory and Applications, 2013) Clydec, Merlise; Iversen, Edwin SThis chapter presents a model averaging approach in the M-open setting using sample re-use methods to approximate the predictive distribution of future observations. It first reviews the standard M-closed Bayesian Model Averaging approach and decision-theoretic methods for producing inferences and decisions. It then reviews model selection from the M-complete and M-open perspectives, before formulating a Bayesian solution to model averaging in the M-open perspective. It constructs optimal weights for MOMA:M-open Model Averaging using a decision-theoretic framework, where models are treated as part of the ‘action space’ rather than unknown states of nature. Using ‘incompatible’ retrospective and prospective models for data from a case-control study, the chapter demonstrates that MOMA gives better predictive accuracy than the proxy models. It concludes with open questions and future directions.Item Open Access Bayesian Model Uncertainty and Prior Choice with Applications to Genetic Association Studies(2010) Wilson, Melanie AnnThe Bayesian approach to model selection allows for uncertainty in both model specific parameters and in the models themselves. Much of the recent Bayesian model uncertainty literature has focused on defining these prior distributions in an objective manner, providing conditions under which Bayes factors lead to the correct model selection, particularly in the situation where the number of variables, p, increases with the sample size, n. This is certainly the case in our area of motivation; the biological application of genetic association studies involving single nucleotide polymorphisms. While the most common approach to this problem has been to apply a marginal test to all genetic markers, we employ analytical strategies that improve upon these marginal methods by modeling the outcome variable as a function of a multivariate genetic profile using Bayesian variable selection. In doing so, we perform variable selection on a large number of correlated covariates within studies involving modest sample sizes.
In particular, we present an efficient Bayesian model search strategy that searches over the space of genetic markers and their genetic parametrization. The resulting method for Multilevel Inference of SNP Associations MISA, allows computation of multilevel posterior probabilities and Bayes factors at the global, gene and SNP level. We use simulated data sets to characterize MISA's statistical power, and show that MISA has higher power to detect association than standard procedures. Using data from the North Carolina Ovarian Cancer Study (NCOCS), MISA identifies variants that were not identified by standard methods and have been externally 'validated' in independent studies.
In the context of Bayesian model uncertainty for problems involving a large number of correlated covariates we characterize commonly used prior distributions on the model space and investigate their implicit multiplicity correction properties first in the extreme case where the model includes an increasing number of redundant covariates and then under the case of full rank design matrices. We provide conditions on the asymptotic (in n and p) behavior of the model space prior
required to achieve consistent selection of the global hypothesis of at least one associated variable in the analysis using global posterior probabilities (i.e. under 0-1 loss). In particular, under the assumption that the null model is true, we show that the commonly used uniform prior on the model space leads to inconsistent selection of the global hypothesis via global posterior probabilities (the posterior probability of at least one association goes to 1) when the rank of the design matrix is finite. In the full rank case, we also show inconsistency when p goes to infinity faster than the square root of n. Alternatively, we show that any model space prior such that the global prior odds of association increases at a rate slower than the square root of n results in consistent selection of the global hypothesis in terms of posterior probabilities.
Item Open Access Bayesian Statistical Models of Cell-Cycle Progression at Single-Cell and Population Levels(2014) Mayhew, Michael BenjaminCell division is a biological process fundamental to all life. One aspect of the process that is still under investigation is whether or not cells in a lineage are correlated in their cell-cycle progression. Data on cell-cycle progression is typically acquired either in lineages of single cells or in synchronized cell populations, and each source of data offers complementary information on cell division. To formally assess dependence in cell-cycle progression, I develop a hierarchical statistical model of single-cell measurements and extend a previously proposed model of population cell division in the budding yeast, Saccharomyces cerevisiae. Both models capture correlation and cell-to-cell heterogeneity in cell-cycle progression, and parameter inference is carried out in a fully Bayesian manner. The single-cell model is fit to three published time-lapse microscopy datasets and the population-based model is fit to simulated data for which the true model is known. Based on posterior inferences and formal model comparisons, the single-cell analysis demonstrates that budding yeast mother and daughter cells do not appear to correlate in their cell-cycle progression in two of the three experimental settings. In contrast, mother cells grown in a less preferred sugar source, glycerol/ethanol, did correlate in their rate of cell division in two successive cell cycles. Population model fitting to simulated data suggested that, under typical synchrony experimental conditions, population-based measurements of the cell-cycle were not informative for correlation in cell-cycle progression or heterogeneity in daughter-specific G1 phase progression.
Item Open Access Hosts of avian brood parasites have evolved egg signatures with elevated information content.(Proc Biol Sci, 2015-07-07) Caves, Eleanor M; Stevens, Martin; Iversen, Edwin S; Spottiswoode, Claire NHosts of brood-parasitic birds must distinguish their own eggs from parasitic mimics, or pay the cost of mistakenly raising a foreign chick. Egg discrimination is easier when different host females of the same species each lay visually distinctive eggs (egg 'signatures'), which helps to foil mimicry by parasites. Here, we ask whether brood parasitism is associated with lower levels of correlation between different egg traits in hosts, making individual host signatures more distinctive and informative. We used entropy as an index of the potential information content encoded by nine aspects of colour, pattern and luminance of eggs of different species in two African bird families (Cisticolidae parasitized by cuckoo finches Anomalospiza imberbis, and Ploceidae by diederik cuckoos Chrysococcyx caprius). Parasitized species showed consistently higher entropy in egg traits than did related, unparasitized species. Decomposing entropy into two variation components revealed that this was mainly driven by parasitized species having lower levels of correlation between different egg traits, rather than higher overall levels of variation in each individual egg trait. This suggests that irrespective of the constraints that might operate on individual egg traits, hosts can further improve their defensive 'signatures' by arranging suites of egg traits into unpredictable combinations.Item Open Access Maternal stress, preterm birth, and DNA methylation at imprint regulatory sequences in humans.(Genetics & epigenetics, 2014-01) Vidal, Adriana C; Benjamin Neelon, Sara E; Liu, Ying; Tuli, Abbas M; Fuemmeler, Bernard F; Hoyo, Cathrine; Murtha, Amy P; Huang, Zhiqing; Schildkraut, Joellen; Overcash, Francine; Kurtzberg, Joanne; Jirtle, Randy L; Iversen, Edwin S; Murphy, Susan KIn infants exposed to maternal stress in utero, phenotypic plasticity through epigenetic events may mechanistically explain increased risk of preterm birth (PTB), which confers increased risk for neurodevelopmental disorders, cardiovascular disease, and cancers in adulthood. We examined associations between prenatal maternal stress and PTB, evaluating the role of DNA methylation at imprint regulatory regions. We enrolled women from prenatal clinics in Durham, NC. Stress was measured in 537 women at 12 weeks of gestation using the Perceived Stress Scale. DNA methylation at differentially methylated regions (DMRs) associated with H19, IGF2, MEG3, MEST, SGCE/PEG10, PEG3, NNAT, and PLAGL1 was measured from peripheral and cord blood using bisulfite pyrosequencing in a sub-sample of 79 mother-infant pairs. We examined associations between PTB and stress and evaluated differences in DNA methylation at each DMR by stress. Maternal stress was not associated with PTB (OR = 0.98; 95% CI, 0.40-2.40; P = 0.96), after adjustment for maternal body mass index (BMI), income, and raised blood pressure. However, elevated stress was associated with higher infant DNA methylation at the MEST DMR (2.8% difference, P < 0.01) after adjusting for PTB. Maternal stress may be associated with epigenetic changes at MEST, a gene relevant to maternal care and obesity. Reduced prenatal stress may support the epigenomic profile of a healthy infant.Item Open Access Methylation variation at IGF2 differentially methylated regions and maternal folic acid use before and during pregnancy.(Epigenetics, 2011-07) Hoyo, Cathrine; Murtha, Amy P; Schildkraut, Joellen M; Jirtle, Randy L; Demark-Wahnefried, Wendy; Forman, Michele R; Iversen, Edwin S; Kurtzberg, Joanne; Overcash, Francine; Huang, Zhiqing; Murphy, Susan KFolic acid (FA) supplementation before and during pregnancy has been associated with decreased risk of neural tube defects although recent reports suggest it may also increase the risk of other chronic diseases. We evaluated exposure to maternal FA supplementation before and during pregnancy in relation to aberrant DNA methylation at two differentially methylated regions (DMRs) regulating Insulin-like Growth Factor 2 (IGF2) expression in infants. Aberrant methylation at these regions has been associated with IGF2 deregulation and increased susceptibility to several chronic diseases. Using a self-administered questionnaire, we assessed FA intake before and during pregnancy in 438 pregnant women. Pyrosequencing was used to measure methylation at two IGF2 DMRs in umbilical cord blood leukocytes. Mixed models were used to determine relationships between maternal FA supplementation before or during pregnancy and DNA methylation levels at birth. Average methylation at the H19 DMR was 61.2%. Compared to infants born to women reporting no FA intake before or during pregnancy, methylation levels at the H19 DMR decreased with increasing FA intake (2.8%, p=0.03, and 4.9%, p=0.04, for intake before and during pregnancy, respectively). This methylation decrease was most pronounced in male infants (p=0.01). Methylation alterations at the H19 DMR are likely an important mechanism by which FA risks and/or benefits are conferred in utero. Because stable methylation marks at DMRs regulating imprinted genes are acquired before gastrulation, they may serve as archives of early exposures with the potential to improve our understanding of developmental origins of adult disease.Item Restricted Risk of ovarian cancer and inherited variants in relapse-associated genes.(PLoS One, 2010-01-27) Peedicayil, Abraham; Vierkant, Robert A; Hartmann, Lynn C; Fridley, Brooke L; Fredericksen, Zachary S; White, Kristin L; Elliott, Elaine A; Phelan, Catherine M; Tsai, Ya-Yu; Berchuck, Andrew; Iversen, Edwin S; Couch, Fergus J; Peethamabaran, Prema; Larson, Melissa C; Kalli, Kimberly R; Kosel, Matthew L; Shridhar, Vijayalakshmi; Rider, David N; Liebow, Mark; Cunningham, Julie M; Schildkraut, Joellen M; Sellers, Thomas A; Goode, Ellen LBACKGROUND: We previously identified a panel of genes associated with outcome of ovarian cancer. The purpose of the current study was to assess whether variants in these genes correlated with ovarian cancer risk. METHODS AND FINDINGS: Women with and without invasive ovarian cancer (749 cases, 1,041 controls) were genotyped at 136 single nucleotide polymorphisms (SNPs) within 13 candidate genes. Risk was estimated for each SNP and for overall variation within each gene. At the gene-level, variation within MSL1 (male-specific lethal-1 homolog) was associated with risk of serous cancer (p = 0.03); haplotypes within PRPF31 (PRP31 pre-mRNA processing factor 31 homolog) were associated with risk of invasive disease (p = 0.03). MSL1 rs7211770 was associated with decreased risk of serous disease (OR 0.81, 95% CI 0.66-0.98; p = 0.03). SNPs in MFSD7, BTN3A3, ZNF200, PTPRS, and CCND1A were inversely associated with risk (p<0.05), and there was increased risk at HEXIM1 rs1053578 (p = 0.04, OR 1.40, 95% CI 1.02-1.91). CONCLUSIONS: Tumor studies can reveal novel genes worthy of follow-up for cancer susceptibility. Here, we found that inherited markers in the gene encoding MSL1, part of a complex that modifies the histone H4, may decrease risk of invasive serous ovarian cancer.Item Open Access Risk Prediction for Epithelial Ovarian Cancer in 11 United States-Based Case-Control Studies: Incorporation of Epidemiologic Risk Factors and 17 Confirmed Genetic Loci.(Am J Epidemiol, 2016-10-15) Clyde, Merlise A; Palmieri Weber, Rachel; Iversen, Edwin S; Poole, Elizabeth M; Doherty, Jennifer A; Goodman, Marc T; Ness, Roberta B; Risch, Harvey A; Rossing, Mary Anne; Terry, Kathryn L; Wentzensen, Nicolas; Whittemore, Alice S; Anton-Culver, Hoda; Bandera, Elisa V; Berchuck, Andrew; Carney, Michael E; Cramer, Daniel W; Cunningham, Julie M; Cushing-Haugen, Kara L; Edwards, Robert P; Fridley, Brooke L; Goode, Ellen L; Lurie, Galina; McGuire, Valerie; Modugno, Francesmary; Moysich, Kirsten B; Olson, Sara H; Pearce, Celeste Leigh; Pike, Malcolm C; Rothstein, Joseph H; Sellers, Thomas A; Sieh, Weiva; Stram, Daniel; Thompson, Pamela J; Vierkant, Robert A; Wicklund, Kristine G; Wu, Anna H; Ziogas, Argyrios; Tworoger, Shelley S; Schildkraut, Joellen MPreviously developed models for predicting absolute risk of invasive epithelial ovarian cancer have included a limited number of risk factors and have had low discriminatory power (area under the receiver operating characteristic curve (AUC) < 0.60). Because of this, we developed and internally validated a relative risk prediction model that incorporates 17 established epidemiologic risk factors and 17 genome-wide significant single nucleotide polymorphisms (SNPs) using data from 11 case-control studies in the United States (5,793 cases; 9,512 controls) from the Ovarian Cancer Association Consortium (data accrued from 1992 to 2010). We developed a hierarchical logistic regression model for predicting case-control status that included imputation of missing data. We randomly divided the data into an 80% training sample and used the remaining 20% for model evaluation. The AUC for the full model was 0.664. A reduced model without SNPs performed similarly (AUC = 0.649). Both models performed better than a baseline model that included age and study site only (AUC = 0.563). The best predictive power was obtained in the full model among women younger than 50 years of age (AUC = 0.714); however, the addition of SNPs increased the AUC the most for women older than 50 years of age (AUC = 0.638 vs. 0.616). Adapting this improved model to estimate absolute risk and evaluating it in prospective data sets is warranted.