Browsing by Subject "Genome, Human"
Now showing 1 - 20 of 30
- Results Per Page
- Sort Options
Item Open Access 29 mammalian genomes reveal novel exaptations of mobile elements for likely regulatory functions in the human genome.(PloS one, 2012-01) Lowe, Craig B; Haussler, DavidRecent research supports the view that changes in gene regulation, as opposed to changes in the genes themselves, play a significant role in morphological evolution. Gene regulation is largely dependent on transcription factor binding sites. Researchers are now able to use the available 29 mammalian genomes to measure selective constraint at the level of binding sites. This detailed map of constraint suggests that mammalian genomes co-opt fragments of mobile elements to act as gene regulatory sequence on a large scale. In the human genome we detect over 280,000 putative regulatory elements, totaling approximately 7 Mb of sequence, that originated as mobile element insertions. These putative regulatory regions are conserved non-exonic elements (CNEEs), which show considerable cross-species constraint and signatures of continued negative selection in humans, yet do not appear in a known mature transcript. These putative regulatory elements were co-opted from SINE, LINE, LTR and DNA transposon insertions. We demonstrate that at least 11%, and an estimated 20%, of gene regulatory sequence in the human genome showing cross-species conservation was co-opted from mobile elements. The location in the genome of CNEEs co-opted from mobile elements closely resembles that of CNEEs in general, except in the centers of the largest gene deserts where recognizable co-option events are relatively rare. We find that regions of certain mobile element insertions are more likely to be held under purifying selection than others. In particular, we show 6 examples where paralogous instances of an often co-opted mobile element region define a sequence motif that closely matches a transcription factor's binding profile.Item Open Access A cellular genome-wide association study reveals human variation in microtubule stability and a role in inflammatory cell death.(Mol Biol Cell, 2014-01) Salinas, Raul E; Ogohara, Cassandra; Thomas, Monica I; Shukla, Kajal P; Miller, Samuel I; Ko, Dennis CPyroptosis is proinflammatory cell death that occurs in response to certain microbes. Activation of the protease caspase-1 by molecular platforms called inflammasomes is required for pyroptosis. We performed a cellular genome-wide association study (GWAS) using Salmonella typhimurium infection of human lymphoblastoid cell lines as a means of dissecting the genetic architecture of susceptibility to pyroptosis and identifying unknown regulatory mechanisms. Cellular GWAS revealed that a common human genetic difference that regulates pyroptosis also alters microtubule stability. An intergenic single-nucleotide polymorphism on chromosome 18 is associated with decreased pyroptosis and increased expression of TUBB6 (tubulin, β 6 class V). TUBB6 is unique among tubulin isoforms in that its overexpression can completely disrupt the microtubule network. Cells from individuals with higher levels of TUBB6 expression have lower microtubule stability and less pyroptosis. Reducing TUBB6 expression or stabilizing microtubules pharmacologically with paclitaxel (Taxol) increases pyroptosis without affecting the other major readout of caspase-1 activation, interleukin-1β secretion. The results reveal a new role for microtubules and possibly specific tubulin isoforms in the execution of pyroptosis. Furthermore, the finding that there is common diversity in TUBB6 expression and microtubule stability could have broad consequences for other microtubule-dependent phenotypes, diseases, and pharmacological responses.Item Open Access A genome-wide association study of variants associated with acquisition of Staphylococcus aureus bacteremia in a healthcare setting.(BMC Infect Dis, 2014-02-13) Nelson, Charlotte L; Pelak, Kimberly; Podgoreanu, Mihai V; Ahn, Sun Hee; Scott, William K; Allen, Andrew S; Cowell, Lindsay G; Rude, Thomas H; Zhang, Yurong; Tong, Amy; Ruffin, Felicia; Sharma-Kuinkel, Batu K; Fowler, Vance GBACKGROUND: Humans vary in their susceptibility to acquiring Staphylococcus aureus infection, and research suggests that there is a genetic basis for this variability. Several recent genome-wide association studies (GWAS) have identified variants that may affect susceptibility to infectious diseases, demonstrating the potential value of GWAS in this arena. METHODS: We conducted a GWAS to identify common variants associated with acquisition of S. aureus bacteremia (SAB) resulting from healthcare contact. We performed a logistic regression analysis to compare patients with healthcare contact who developed SAB (361 cases) to patients with healthcare contact in the same hospital who did not develop SAB (699 controls), testing 542,410 SNPs and adjusting for age (by decade), sex, and 6 significant principal components from our EIGENSTRAT analysis. Additionally, we evaluated the joint effect of the host and pathogen genomes in association with severity of SAB infection via logistic regression, including an interaction of host SNP with bacterial genotype, and adjusting for age (by decade), sex, the 6 significant principal components, and dialysis status. Bonferroni corrections were applied in both analyses to control for multiple comparisons. RESULTS: Ours is the first study that has attempted to evaluate the entire human genome for variants potentially involved in the acquisition or severity of SAB. Although this study identified no common variant of large effect size to have genome-wide significance for association with either the risk of acquiring SAB or severity of SAB, the variant (rs2043436) most significantly associated with severity of infection is located in a biologically plausible candidate gene (CDON, a member of the immunoglobulin family) and may warrant further study. CONCLUSIONS: The genetic architecture underlying SAB is likely to be complex. Future investigations using larger samples, narrowed phenotypes, and advances in both genotyping and analytical methodologies will be important tools for identifying causative variants for this common and serious cause of healthcare-associated infection.Item Open Access A high-resolution map of human evolutionary constraint using 29 mammals.(Nature, 2011-10-12) Lindblad-Toh, Kerstin; Garber, Manuel; Zuk, Or; Lin, Michael F; Parker, Brian J; Washietl, Stefan; Kheradpour, Pouya; Ernst, Jason; Jordan, Gregory; Mauceli, Evan; Ward, Lucas D; Lowe, Craig B; Holloway, Alisha K; Clamp, Michele; Gnerre, Sante; Alföldi, Jessica; Beal, Kathryn; Chang, Jean; Clawson, Hiram; Cuff, James; Di Palma, Federica; Fitzgerald, Stephen; Flicek, Paul; Guttman, Mitchell; Hubisz, Melissa J; Jaffe, David B; Jungreis, Irwin; Kent, W James; Kostka, Dennis; Lara, Marcia; Martins, Andre L; Massingham, Tim; Moltke, Ida; Raney, Brian J; Rasmussen, Matthew D; Robinson, Jim; Stark, Alexander; Vilella, Albert J; Wen, Jiayu; Xie, Xiaohui; Zody, Michael C; Broad Institute Sequencing Platform and Whole Genome Assembly Team; Baldwin, Jen; Bloom, Toby; Chin, Chee Whye; Heiman, Dave; Nicol, Robert; Nusbaum, Chad; Young, Sarah; Wilkinson, Jane; Worley, Kim C; Kovar, Christie L; Muzny, Donna M; Gibbs, Richard A; Baylor College of Medicine Human Genome Sequencing Center Sequencing Team; Cree, Andrew; Dihn, Huyen H; Fowler, Gerald; Jhangiani, Shalili; Joshi, Vandita; Lee, Sandra; Lewis, Lora R; Nazareth, Lynne V; Okwuonu, Geoffrey; Santibanez, Jireh; Warren, Wesley C; Mardis, Elaine R; Weinstock, George M; Wilson, Richard K; Genome Institute at Washington University; Delehaunty, Kim; Dooling, David; Fronik, Catrina; Fulton, Lucinda; Fulton, Bob; Graves, Tina; Minx, Patrick; Sodergren, Erica; Birney, Ewan; Margulies, Elliott H; Herrero, Javier; Green, Eric D; Haussler, David; Siepel, Adam; Goldman, Nick; Pollard, Katherine S; Pedersen, Jakob S; Lander, Eric S; Kellis, ManolisThe comparison of related genomes has emerged as a powerful lens for genome interpretation. Here we report the sequencing and comparative analysis of 29 eutherian genomes. We confirm that at least 5.5% of the human genome has undergone purifying selection, and locate constrained elements covering ∼4.2% of the genome. We use evolutionary signatures and comparisons with experimental data sets to suggest candidate functions for ∼60% of constrained bases. These elements reveal a small number of new coding exons, candidate stop codon readthrough events and over 10,000 regions of overlapping synonymous constraint within protein-coding exons. We find 220 candidate RNA structural families, and nearly a million elements overlapping potential promoter, enhancer and insulator regions. We report specific amino acid residues that have undergone positive selection, 280,000 non-coding elements exapted from mobile elements and more than 1,000 primate- and human-accelerated elements. Overlap with disease-associated variants indicates that our findings will be relevant for studies of human biology, health and disease.Item Open Access Adaptive sequence divergence forged new neurodevelopmental enhancers in humans.(Cell, 2022-11) Mangan, Riley J; Alsina, Fernando C; Mosti, Federica; Sotelo-Fonseca, Jesús Emiliano; Snellings, Daniel A; Au, Eric H; Carvalho, Juliana; Sathyan, Laya; Johnson, Graham D; Reddy, Timothy E; Silver, Debra L; Lowe, Craig BSearches for the genetic underpinnings of uniquely human traits have focused on human-specific divergence in conserved genomic regions, which reflects adaptive modifications of existing functional elements. However, the study of conserved regions excludes functional elements that descended from previously neutral regions. Here, we demonstrate that the fastest-evolved regions of the human genome, which we term "human ancestor quickly evolved regions" (HAQERs), rapidly diverged in an episodic burst of directional positive selection prior to the human-Neanderthal split, before transitioning to constraint within hominins. HAQERs are enriched for bivalent chromatin states, particularly in gastrointestinal and neurodevelopmental tissues, and genetic variants linked to neurodevelopmental disease. We developed a multiplex, single-cell in vivo enhancer assay to discover that rapid sequence divergence in HAQERs generated hominin-unique enhancers in the developing cerebral cortex. We propose that a lack of pleiotropic constraints and elevated mutation rates poised HAQERs for rapid adaptation and subsequent susceptibility to disease.Item Open Access Alpha satellite DNA biology: finding function in the recesses of the genome.(Chromosome research : an international journal on the molecular, supramolecular and evolutionary aspects of chromosome biology, 2018-09) McNulty, Shannon M; Sullivan, Beth ARepetitive DNA, formerly referred to by the misnomer "junk DNA," comprises a majority of the human genome. One class of this DNA, alpha satellite, comprises up to 10% of the genome. Alpha satellite is enriched at all human centromere regions and is competent for de novo centromere assembly. Because of the highly repetitive nature of alpha satellite, it has been difficult to achieve genome assemblies at centromeres using traditional next-generation sequencing approaches, and thus, centromeres represent gaps in the current human genome assembly. Moreover, alpha satellite DNA is transcribed into repetitive noncoding RNA and contributes to a large portion of the transcriptome. Recent efforts to characterize these transcripts and their function have uncovered pivotal roles for satellite RNA in genome stability, including silencing "selfish" DNA elements and recruiting centromere and kinetochore proteins. This review will describe the genomic and epigenetic features of alpha satellite DNA, discuss recent findings of noncoding transcripts produced from distinct alpha satellite arrays, and address current progress in the functional understanding of this oft-neglected repetitive sequence. We will discuss unique challenges of studying human satellite DNAs and RNAs and point toward new technologies that will continue to advance our understanding of this largely untapped portion of the genome.Item Open Access An Atlas of Genetic Variation Linking Pathogen-Induced Cellular Traits to Human Disease.(Cell host & microbe, 2018-08) Wang, Liuyang; Pittman, Kelly J; Barker, Jeffrey R; Salinas, Raul E; Stanaway, Ian B; Williams, Graham D; Carroll, Robert J; Balmat, Tom; Ingham, Andy; Gopalakrishnan, Anusha M; Gibbs, Kyle D; Antonia, Alejandro L; eMERGE Network; Heitman, Joseph; Lee, Soo Chan; Jarvik, Gail P; Denny, Joshua C; Horner, Stacy M; DeLong, Mark R; Valdivia, Raphael H; Crosslin, David R; Ko, Dennis CPathogens have been a strong driving force for natural selection. Therefore, understanding how human genetic differences impact infection-related cellular traits can mechanistically link genetic variation to disease susceptibility. Here we report the Hi-HOST Phenome Project (H2P2): a catalog of cellular genome-wide association studies (GWAS) comprising 79 infection-related phenotypes in response to 8 pathogens in 528 lymphoblastoid cell lines. Seventeen loci surpass genome-wide significance for infection-associated phenotypes ranging from pathogen replication to cytokine production. We combined H2P2 with clinical association data from patients to identify a SNP near CXCL10 as a risk factor for inflammatory bowel disease. A SNP in the transcriptional repressor ZBTB20 demonstrated pleiotropy, likely through suppression of multiple target genes, and was associated with viral hepatitis. These data are available on a web portal to facilitate interpreting human genome variation through the lens of cell biology and should serve as a rich resource for the research community.Item Open Access Can Genetics Predict Response to Complex Behavioral Interventions? Evidence from a Genetic Analysis of the Fast Track Randomized Control Trial.(J Policy Anal Manage, 2015) Albert, Dustin; Belsky, Daniel W; Crowley, D Max; Latendresse, Shawn J; Aliev, Fazil; Riley, Brien; Group, Conduct Problems Prevention Research; Dick, Danielle M; Dodge, Kenneth AEarly interventions are a preferred method for addressing behavioral problems in high-risk children, but often have only modest effects. Identifying sources of variation in intervention effects can suggest means to improve efficiency. One potential source of such variation is the genome. We conducted a genetic analysis of the Fast Track randomized control trial, a 10-year-long intervention to prevent high-risk kindergarteners from developing adult externalizing problems including substance abuse and antisocial behavior. We tested whether variants of the glucocorticoid receptor gene NR3C1 were associated with differences in response to the Fast Track intervention. We found that in European-American children, a variant of NR3C1 identified by the single-nucleotide polymorphism rs10482672 was associated with increased risk for externalizing psychopathology in control group children and decreased risk for externalizing psychopathology in intervention group children. Variation in NR3C1 measured in this study was not associated with differential intervention response in African-American children. We discuss implications for efforts to prevent externalizing problems in high-risk children and for public policy in the genomic era.Item Open Access Detecting structure of haplotypes and local ancestry.(Genetics, 2014-03) Guan, YongtaoWe present a two-layer hidden Markov model to detect the structure of haplotypes for unrelated individuals. This allows us to model two scales of linkage disequilibrium (one within a group of haplotypes and one between groups), thereby taking advantage of rich haplotype information to infer local ancestry of admixed individuals. Our method outperforms competing state-of-the-art methods, particularly for regions of small ancestral track lengths. Applying our method to Mexican samples in HapMap3, we found two regions on chromosomes 6 and 8 that show significant departure of local ancestry from the genome-wide average. A software package implementing the methods described in this article is freely available at http://bcm.edu/cnrc/mcmcmc.Item Open Access Endangered species hold clues to human evolution.(The Journal of heredity, 2010-07) Lowe, Craig B; Bejerano, Gill; Salama, Sofie R; Haussler, DavidWe report that 18 conserved, and by extension functional, elements in the human genome are the result of retroposon insertions that are evolving under purifying selection in mammals. We show evidence that 1 of the 18 elements regulates the expression of ASXL3 during development by encoding an alternatively spliced exon that causes nonsense-mediated decay of the transcript. The retroposon that gave rise to these functional elements was quickly inactivated in the mammalian ancestor, and all traces of it have been lost due to neutral decay. However, the tuatara has maintained a near-ancestral version of this retroposon in its extant genome, which allows us to connect the 18 human elements to the evolutionary events that created them. We propose that conservation efforts over more than 100 years may not have only prevented the tuatara from going extinct but could have preserved our ability to understand the evolutionary history of functional elements in the human genome. Through simulations, we argue that species with historically low population sizes are more likely to harbor ancient mobile elements for long periods of time and in near-ancestral states, making these species indispensable in understanding the evolutionary origin of functional elements in the human genome.Item Restricted Evidence-ranked motif identification.(Genome Biol, 2010) Georgiev, S; Boyle, AP; Jayasurya, K; Ding, X; Mukherjee, Sayan; Ohler, UwecERMIT is a computationally efficient motif discovery tool based on analyzing genome-wide quantitative regulatory evidence. Instead of pre-selecting promising candidate sequences, it utilizes information across all sequence regions to search for high-scoring motifs. We apply cERMIT on a range of direct binding and overexpression datasets; it substantially outperforms state-of-the-art approaches on curated ChIP-chip datasets, and easily scales to current mammalian ChIP-seq experiments with data on thousands of non-coding regions.Item Open Access Generalized admixture mapping for complex traits.(G3 (Bethesda), 2013-07-08) Zhu, Bin; Ashley-Koch, Allison E; Dunson, David BAdmixture mapping is a popular tool to identify regions of the genome associated with traits in a recently admixed population. Existing methods have been developed primarily for identification of a single locus influencing a dichotomous trait within a case-control study design. We propose a generalized admixture mapping (GLEAM) approach, a flexible and powerful regression method for both quantitative and qualitative traits, which is able to test for association between the trait and local ancestries in multiple loci simultaneously and adjust for covariates. The new method is based on the generalized linear model and uses a quadratic normal moment prior to incorporate admixture prior information. Through simulation, we demonstrate that GLEAM achieves lower type I error rate and higher power than ANCESTRYMAP both for qualitative traits and more significantly for quantitative traits. We applied GLEAM to genome-wide SNP data from the Illumina African American panel derived from a cohort of black women participating in the Healthy Pregnancy, Healthy Baby study and identified a locus on chromosome 2 associated with the averaged maternal mean arterial pressure during 24 to 28 weeks of pregnancy.Item Restricted Genome-wide association study of Lp-PLA(2) activity and mass in the Framingham Heart Study.(PLoS Genet, 2010-04-29) Suchindran, Sunil; Rivedal, David; Guyton, John R; Milledge, Tom; Gao, Xiaoyi; Benjamin, Ashlee; Rowell, Jennifer; Ginsburg, Geoffrey S; McCarthy, Jeanette JLipoprotein-associated phospholipase A(2) (Lp-PLA(2)) is an emerging risk factor and therapeutic target for cardiovascular disease. The activity and mass of this enzyme are heritable traits, but major genetic determinants have not been explored in a systematic, genome-wide fashion. We carried out a genome-wide association study of Lp-PLA(2) activity and mass in 6,668 Caucasian subjects from the population-based Framingham Heart Study. Clinical data and genotypes from the Affymetrix 550K SNP array were obtained from the open-access Framingham SHARe project. Each polymorphism that passed quality control was tested for associations with Lp-PLA(2) activity and mass using linear mixed models implemented in the R statistical package, accounting for familial correlations, and controlling for age, sex, smoking, lipid-lowering-medication use, and cohort. For Lp-PLA(2) activity, polymorphisms at four independent loci reached genome-wide significance, including the APOE/APOC1 region on chromosome 19 (p = 6 x 10(-24)); CELSR2/PSRC1 on chromosome 1 (p = 3 x 10(-15)); SCARB1 on chromosome 12 (p = 1x10(-8)) and ZNF259/BUD13 in the APOA5/APOA1 gene region on chromosome 11 (p = 4 x 10(-8)). All of these remained significant after accounting for associations with LDL cholesterol, HDL cholesterol, or triglycerides. For Lp-PLA(2) mass, 12 SNPs achieved genome-wide significance, all clustering in a region on chromosome 6p12.3 near the PLA2G7 gene. Our analyses demonstrate that genetic polymorphisms may contribute to inter-individual variation in Lp-PLA(2) activity and mass.Item Open Access Genome-wide gene-environment interaction analysis for asbestos exposure in lung cancer susceptibility.(Carcinogenesis, 2012-08) Wei, Sheng; Wang, Li-E; McHugh, Michelle K; Han, Younghun; Xiong, Momiao; Amos, Christopher I; Spitz, Margaret R; Wei, Qingyi WeiAsbestos exposure is a known risk factor for lung cancer. Although recent genome-wide association studies (GWASs) have identified some novel loci for lung cancer risk, few addressed genome-wide gene-environment interactions. To determine gene-asbestos interactions in lung cancer risk, we conducted genome-wide gene-environment interaction analyses at levels of single nucleotide polymorphisms (SNPs), genes and pathways, using our published Texas lung cancer GWAS dataset. This dataset included 317 498 SNPs from 1154 lung cancer cases and 1137 cancer-free controls. The initial SNP-level P-values for interactions between genetic variants and self-reported asbestos exposure were estimated by unconditional logistic regression models with adjustment for age, sex, smoking status and pack-years. The P-value for the most significant SNP rs13383928 was 2.17×10(-6), which did not reach the genome-wide statistical significance. Using a versatile gene-based test approach, we found that the top significant gene was C7orf54, located on 7q32.1 (P = 8.90×10(-5)). Interestingly, most of the other significant genes were located on 11q13. When we used an improved gene-set-enrichment analysis approach, we found that the Fas signaling pathway and the antigen processing and presentation pathway were most significant (nominal P < 0.001; false discovery rate < 0.05) among 250 pathways containing 17 572 genes. We believe that our analysis is a pilot study that first describes the gene-asbestos interaction in lung cancer risk at levels of SNPs, genes and pathways. Our findings suggest that immune function regulation-related pathways may be mechanistically involved in asbestos-associated lung cancer risk.Item Open Access Genome-wide linkage analysis for human longevity: Genetics of Healthy Aging Study.(Aging Cell, 2013-04) Beekman, Marian; Blanché, Hélène; Perola, Markus; Hervonen, Anti; Bezrukov, Vladyslav; Sikora, Ewa; Flachsbart, Friederike; Christiansen, Lene; De Craen, Anton JM; Kirkwood, Tom BL; Rea, Irene Maeve; Poulain, Michel; Robine, Jean-Marie; Valensin, Silvana; Stazi, Maria Antonietta; Passarino, Giuseppe; Deiana, Luca; Gonos, Efstathios S; Paternoster, Lavinia; Sørensen, Thorkild IA; Tan, Qihua; Helmer, Quinta; van den Akker, Erik B; Deelen, Joris; Martella, Francesca; Cordell, Heather J; Ayers, Kristin L; Vaupel, James W; Törnwall, Outi; Johnson, Thomas E; Schreiber, Stefan; Lathrop, Mark; Skytthe, Axel; Westendorp, Rudi GJ; Christensen, Kaare; Gampe, Jutta; Nebel, Almut; Houwing-Duistermaat, Jeanine J; Slagboom, Pieternella Eline; Franceschi, Claudio; GEHA consortiumClear evidence exists for heritability of human longevity, and much interest is focused on identifying genes associated with longer lives. To identify such longevity alleles, we performed the largest genome-wide linkage scan thus far reported. Linkage analyses included 2118 nonagenarian Caucasian sibling pairs that have been enrolled in 15 study centers of 11 European countries as part of the Genetics of Healthy Aging (GEHA) project. In the joint linkage analyses, we observed four regions that show linkage with longevity; chromosome 14q11.2 (LOD = 3.47), chromosome 17q12-q22 (LOD = 2.95), chromosome 19p13.3-p13.11 (LOD = 3.76), and chromosome 19q13.11-q13.32 (LOD = 3.57). To fine map these regions linked to longevity, we performed association analysis using GWAS data in a subgroup of 1228 unrelated nonagenarian and 1907 geographically matched controls. Using a fixed-effect meta-analysis approach, rs4420638 at the TOMM40/APOE/APOC1 gene locus showed significant association with longevity (P-value = 9.6 × 10(-8) ). By combined modeling of linkage and association, we showed that association of longevity with APOEε4 and APOEε2 alleles explain the linkage at 19q13.11-q13.32 with P-value = 0.02 and P-value = 1.0 × 10(-5) , respectively. In the largest linkage scan thus far performed for human familial longevity, we confirm that the APOE locus is a longevity gene and that additional longevity loci may be identified at 14q11.2, 17q12-q22, and 19p13.3-p13.11. As the latter linkage results are not explained by common variants, we suggest that rare variants play an important role in human familial longevity.Item Open Access Genomic analysis of diffuse intrinsic pontine gliomas identifies three molecular subgroups and recurrent activating ACVR1 mutations.(Nat Genet, 2014-05) Buczkowicz, Pawel; Hoeman, Christine; Rakopoulos, Patricia; Pajovic, Sanja; Letourneau, Louis; Dzamba, Misko; Morrison, Andrew; Lewis, Peter; Bouffet, Eric; Bartels, Ute; Zuccaro, Jennifer; Agnihotri, Sameer; Ryall, Scott; Barszczyk, Mark; Chornenkyy, Yevgen; Bourgey, Mathieu; Bourque, Guillaume; Montpetit, Alexandre; Cordero, Francisco; Castelo-Branco, Pedro; Mangerel, Joshua; Tabori, Uri; Ho, King Ching; Huang, Annie; Taylor, Kathryn R; Mackay, Alan; Bendel, Anne E; Nazarian, Javad; Fangusaro, Jason R; Karajannis, Matthias A; Zagzag, David; Foreman, Nicholas K; Donson, Andrew; Hegert, Julia V; Smith, Amy; Chan, Jennifer; Lafay-Cousin, Lucy; Dunn, Sandra; Hukin, Juliette; Dunham, Chris; Scheinemann, Katrin; Michaud, Jean; Zelcer, Shayna; Ramsay, David; Cain, Jason; Brennan, Cameron; Souweidane, Mark M; Jones, Chris; Allis, C David; Brudno, Michael; Becher, Oren; Hawkins, CynthiaDiffuse intrinsic pontine glioma (DIPG) is a fatal brain cancer that arises in the brainstem of children, with no effective treatment and near 100% fatality. The failure of most therapies can be attributed to the delicate location of these tumors and to the selection of therapies on the basis of assumptions that DIPGs are molecularly similar to adult disease. Recent studies have unraveled the unique genetic makeup of this brain cancer, with nearly 80% found to harbor a p.Lys27Met histone H3.3 or p.Lys27Met histone H3.1 alteration. However, DIPGs are still thought of as one disease, with limited understanding of the genetic drivers of these tumors. To understand what drives DIPGs, we integrated whole-genome sequencing with methylation, expression and copy number profiling, discovering that DIPGs comprise three molecularly distinct subgroups (H3-K27M, silent and MYCN) and uncovering a new recurrent activating mutation affecting the activin receptor gene ACVR1 in 20% of DIPGs. Mutations in ACVR1 were constitutively activating, leading to SMAD phosphorylation and increased expression of the downstream activin signaling targets ID1 and ID2. Our results highlight distinct molecular subgroups and novel therapeutic targets for this incurable pediatric cancer.Item Open Access Human genetic risk of treatment with antiviral nucleoside analog drugs that induce lethal mutagenesis: The special case of molnupiravir.(Environmental and molecular mutagenesis, 2022-01) Waters, Michael D; Warren, Stafford; Hughes, Claude; Lewis, Philip; Zhang, FengyuThis review considers antiviral nucleoside analog drugs, including ribavirin, favipiravir, and molnupiravir, which induce genome error catastrophe in SARS-CoV or SARS-CoV-2 via lethal mutagenesis as a mode of action. In vitro data indicate that molnupiravir may be 100 times more potent as an antiviral agent than ribavirin or favipiravir. Molnupiravir has recently demonstrated efficacy in a phase 3 clinical trial. Because of its anticipated global use, its relative potency, and the reported in vitro "host" cell mutagenicity of its active principle, β-d-N4-hydroxycytidine, we have reviewed the development of molnupiravir and its genotoxicity safety evaluation, as well as the genotoxicity profiles of three congeners, that is, ribavirin, favipiravir, and 5-(2-chloroethyl)-2'-deoxyuridine. We consider the potential genetic risks of molnupiravir on the basis of all available information and focus on the need for additional human genotoxicity data and follow-up in patients treated with molnupiravir and similar drugs. Such human data are especially relevant for antiviral NAs that have the potential of permanently modifying the genomes of treated patients and/or causing human teratogenicity or embryotoxicity. We conclude that the results of preclinical genotoxicity studies and phase 1 human clinical safety, tolerability, and pharmacokinetics are critical components of drug safety assessments and sentinels of unanticipated adverse health effects. We provide our rationale for performing more thorough genotoxicity testing prior to and within phase 1 clinical trials, including human PIG-A and error corrected next generation sequencing (duplex sequencing) studies in DNA and mitochondrial DNA of patients treated with antiviral NAs that induce genome error catastrophe via lethal mutagenesis.Item Open Access Human genomic regions with exceptionally high levels of population differentiation identified from 911 whole-genome sequences.(Genome biology, 2014-06-30) Colonna, Vincenza; Ayub, Qasim; Chen, Yuan; Pagani, Luca; Luisi, Pierre; Pybus, Marc; Garrison, Erik; Xue, Yali; Tyler-Smith, Chris; 1000 Genomes Project Consortium; Abecasis, Goncalo R; Auton, Adam; Brooks, Lisa D; DePristo, Mark A; Durbin, Richard M; Handsaker, Robert E; Kang, Hyun Min; Marth, Gabor T; McVean, Gil ABACKGROUND:Population differentiation has proved to be effective for identifying loci under geographically localized positive selection, and has the potential to identify loci subject to balancing selection. We have previously investigated the pattern of genetic differentiation among human populations at 36.8 million genomic variants to identify sites in the genome showing high frequency differences. Here, we extend this dataset to include additional variants, survey sites with low levels of differentiation, and evaluate the extent to which highly differentiated sites are likely to result from selective or other processes. RESULTS:We demonstrate that while sites with low differentiation represent sampling effects rather than balancing selection, sites showing extremely high population differentiation are enriched for positive selection events and that one half may be the result of classic selective sweeps. Among these, we rediscover known examples, where we actually identify the established functional SNP, and discover novel examples including the genes ABCA12, CALD1 and ZNF804, which we speculate may be linked to adaptations in skin, calcium metabolism and defense, respectively. CONCLUSIONS:We identify known and many novel candidate regions for geographically restricted positive selection, and suggest several directions for further research.Item Open Access Identification of Chiari Type I Malformation subtypes using whole genome expression profiles and cranial base morphometrics.(BMC medical genomics, 2014-06) Markunas, Christina A; Lock, Eric; Soldano, Karen; Cope, Heidi; Ding, Chien-Kuang C; Enterline, David S; Grant, Gerald; Fuchs, Herbert; Ashley-Koch, Allison E; Gregory, Simon GBackground
Chiari Type I Malformation (CMI) is characterized by herniation of the cerebellar tonsils through the foramen magnum at the base of the skull, resulting in significant neurologic morbidity. As CMI patients display a high degree of clinical variability and multiple mechanisms have been proposed for tonsillar herniation, it is hypothesized that this heterogeneous disorder is due to multiple genetic and environmental factors. The purpose of the present study was to gain a better understanding of what factors contribute to this heterogeneity by using an unsupervised statistical approach to define disease subtypes within a case-only pediatric population.Methods
A collection of forty-four pediatric CMI patients were ascertained to identify disease subtypes using whole genome expression profiles generated from patient blood and dura mater tissue samples, and radiological data consisting of posterior fossa (PF) morphometrics. Sparse k-means clustering and an extension to accommodate multiple data sources were used to cluster patients into more homogeneous groups using biological and radiological data both individually and collectively.Results
All clustering analyses resulted in the significant identification of patient classes, with the pure biological classes derived from patient blood and dura mater samples demonstrating the strongest evidence. Those patient classes were further characterized by identifying enriched biological pathways, as well as correlated cranial base morphological and clinical traits.Conclusions
Our results implicate several strong biological candidates warranting further investigation from the dura expression analysis and also identified a blood gene expression profile corresponding to a global down-regulation in protein synthesis.Item Open Access Identification of cis-suppression of human disease mutations by comparative genomics.(Nature, 2015-08) Jordan, Daniel M; Frangakis, Stephan G; Golzio, Christelle; Cassa, Christopher A; Kurtzberg, Joanne; Task Force for Neonatal Genomics; Davis, Erica E; Sunyaev, Shamil R; Katsanis, NicholasPatterns of amino acid conservation have served as a tool for understanding protein evolution. The same principles have also found broad application in human genomics, driven by the need to interpret the pathogenic potential of variants in patients. Here we performed a systematic comparative genomics analysis of human disease-causing missense variants. We found that an appreciable fraction of disease-causing alleles are fixed in the genomes of other species, suggesting a role for genomic context. We developed a model of genetic interactions that predicts most of these to be simple pairwise compensations. Functional testing of this model on two known human disease genes revealed discrete cis amino acid residues that, although benign on their own, could rescue the human mutations in vivo. This approach was also applied to ab initio gene discovery to support the identification of a de novo disease driver in BTG2 that is subject to protective cis-modification in more than 50 species. Finally, on the basis of our data and models, we developed a computational tool to predict candidate residues subject to compensation. Taken together, our data highlight the importance of cis-genomic context as a contributor to protein evolution; they provide an insight into the complexity of allele effect on phenotype; and they are likely to assist methods for predicting allele pathogenicity.