Detecting differential copy number variation between groups of samples

Abstract

© 2018 Lowe et al. We present a method to detect copy number variants (CNVs) that are differentially present between two groups of sequenced samples. We use a finite-state transducer where the emitted read depth is conditioned on the mappability and GC-content of all reads that occur at a given base position. In this model, the read depth within a region is a mixture of binomials, which in simulations matches the read depth more closely than the often-used negative binomial distribution. The method analyzes all samples simultaneously, preserving uncertainty as to the breakpoints and magnitude of CNVs present in an individual when it identifies CNVs differentially present between the two groups. We apply this method to identify CNVs that are recurrently associated with postglacial adaptation of marine threespine stickleback (Gasterosteus aculeatus) to freshwater. We identify 6664 regions of the stickleback genome, totaling 1.7 Mbp, which show consistent copy number differences between marine and freshwater populations. These deletions and duplications affect both protein-coding genes and cis-regulatory elements, including a noncoding intronic telencephalon enhancer of DCHS1. The functions of the genes near or included within the 6664 CNVs are enriched for immunity and muscle development, as well as head and limb morphology. Although freshwater stickleback have repeatedly evolved from marine populations, we show that freshwater stickleback also act as reservoirs for ancient ancestral sequences that are highly conserved among distantly related teleosts, but largely missing from marine stickleback due to recent selective sweeps in marine populations.

Department

Description

Provenance

Subjects

Citation

Published Version (Please cite this version)

10.1101/gr.206938.116

Publication Info

Lowe, Craig B, Nicelio Sanchez-Luege, Timothy R Howes, Shannon D Brady, Rhea R Daugherty, Felicity C Jones, Michael A Bell, David M Kingsley, et al. (2018). Detecting differential copy number variation between groups of samples. Genome Research, 28(2). pp. 256–265. 10.1101/gr.206938.116 Retrieved from https://hdl.handle.net/10161/17410.

This is constructed from limited available data and may be imprecise. To cite this article, please review & use the official citation provided by the journal.

Scholars@Duke

Lowe

Craig Lowe

Assistant Professor of Molecular Genetics and Microbiology

Craig Lowe is an Assistant Professor in the Department of Molecular Genetics and Microbiology.  His research interests are in understanding how traits and characteristics of humans, and other vertebrates, are encoded in their genomes.  He is especially focused on adaptations and disease susceptibilities that are unique to humans.  To address these questions, Craig uses both computational and experimental approaches.  Craig's recent research has been on differences in how genes are regulated between species, or between different individuals within a species, and how this causes traits to differ.  All students in Craig's lab are exposed to an interdisciplinary environment; current lab members have backgrounds in mathematics, computer science, neuroscience, developmental biology, and genetics.  Each year Craig teaches one or two courses on rotating topics of: ancient DNA, ethical issues in genomics, and software development for genetic analyses.


Unless otherwise indicated, scholarly articles published by Duke faculty members are made available here with a CC-BY-NC (Creative Commons Attribution Non-Commercial) license, as enabled by the Duke Open Access Policy. If you wish to use the materials in ways not already permitted under CC-BY-NC, please consult the copyright owner. Other materials are made available here through the author’s grant of a non-exclusive license to make their work openly accessible.