Browsing by Subject "High-Throughput Nucleotide Sequencing"
Item Open Access A cloud-compatible bioinformatics pipeline for ultrarapid pathogen identification from next-generation sequencing of clinical samples. (Genome Res, 2014-07) Naccache, Samia N; Federman, Scot; Veeraraghavan, Narayanan; Zaharia, Matei; Lee, Deanna; Samayoa, Erik; Bouquet, Jerome; Greninger, Alexander L; Luk, Ka-Cheung; Enge, Barryett; Wadford, Debra A; Messenger, Sharon L; Genrich, Gillian L; Pellegrino, Kristen; Grard, Gilda; Leroy, Eric; Schneider, Bradley S; Fair, Joseph N; Martínez, Miguel A; Isa, Pavel; Crump, John A; DeRisi, Joseph L; Sittler, Taylor; Hackett, John; Miller, Steve; Chiu, Charles Y
Unbiased next-generation sequencing (NGS) approaches enable comprehensive pathogen detection in the clinical microbiology laboratory and have numerous applications for public health surveillance, outbreak investigation, and the diagnosis of infectious diseases. However, practical deployment of the technology is hindered by the bioinformatics challenge of analyzing results accurately and in a clinically relevant timeframe. Here we describe SURPI ("sequence-based ultrarapid pathogen identification"), a computational pipeline for pathogen identification from complex metagenomic NGS data generated from clinical samples, and demonstrate use of the pipeline in the analysis of 237 clinical samples comprising more than 1.1 billion sequences. Deployable on both cloud-based and standalone servers, SURPI leverages two state-of-the-art aligners for accelerated analyses, SNAP and RAPSearch, which are as accurate as existing bioinformatics tools but orders of magnitude faster in performance. In fast mode, SURPI detects viruses and bacteria by scanning data sets of 7-500 million reads in 11 min to 5 h, while in comprehensive mode, all known microorganisms are identified, followed by de novo assembly and protein homology searches for divergent viruses in 50 min to 16 h.
SURPI has also directly contributed to real-time microbial diagnosis in acutely ill patients, underscoring its potential key role in the development of unbiased NGS-based clinical assays in infectious diseases that demand rapid turnaround times.

Item Open Access A genetic variant in the APE1/Ref-1 gene promoter -141T/G may modulate risk of glioblastoma in a Chinese Han population. (BMC cancer, 2011-01) Zhou, Keke; Hu, Dezhi; Lu, Juan; Fan, Weiwei; Liu, Hongliang; Chen, Hongyan; Chen, Gong; Wei, Qingyi; Du, Guhong; Mao, Ying; Lu, Daru; Zhou, Liangfu
BACKGROUND: The human apurinic/apyrimidinic endonuclease 1/Redox effector factor-1 (APE1/Ref-1) is implicated in tumor development and progression. Recently, the APE1/Ref-1 promoter -141T/G variant (rs1760944) has been reported to be associated with lung cancer risk. Given the importance of APE1/Ref-1 in both DNA repair and redox activity, we speculate that the -141T/G polymorphism may confer individual susceptibility to gliomas or their subtypes. METHODS: The APE1/Ref-1 -141T/G polymorphism was analyzed in a case-control study including 766 glioma patients (among them 241 glioblastoma, 284 astrocytomas except for glioblastoma and 241 other gliomas) and 824 cancer-free controls from eastern China. Genotyping was performed with the Sequenom MassARRAY iPLEX platform by use of an allele-specific MALDI-TOF mass spectrometry assay. We estimated odds ratios (ORs) and 95% confidence intervals (95% CIs) using unconditional logistic regression. A test of trend was calculated using the genotype as an ordinal variable in the regression model. For each statistically significant association identified, we estimated the false positive reporting probability (FPRP). FPRP values less than 0.2 were considered to indicate robust associations. RESULTS: No significant association between the APE1/Ref-1 promoter -141T/G polymorphism and glioma risk was observed.
However, the stratified analysis by histology revealed that the variant allele G significantly decreased glioblastoma risk (OR = 0.80, 95% CI = 0.65-0.98, P = 0.032). Individuals with the homozygous -141GG genotype exhibited a 46% reduced risk of glioblastoma (adjusted OR = 0.54, 95% CI 0.34-0.87, P = 0.012), compared with the TT homozygote. This result remained robust given the prior probabilities of 25% (FPRP = 0.052) and 10% (FPRP = 0.140), but not with a prior probability of 1% (FPRP = 0.643). The P value associated with the trend test was 0.014. CONCLUSIONS: Our results suggest that a specific genetic variant located in the APE1/Ref-1 promoter may modulate risk of glioblastoma, but not of other histological types of glioma. Larger studies with more APE1 polymorphisms are required to validate these preliminary findings.

Item Open Access Complexity of Delivering Precision Medicine: Opportunities and Challenges. (American Society of Clinical Oncology educational book. American Society of Clinical Oncology. Annual Meeting, 2018-05) Davis, Andrew A; McKee, Amy E; Kibbe, Warren A; Villaflor, Victoria M
Precision medicine has emerged as a tool to match patients with the appropriate treatment based on the precise molecular features of an individual patient's tumor. Although examples of targeted therapies exist resulting in dramatic improvements in patient outcomes, comprehensive genomic profiling of tumors has also demonstrated the incredible complexity of molecular alterations in tissue and blood. These sequencing methods provide opportunities to study the landscape of tumors at baseline and serially in response to treatment. These tools also serve as important biomarkers to detect resistance to treatment and determine higher likelihood of responding to particular treatments, such as immune checkpoint blockade. Federally funded and publicly available data repositories have emerged as mechanisms for data sharing.
In addition, novel clinical trials are emerging to develop new ways of incorporating molecular matched therapy into clinical trials. Various challenges to delivery of precision oncology include understanding the complexity of advanced tumors based on evolving "omics" and treatment resistance. For physicians, determining when and how to incorporate genetic and molecular tools into the clinic in a cost-effective manner is critical. Finally, we discuss the importance of well-designed prospective clinical trials, biomarkers such as liquid biopsies, the use of multidisciplinary tumor boards, and data sharing as evidence-based medicine tools to optimally study and deliver precision oncology to our patients.

Item Open Access Depression in pregnancy, infant birth weight and DNA methylation of imprint regulatory elements. (Epigenetics : official journal of the DNA Methylation Society, 2012-07) Liu, Y; Murphy, SK; Murtha, AP; Fuemmeler, BF; Schildkraut, J; Huang, Z; Overcash, F; Kurtzberg, J; Jirtle, R; Iversen, ES; Forman, MR; Hoyo, C
Depressed mood in pregnancy has been linked to low birth weight (LBW, 4,500 g) infants had 5.9% higher methylation at the PLAGL1 DMR compared with normal birth weight infants. Our findings confirm that severe maternal depressed mood in pregnancy is associated with LBW, and that MEG3 and IGF2 plasticity may play important roles.

Item Open Access EchinoDB, an application for comparative transcriptomics of deeply-sampled clades of echinoderms. (BMC Bioinformatics, 2016-01-22) Janies, Daniel A; Witter, Zach; Linchangco, Gregorio V; Foltz, David W; Miller, Allison K; Kerr, Alexander M; Jay, Jeremy; Reid, Robert W; Wray, Gregory A
BACKGROUND: One of our goals for the echinoderm tree of life project (http://echinotol.org) is to identify orthologs suitable for phylogenetic analysis from next-generation transcriptome data. The current dataset is the largest assembled for echinoderm phylogeny and transcriptomics.
We used RNA-Seq to profile adult tissues from 42 echinoderm specimens from 24 orders and 37 families. To sample members of clades that span key evolutionary divergences, many of our exemplars were collected from deep and polar seas. DESCRIPTION: A small fraction of the transcriptome data we produced is being used for phylogenetic reconstruction. Thus, to make a larger dataset available to researchers with a wide variety of interests, we made a web-based application, EchinoDB (http://echinodb.uncc.edu). EchinoDB is a repository of orthologous transcripts from echinoderms that is searchable via keywords and sequence similarity. CONCLUSIONS: From transcripts we identified 749,397 clusters of orthologous loci. We have developed the information technology to manage and search the loci and their annotations with respect to the Sea Urchin (Strongylocentrotus purpuratus) genome. Several users have already taken advantage of these data for spin-off projects in developmental biology, gene family studies, and neuroscience. We hope others will search EchinoDB to discover datasets relevant to a variety of additional questions in comparative biology.

Item Open Access Hybrid de novo genome assembly and centromere characterization of the gray mouse lemur (Microcebus murinus). (BMC biology, 2017-11-16) Larsen, Peter A; Harris, R Alan; Liu, Yue; Murali, Shwetha C; Campbell, C Ryan; Brown, Adam D; Sullivan, Beth A; Shelton, Jennifer; Brown, Susan J; Raveendran, Muthuswamy; Dudchenko, Olga; Machol, Ido; Durand, Neva C; Shamim, Muhammad S; Aiden, Erez Lieberman; Muzny, Donna M; Gibbs, Richard A; Yoder, Anne D; Rogers, Jeffrey; Worley, Kim C
The de novo assembly of repeat-rich mammalian genomes using only high-throughput short read sequencing data typically results in highly fragmented genome assemblies that limit downstream applications.
Here, we present an iterative approach to hybrid de novo genome assembly that incorporates datasets stemming from multiple genomic technologies and methods. We used this approach to improve the gray mouse lemur (Microcebus murinus) genome from early draft status to a near chromosome-scale assembly. We used a combination of advanced genomic technologies to iteratively resolve conflicts and super-scaffold the M. murinus genome. We improved the M. murinus genome assembly to a scaffold N50 of 93.32 Mb. Whole genome alignments between our primary super-scaffolds and 23 human chromosomes revealed patterns that are congruent with historical comparative cytogenetic data, thus demonstrating the accuracy of our de novo scaffolding approach and allowing assignment of scaffolds to M. murinus chromosomes. Moreover, we utilized our independent datasets to discover and characterize sequences associated with centromeres across the mouse lemur genome. Quality assessment of the final assembly found 96% of mouse lemur canonical transcripts nearly complete, comparable to other published high-quality reference genome assemblies. We describe a new assembly of the gray mouse lemur (Microcebus murinus) genome with chromosome-scale scaffolds produced using a hybrid bioinformatic and sequencing approach. The approach is cost effective and produces superior results based on metrics of contiguity and completeness.
Our results show that emerging genomic technologies can be used in combination to characterize centromeres of non-model species and to produce accurate de novo chromosome-scale genome assemblies of complex mammalian genomes.

Item Open Access Next generation multilocus sequence typing (NGMLST) and the analytical software program MLSTEZ enable efficient, cost-effective, high-throughput, multilocus sequencing typing. (Fungal Genet Biol, 2015-02) Chen, Yuan; Frazzitta, Aubrey E; Litvintseva, Anastasia P; Fang, Charles; Mitchell, Thomas G; Springer, Deborah J; Ding, Yun; Yuan, George; Perfect, John R
Multilocus sequence typing (MLST) has become the preferred method for genotyping many biological species, and it is especially useful for analyzing haploid eukaryotes. MLST is rigorous, reproducible, and informative, and MLST genotyping has been shown to identify major phylogenetic clades, molecular groups, or subpopulations of a species, as well as individual strains or clones. MLST molecular types often correlate with important phenotypes. Conventional MLST involves the extraction of genomic DNA and the amplification by PCR of several conserved, unlinked gene sequences from a sample of isolates of the taxon under investigation. In some cases, as few as three loci are sufficient to yield definitive results. The amplicons are sequenced, aligned, and compared by phylogenetic methods to distinguish statistically significant differences among individuals and clades. Although MLST is simpler, faster, and less expensive than whole genome sequencing, it is more costly and time-consuming than less reliable genotyping methods (e.g. amplified fragment length polymorphisms). Here, we describe a new MLST method that uses next-generation sequencing, a multiplexing protocol, and appropriate analytical software to provide accurate, rapid, and economical MLST genotyping of 96 or more isolates in a single assay.
We demonstrate this methodology by genotyping isolates of the well-characterized, human pathogenic yeast Cryptococcus neoformans.

Item Open Access Next-generation sequencing of apoptotic DNA breakpoints reveals association with actively transcribed genes and gene translocations. (PLoS One, 2011) Fullwood, Melissa J; Lee, Joanne; Lin, Lifang; Li, Guoliang; Huss, Mikael; Ng, Patrick; Sung, Wing-Kin; Shenolikar, Shirish
DNA fragmentation is a well-recognized hallmark of apoptosis. However, the precise DNA sequences cleaved during apoptosis triggered by distinct mechanisms remain unclear. We used next-generation sequencing of DNA fragments generated in Actinomycin D-treated human HL-60 leukemic cells to generate a high-throughput, global map of apoptotic DNA breakpoints. These data highlighted that DNA breaks are non-random and show a significant association with active genes and open chromatin regions. We noted that transcription factor binding sites were also enriched within a fraction of the apoptotic breakpoints. Interestingly, extensive apoptotic cleavage was noted within genes that are frequently translocated in human cancers. We speculate that the non-random fragmentation of DNA during apoptosis may contribute to gene translocations and the development of human cancers.

Item Open Access Search for microRNAs expressed by intracellular bacterial pathogens in infected mammalian cells. (PLoS One, 2014) Furuse, Yuki; Finethy, Ryan; Saka, Hector A; Xet-Mull, Ana M; Sisk, Dana M; Smith, Kristen L Jurcic; Lee, Sunhee; Coers, Jörn; Valdivia, Raphael H; Tobin, David M; Cullen, Bryan R
MicroRNAs are expressed by all multicellular organisms and play a critical role as post-transcriptional regulators of gene expression. Moreover, different microRNA species are known to influence the progression of a range of different diseases, including cancer and microbial infections.
A number of different human viruses also encode microRNAs that can attenuate cellular innate immune responses and promote viral replication, and a fungal pathogen that infects plants has recently been shown to express microRNAs in infected cells that repress host cell immune responses and promote fungal pathogenesis. Here, we have used deep sequencing of total expressed small RNAs, as well as small RNAs associated with the cellular RNA-induced silencing complex RISC, to search for microRNAs that are potentially expressed by intracellular bacterial pathogens and translocated into infected animal cells. In the case of Legionella and Chlamydia and the two mycobacterial species M. smegmatis and M. tuberculosis, we failed to detect any bacterial small RNAs that had the characteristics expected for authentic microRNAs, although large numbers of small RNAs of bacterial origin could be recovered. However, a third mycobacterial species, M. marinum, did express an ~23-nt small RNA that was bound by RISC and derived from an RNA stem-loop with the characteristics expected for a pre-microRNA. While intracellular expression of this candidate bacterial microRNA was too low to effectively repress target mRNA species in infected cultured cells in vitro, artificial overexpression of this potential bacterial pre-microRNA did result in the efficient repression of a target mRNA. This bacterial small RNA therefore represents the first candidate microRNA of bacterial origin.

Item Open Access Simple and inexpensive ribosome profiling analysis of mRNA translation. (Methods (San Diego, Calif.), 2015-12) Reid, David W; Shenolikar, Shirish; Nicchitta, Christopher V
The development and application of ribosome profiling has markedly advanced our understanding of ribosomes and mRNA translation.
The experimental approach, which relies on deep sequencing of ribosome-protected mRNA fragments generated by treatment of polyribosomes with exogenous nucleases, provides a transcriptome-wide assessment of translation. The broad application of ribosome profiling has been slowed by the complexity and expense of the protocol. Here, we provide a simplified ribosome profiling method that uses micrococcal nuclease to generate ribosome footprints in crude cellular extracts, which are then purified simply by size selection via polyacrylamide gel electrophoresis. This simplification removes the laborious or expensive purification of ribosomes that has typically been used. This direct extraction method generates gene-level ribosome profiling data that are similar to a method that includes ribosome purification. This protocol should significantly ease the barrier to entry for research groups interested in employing ribosome profiling.

Item Open Access Sperm DNA methylation altered by THC and nicotine: Vulnerability of neurodevelopmental genes with bivalent chromatin. (Scientific reports, 2020-09) Schrott, Rose; Rajavel, Maya; Acharya, Kelly; Huang, Zhiqing; Acharya, Chaitanya; Hawkey, Andrew; Pippen, Erica; Lyerly, H Kim; Levin, Edward D; Murphy, Susan K
Men consume the most nicotine and cannabis products, but impacts on sperm epigenetics are poorly characterized. Evidence suggests that preconception exposure to these drugs alters offspring neurodevelopment. Epigenetics may in part facilitate heritability. We therefore compared effects of exposure to tetrahydrocannabinol (THC) and nicotine on DNA methylation in rat sperm at genes involved in neurodevelopment. Reduced representation bisulfite sequencing data from sperm of rats exposed to THC via oral gavage showed that seven neurodevelopmentally active genes were significantly differentially methylated versus controls.
Pyrosequencing data revealed majority overlap in differential methylation in sperm from rats exposed to THC via injection as well as those exposed to nicotine. Neurodevelopmental genes including autism candidates are vulnerable to environmental exposures and common features may mediate this vulnerability. We discovered that autism candidate genes are significantly enriched for bivalent chromatin structure, suggesting this configuration may increase vulnerability of genes in sperm to disrupted methylation.