A cloud-compatible bioinformatics pipeline for ultrarapid pathogen identification from next-generation sequencing of clinical samples.

dc.contributor.author

Naccache, Samia N

dc.contributor.author

Federman, Scot

dc.contributor.author

Veeraraghavan, Narayanan

dc.contributor.author

Zaharia, Matei

dc.contributor.author

Lee, Deanna

dc.contributor.author

Samayoa, Erik

dc.contributor.author

Bouquet, Jerome

dc.contributor.author

Greninger, Alexander L

dc.contributor.author

Luk, Ka-Cheung

dc.contributor.author

Enge, Barryett

dc.contributor.author

Wadford, Debra A

dc.contributor.author

Messenger, Sharon L

dc.contributor.author

Genrich, Gillian L

dc.contributor.author

Pellegrino, Kristen

dc.contributor.author

Grard, Gilda

dc.contributor.author

Leroy, Eric

dc.contributor.author

Schneider, Bradley S

dc.contributor.author

Fair, Joseph N

dc.contributor.author

Martínez, Miguel A

dc.contributor.author

Isa, Pavel

dc.contributor.author

Crump, John A

dc.contributor.author

DeRisi, Joseph L

dc.contributor.author

Sittler, Taylor

dc.contributor.author

Hackett, John

dc.contributor.author

Miller, Steve

dc.contributor.author

Chiu, Charles Y

dc.coverage.spatial

United States

dc.date.accessioned

2017-03-02T19:08:55Z

dc.date.available

2017-03-02T19:08:55Z

dc.date.issued

2014-07

dc.description.abstract

Unbiased next-generation sequencing (NGS) approaches enable comprehensive pathogen detection in the clinical microbiology laboratory and have numerous applications for public health surveillance, outbreak investigation, and the diagnosis of infectious diseases. However, practical deployment of the technology is hindered by the bioinformatics challenge of analyzing results accurately and in a clinically relevant timeframe. Here we describe SURPI ("sequence-based ultrarapid pathogen identification"), a computational pipeline for pathogen identification from complex metagenomic NGS data generated from clinical samples, and demonstrate use of the pipeline in the analysis of 237 clinical samples comprising more than 1.1 billion sequences. Deployable on both cloud-based and standalone servers, SURPI leverages two state-of-the-art aligners for accelerated analyses, SNAP and RAPSearch, which are as accurate as existing bioinformatics tools but orders of magnitude faster in performance. In fast mode, SURPI detects viruses and bacteria by scanning data sets of 7-500 million reads in 11 min to 5 h, while in comprehensive mode, all known microorganisms are identified, followed by de novo assembly and protein homology searches for divergent viruses in 50 min to 16 h. SURPI has also directly contributed to real-time microbial diagnosis in acutely ill patients, underscoring its potential key role in the development of unbiased NGS-based clinical assays in infectious diseases that demand rapid turnaround times.

dc.identifier

https://www.ncbi.nlm.nih.gov/pubmed/24899342

dc.identifier

gr.171934.113

dc.identifier.eissn

1549-5469

dc.identifier.uri

https://hdl.handle.net/10161/13772

dc.language

eng

dc.publisher

Cold Spring Harbor Laboratory

dc.relation.ispartof

Genome Res

dc.relation.isversionof

10.1101/gr.171934.113

dc.subject

Computational Biology

dc.subject

Databases, Nucleic Acid

dc.subject

High-Throughput Nucleotide Sequencing

dc.subject

Humans

dc.subject

Metagenomics

dc.subject

ROC Curve

dc.subject

Reproducibility of Results

dc.subject

Software

dc.title

A cloud-compatible bioinformatics pipeline for ultrarapid pathogen identification from next-generation sequencing of clinical samples.

dc.type

Journal article

pubs.author-url

https://www.ncbi.nlm.nih.gov/pubmed/24899342

pubs.begin-page

1180

pubs.end-page

1192

pubs.issue

7

pubs.organisational-group

Clinical Science Departments

pubs.organisational-group

Duke

pubs.organisational-group

Medicine

pubs.organisational-group

Medicine, Infectious Diseases

pubs.organisational-group

Pathology

pubs.organisational-group

School of Medicine

pubs.publication-status

Published

pubs.volume

24

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
GenomeRes_PathogenDiscovery.pdf
Size:
2.88 MB
Format:
Adobe Portable Document Format