Finding regulatory DNA motifs using alignment-free evolutionary conservation information.

Gordân, Raluca; Narlikar, Leelavati; Hartemink, Alexander J

Date

2010-04

Authors

Gordân, Raluca

Narlikar, Leelavati

Hartemink, Alexander J

Abstract

As an increasing number of eukaryotic genomes are being sequenced, comparative studies aimed at detecting regulatory elements in intergenic sequences are becoming more prevalent. Most comparative methods for transcription factor (TF) binding site discovery make use of global or local alignments of orthologous regulatory regions to assess whether a particular DNA site is conserved across related organisms, and thus more likely to be functional. Since binding sites are usually short, sometimes degenerate, and often independent of orientation, alignment algorithms may not align them correctly. Here, we present a novel, alignment-free approach for using conservation information for TF binding site discovery. We relax the definition of conserved sites: we consider a DNA site within a regulatory region to be conserved in an orthologous sequence if it occurs anywhere in that sequence, irrespective of orientation. We use this definition to derive informative priors over DNA sequence positions, and incorporate these priors into a Gibbs sampling algorithm for motif discovery. Our approach is simple and fast. It requires neither sequence alignments nor the phylogenetic relationships between the orthologous sequences, yet it is more effective on real biological data than methods that do.
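The relaxed definition of conservation described in the abstract can be illustrated with a small Python sketch. It treats a site as conserved in an orthologous sequence if the site or its reverse complement occurs anywhere in that sequence. Exact string matching, the per-position score (the fraction of orthologs containing the site), and the names `reverse_complement`, `is_conserved`, and `conservation_scores` are assumptions made for illustration; the abstract does not specify how matches are scored or how the priors are computed, so this is not the authors' implementation.

```python
from typing import List, Sequence

# Complement table for unambiguous DNA bases (assumes A/C/G/T only).
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(site: str) -> str:
    """Return the reverse complement of a DNA string."""
    return site.upper().translate(_COMPLEMENT)[::-1]


def is_conserved(site: str, ortholog: str) -> bool:
    """True if the site occurs anywhere in the ortholog, in either orientation.

    Assumption: an occurrence means an exact match.
    """
    site = site.upper()
    ortholog = ortholog.upper()
    return site in ortholog or reverse_complement(site) in ortholog


def conservation_scores(
    reference: str, orthologs: Sequence[str], width: int
) -> List[float]:
    """Score each width-long site in the reference sequence.

    The score is the fraction of orthologous sequences in which the site is
    conserved (a hypothetical scoring choice, not taken from the paper).
    """
    if not orthologs:
        raise ValueError("at least one orthologous sequence is required")
    scores = []
    for start in range(len(reference) - width + 1):
        site = reference[start:start + width]
        hits = sum(is_conserved(site, ortholog) for ortholog in orthologs)
        scores.append(hits / len(orthologs))
    return scores
```

No alignment or phylogeny is needed here: each ortholog is simply scanned for the site in both orientations. Scores of this kind would still have to be converted into a positional prior before a Gibbs sampler could use them, and that step is omitted.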

Type

Journal article

Subjects

Base Sequence; Binding Sites; Conserved Sequence; Molecular Sequence Data; Promoter Regions, Genetic; Sequence Alignment; Sequence Analysis, DNA; Transcription Factors

Permalink

https://hdl.handle.net/10161/15158

Published Version (Please cite this version)

10.1093/nar/gkp1166

Publication Info

Gordân, Raluca, Leelavati Narlikar and Alexander J Hartemink (2010). Finding regulatory DNA motifs using alignment-free evolutionary conservation information. Nucleic Acids Res, 38(6), e90. doi:10.1093/nar/gkp1166. Retrieved from https://hdl.handle.net/10161/15158.

This is constructed from limited available data and may be imprecise. To cite this article, please review & use the official citation provided by the journal.

Collections

Scholarly Articles

Scholars@Duke

Alexander J. Hartemink

Professor of Computer Science

Computational biology, machine learning, Bayesian statistics, transcriptional regulation, genomics and epigenomics, graphical models, Bayesian networks, hidden Markov models, systems biology, computational neurobiology, classification, feature selection

Unless otherwise indicated, scholarly articles published by Duke faculty members are made available here with a CC-BY-NC (Creative Commons Attribution Non-Commercial) license, as enabled by the Duke Open Access Policy. If you wish to use the materials in ways not already permitted under CC-BY-NC, please consult the copyright owner. Other materials are made available here through the author’s grant of a non-exclusive license to make their work openly accessible.
