Finding regulatory DNA motifs using alignment-free evolutionary conservation information.

dc.contributor.author

Gordân, Raluca

dc.contributor.author

Narlikar, Leelavati

dc.contributor.author

Hartemink, Alexander J

dc.coverage.spatial

England

dc.date.accessioned

2017-08-01T20:53:29Z

dc.date.available

2017-08-01T20:53:29Z

dc.date.issued

2010-04

dc.description.abstract

As an increasing number of eukaryotic genomes are being sequenced, comparative studies aimed at detecting regulatory elements in intergenic sequences are becoming more prevalent. Most comparative methods for transcription factor (TF) binding site discovery make use of global or local alignments of orthologous regulatory regions to assess whether a particular DNA site is conserved across related organisms, and thus more likely to be functional. Since binding sites are usually short, sometimes degenerate, and often independent of orientation, alignment algorithms may not align them correctly. Here, we present a novel, alignment-free approach for using conservation information for TF binding site discovery. We relax the definition of conserved sites: we consider a DNA site within a regulatory region to be conserved in an orthologous sequence if it occurs anywhere in that sequence, irrespective of orientation. We use this definition to derive informative priors over DNA sequence positions, and incorporate these priors into a Gibbs sampling algorithm for motif discovery. Our approach is simple and fast. It requires neither sequence alignments nor the phylogenetic relationships between the orthologous sequences, yet it is more effective on real biological data than methods that do.

dc.identifier

https://www.ncbi.nlm.nih.gov/pubmed/20047961

dc.identifier

gkp1166

dc.identifier.eissn

1362-4962

dc.identifier.uri

https://hdl.handle.net/10161/15158

dc.language

eng

dc.publisher

Oxford University Press (OUP)

dc.relation.ispartof

Nucleic Acids Res

dc.relation.isversionof

10.1093/nar/gkp1166

dc.subject

Base Sequence

dc.subject

Binding Sites

dc.subject

Conserved Sequence

dc.subject

Molecular Sequence Data

dc.subject

Promoter Regions, Genetic

dc.subject

Sequence Alignment

dc.subject

Sequence Analysis, DNA

dc.subject

Transcription Factors

dc.title

Finding regulatory DNA motifs using alignment-free evolutionary conservation information.

dc.type

Journal article

duke.contributor.orcid

Hartemink, Alexander J|0000-0002-1292-2606

pubs.author-url

https://www.ncbi.nlm.nih.gov/pubmed/20047961

pubs.begin-page

e90

pubs.issue

6

pubs.organisational-group

Basic Science Departments

pubs.organisational-group

Biostatistics & Bioinformatics

pubs.organisational-group

Computer Science

pubs.organisational-group

Duke

pubs.organisational-group

Molecular Genetics and Microbiology

pubs.organisational-group

School of Medicine

pubs.organisational-group

Trinity College of Arts & Sciences

pubs.publication-status

Published

pubs.volume

38

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Finding regulatory DNA motifs using alignment-free evolutionary conservation information.pdf
Size:
2.21 MB
Format:
Adobe Portable Document Format
Description:
Published version