Show simple item record

Modeling Multi-factor Binding of the Genome

dc.contributor.advisor Hartemink, Alexander J
dc.contributor.author Wasson, Todd Steven
dc.date.accessioned 2011-01-06T16:03:58Z
dc.date.available 2012-01-01T05:30:13Z
dc.date.issued 2010
dc.identifier.uri https://hdl.handle.net/10161/3139
dc.description.abstract <p>Hundreds of different factors adorn the eukaryotic genome, binding to it in large number. These DNA binding factors (DBFs) include nucleosomes, transcription factors (TFs), and other proteins and protein complexes, such as the origin recognition complex (ORC). DBFs compete with one another for binding along the genome, yet many current models of genome binding do not consider different types of DBFs together simultaneously. Additionally, binding is a stochastic process that results in a continuum of binding probabilities at any position along the genome, but many current models tend to consider positions as being either binding sites or not.</p><p>Here, we present a model that allows a multitude of DBFs, each at different concentrations, to compete with one another for binding sites along the genome. The result is an 'occupancy profile', a probabilistic description of the DNA occupancy of each factor at each position. We implement our model efficiently as the software package COMPETE. We demonstrate genome-wide and at specific loci how modeling nucleosome binding alters TF binding, and vice versa, and illustrate how factor concentration influences binding occupancy. Binding cooperativity between nearby TFs arises implicitly via mutual competition with nucleosomes. Our method applies not only to TFs, but also recapitulates known occupancy profiles of a well-studied replication origin with and without ORC binding.</p><p>We then develop a statistical framework for tuning our model concentrations to further improve its predictions. Importantly, this tuning optimizes with respect to actual biological data. We take steps to ensure that our tuned parameters are biologically plausible.</p><p>Finally, we discuss novel extensions and applications of our model, suggesting next steps in its development and deployment.</p>
dc.subject Biology, Bioinformatics
dc.subject Computer Science
dc.subject Statistics
dc.subject Boltzmann chains
dc.subject Computational Biology
dc.subject DNA binding
dc.subject Hidden Markov models
dc.subject Statistical mechanics
dc.subject Transcription factors
dc.title Modeling Multi-factor Binding of the Genome
dc.type Dissertation
dc.department Computational Biology and Bioinformatics
duke.embargo.months 12


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record