A Bayesian Model for Nucleosome Positioning Using DNase-seq Data

dc.contributor.advisor

Iversen, Edwin S

dc.contributor.author

Zhong, Jianling

dc.date.accessioned

2015-05-12T20:50:24Z

dc.date.issued

2015

dc.department

Statistical Science

dc.description.abstract

As fundamental structural units of the chromatin, nucleosomes are involved in virtually all aspects of genome function. Different methods have been developed to map genome-wide nucleosome positions, including MNase-seq and a recent chemical method requiring genetically engineered cells. However, these methods are either low resolution and prone to enzymatic sequence bias or require genetically modified cells. The DNase I enzyme has been used to probe nucleosome structure since the 1960s, but in the current high throughput sequencing era, DNase-seq has mainly been used to study regulatory sequences known as DNase hypersensitive sites. This thesis shows that DNase-seq data is also very informative about nucleosome positioning. The distinctive oscillatory DNase I cutting patterns on nucleosomal DNA are shown and discussed. Based on these patterns, a Bayes factor is proposed to be used for distinguishing nucleosomal and non-nucleosomal genome positions. The results show that this approach is highly sensitive and specific. A Bayesian method that simulates the data generation process and can provide more interpretable results is further developed based on the Bayes factor investigations. Preliminary results on a test genomic region show that the Bayesian model works well in identifying nucleosome positioning. Estimated posterior distributions also agree with some known biological observations from external data. Taken together, methods developed in this thesis show that DNase-seq can be used to identify nucleosome positioning, adding great value to this widely utilized protocol.

dc.identifier.uri

https://hdl.handle.net/10161/9994

dc.subject

Statistics

dc.subject

Bioinformatics

dc.subject

Bayes factor

dc.subject

Bayesian inference

dc.subject

DNase-seq

dc.subject

nucleosome positioning

dc.title

A Bayesian Model for Nucleosome Positioning Using DNase-seq Data

dc.type

Master's thesis

Files

Collections