Show simple item record

dc.contributor.advisor Ohler, Uwe en_US
dc.contributor.advisor Mukherjee, Sayan en_US
dc.contributor.author Georgiev, Stoyan en_US
dc.date.accessioned 2012-05-29T16:45:41Z
dc.date.available 2012-05-29T16:45:41Z
dc.date.issued 2011 en_US
dc.identifier.uri http://hdl.handle.net/10161/5708
dc.description Dissertation en_US
dc.description.abstract <p>Uncovering the DNA regulatory logic in complex organisms has been one of the important goals of modern biology in the post-genomic era. The sequencing of multiple genomes in combination with the advent of DNA microarrays and, more recently, of massively parallel high-throughput sequencing technologies has made possible the adoption of a global perspective to the inference of the regulatory rules governing the context-specific interpretation of the genetic code that complements the more focused classical experimental approaches. Extracting useful information and managing the complexity resulting from the sheer volume and the high-dimensionality of the data produced by these genomic assays has emerged as a major challenge which we attempt to address in this work by developing computational methods and tools, specifically designed for the study of the gene regulatory processes in this new global genomic context. </p><p>First, we focus on the genome-wide discovery of physical interactions between regulatory sequence regions and their cognate proteins at both the DNA and RNA level. We present a motif analysis framework that leverages the genome-wide</p><p>evidence for sequence-specific interactions between trans-acting factors and their preferred cis-acting regulatory regions. The utility of the proposed framework is demonstarted on DNA and RNA cross-linking high-throughput data.</p><p>A second goal of this thesis is the development of scalable approaches to dimension reduction based on spectral decomposition and their application to the study of population structure in massive high-dimensional genetic data sets. We have developed computational tools and have performed theoretical and empirical analyses of their statistical properties with particular emphasis on the analysis of the individual genetic variation measured by Single Nucleotide Polymorphism (SNP) microrarrays.</p> en_US
dc.subject Bioinformatics en_US
dc.subject binding site en_US
dc.subject chip-seq en_US
dc.subject dimension reduction en_US
dc.subject population structure en_US
dc.subject randomized algorithm en_US
dc.subject transcription factor en_US
dc.title Computational Methods For Functional Motif Identification and Approximate Dimension Reduction in Genomic Data en_US
dc.type Dissertation en_US
dc.department Computational Biology and Bioinformatics en_US

Files in this item

This item appears in the following Collection(s)

Show simple item record