Gene set-based Signal-Detection Analyses with Goodness-of-Fit Statistics and Their Application in Complex Diseases

dc.contributor.advisor

Allen, Andrew S

dc.contributor.author

Zhang, Mengqi

dc.date.accessioned

2020-01-27T16:52:04Z

dc.date.available

2020-01-27T16:52:04Z

dc.date.issued

2019

dc.department

Computational Biology and Bioinformatics

dc.description.abstract

Rare diseases are difficult to diagnose and uncertain to treat. The identification of specific genes associated with particular rare diseases and phenotypes can provide insight into the mechanism of certain rare disease subtypes and suggest therapeutic targets to improve patient outcomes. However, single gene-based methods for detecting rare disease-associated variants are often underpowered and can be hard to interpret. Therefore, this dissertation explores alternative approaches based on gene set-based methods. These analyses can be solved with a goodness-of-fit test that assesses whether the distribution of observed statistics of a given set of genes/variants significantly differs from the expected distribution.

This dissertation explores a flexible gene set-based signal-detection framework based on the goodness-of-fit tests. A user-friendly and efficient R program was developed for this research. In addition, this dissertation proposes a new gene-set analyses method that can leverage prior information to inform the detection of whether any of the genes within a biologically informed gene-set is associated with disease phenotypes on a special goodness-of-fit a test called higher criticism. Further, this dissertation investigates the asymptotic distribution of our higher criticism statistic based on the theoretically weighted p-values. Collectively, these methods are innovative because they based on gene set and incorporate the prior information, which enhances the power of associations between rare variants and complex diseases. These results improve the ability to identify and optimally treat genetic disease subtypes.

dc.identifier.uri

https://hdl.handle.net/10161/19821

dc.subject

Biostatistics

dc.subject

Genetics

dc.subject

Complex Disease

dc.subject

Gene Set-based Analysis

dc.subject

Goodness of Fit Test

dc.subject

Higher Criticism

dc.title

Gene set-based Signal-Detection Analyses with Goodness-of-Fit Statistics and Their Application in Complex Diseases

dc.type

Dissertation

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Zhang_duke_0066D_15290.pdf
Size:
65.78 MB
Format:
Adobe Portable Document Format

Collections