Gene set-based Signal-Detection Analyses with Goodness-of-Fit Statistics and Their Application in Complex Diseases
| dc.contributor.advisor | Allen, Andrew S | |
| dc.contributor.author | Zhang, Mengqi | |
| dc.date.accessioned | 2020-01-27T16:52:04Z | |
| dc.date.available | 2020-01-27T16:52:04Z | |
| dc.date.issued | 2019 | |
| dc.department | Computational Biology and Bioinformatics | |
| dc.description.abstract | Rare diseases are difficult to diagnose and uncertain to treat. The identification of specific genes associated with particular rare diseases and phenotypes can provide insight into the mechanism of certain rare disease subtypes and suggest therapeutic targets to improve patient outcomes. However, single gene-based methods for detecting rare disease-associated variants are often underpowered and can be hard to interpret. Therefore, this dissertation explores alternative approaches based on gene set-based methods. These analyses can be solved with a goodness-of-fit test that assesses whether the distribution of observed statistics of a given set of genes/variants significantly differs from the expected distribution. This dissertation explores a flexible gene set-based signal-detection framework based on the goodness-of-fit tests. A user-friendly and efficient R program was developed for this research. In addition, this dissertation proposes a new gene-set analyses method that can leverage prior information to inform the detection of whether any of the genes within a biologically informed gene-set is associated with disease phenotypes on a special goodness-of-fit a test called higher criticism. Further, this dissertation investigates the asymptotic distribution of our higher criticism statistic based on the theoretically weighted p-values. Collectively, these methods are innovative because they based on gene set and incorporate the prior information, which enhances the power of associations between rare variants and complex diseases. These results improve the ability to identify and optimally treat genetic disease subtypes. | |
| dc.identifier.uri | ||
| dc.subject | Biostatistics | |
| dc.subject | Genetics | |
| dc.subject | Complex Disease | |
| dc.subject | Gene Set-based Analysis | |
| dc.subject | Goodness of Fit Test | |
| dc.subject | Higher Criticism | |
| dc.title | Gene set-based Signal-Detection Analyses with Goodness-of-Fit Statistics and Their Application in Complex Diseases | |
| dc.type | Dissertation |
Files
Original bundle
- Name:
- Zhang_duke_0066D_15290.pdf
- Size:
- 65.78 MB
- Format:
- Adobe Portable Document Format