Integrative Modeling of Genetic and Transciptomic Data for the Identification of Allele-Specific Expression

Loading...
Thumbnail Image
Limited Access
This item is unavailable until:
2025-06-06

Date

2024

Authors

Journal Title

Journal ISSN

Volume Title

Repository Usage Stats

9
views
0
downloads

Abstract

The challenge of diagnosing rare genetic diseases persists despite advances in high-throughput sequencing. The limitation stems from an exome-centric diagnostic focus that often overlooks the influence of non-coding variants on gene expression. This research addresses this shortfall by leveraging allele-specific expression (ASE) analysis to detect cis-regulatory disruptions in gene expression, which could be pivotal for the diagnosis of non-exomic rare diseases.A novel computational framework, Bayesian Estimation of Allele Specific Transcript Integration across Exons (BEASTIE), was developed to refine ASE estimation. BEASTIE incorporates multiple heterozygous loci within a gene and rectifies phasing errors inherent in ASE detection. Comparative analyses reveal BEASTIE's enhanced accuracy over traditional methods, particularly in scenarios characterized by elevated heterozygosity and phasing errors. An advanced iteration, iBEASTIE, further incorporates error rates informed by genetic and genomic features, optimizing ASE estimations. In collaboration, quickBEAST—a C++ implementation of the BEASTIE model—was engineered, employing a subgrid algorithm to expedite the computation of ASE effect sizes. This tool proves essential for genome-wide analyses, evidenced by its application to 1000 Genome Project data, which aimed to map the ASE landscape and unearth novel imprinted genes. The practicality of these methods was tested in a case study of Glycogen Storage Disease (GSD), involving six probands. The integrated diagnostic pipeline—encompassing ASE, isoform, and differential expression analyses—identified a regulatory variant implicated in the disease phenotype. This finding was substantiated through CRISPR assays, verifying the computational predictions.

Description

Provenance

Citation

Citation

Zou, Xue (2024). Integrative Modeling of Genetic and Transciptomic Data for the Identification of Allele-Specific Expression. Dissertation, Duke University. Retrieved from https://hdl.handle.net/10161/30883.

Collections


Except where otherwise noted, student scholarship that was shared on DukeSpace after 2009 is made available to the public under a Creative Commons Attribution / Non-commercial / No derivatives (CC-BY-NC-ND) license. All rights in student work shared on DukeSpace before 2009 remain with the author and/or their designee, whose permission may be required for reuse.