Methods for Systematic Exploratory Analysis of Gene Expression Data with Applications to Cancer Genomics

Loading...
Thumbnail Image

Date

2017

Authors

Wagner, Florian

Advisors

Dave, Sandeep S

Journal Title

Journal ISSN

Volume Title

Repository Usage Stats

392
views
127
downloads

Abstract

Advances in technologies for gene expression profiling have resulted in an unprecedented abundance of gene expression data. However, computational methods available for the exploratory analysis of such data are limited in their ability to generate an interpretable overview of biologically relevant similarities and differences among samples. This work first introduces the XL-mHG test, a sensitive and specific hypothesis test for detecting gene set enrichment, and discusses its algorithmic and statistical properties. It further introduces GO-PCA, a method for exploratory analysis of gene expression data using prior knowledge. The XL-mHG test serves as a building block for GO-PCA. The output of GO-PCA consists of functional expression signatures, designed to provide an interpretable representation of biologically meaningful variation in the data. The power and versatility of the method is demonstrated on heterogeneous human and mouse expression data. Finally, applications of the proposed methods to carcinoma and lymphoma expression data aim to demonstrate their clinical relevance. The effective utilization of prior knowledge in the exploratory analysis of gene expression data through carefully designed computational methods is essential for successfully harnessing the power of current and future platforms for gene expression profiling, with the aim of generating clinically relevant insights into complex diseases such as cancer.

Description

Provenance

Citation

Citation

Wagner, Florian (2017). Methods for Systematic Exploratory Analysis of Gene Expression Data with Applications to Cancer Genomics. Dissertation, Duke University. Retrieved from https://hdl.handle.net/10161/14375.

Collections


Dukes student scholarship is made available to the public using a Creative Commons Attribution / Non-commercial / No derivative (CC-BY-NC-ND) license.