Bayesian and Information-Theoretic Learning of High Dimensional Data

Chen, Minhua

Date

2012

Authors

Chen, Minhua

Advisors

Carin, Lawrence

Abstract

The concept of sparseness is harnessed to learn a low-dimensional representation of high-dimensional data. This sparseness assumption is exploited in multiple ways. In the Bayesian Elastic Net, a small number of correlated features are identified for the response variable. In the sparse Factor Analysis for biomarker trajectories, the high-dimensional gene expression data are reduced to a small number of latent factors, each with a prototypical dynamic trajectory. In the Bayesian Graphical LASSO, the inverse covariance matrix of the data distribution is assumed to be sparse, inducing a sparsely connected Gaussian graph. In the nonparametric Mixture of Factor Analyzers, the covariance matrices in the Gaussian Mixture Model are constrained to be low-rank, which is closely related to the concept of block sparsity.

Finally, in the information-theoretic projection design, a linear projection matrix is explicitly sought for information-preserving dimensionality reduction. All of the methods mentioned above prove effective in learning from both simulated and real high-dimensional datasets.

Type

Dissertation

Department

Electrical and Computer Engineering

Subjects

Electrical engineering, Statistics, Computer science, Bayesian statistics, High Dimensional Data Analysis, Information-Theoretic Learning, Machine learning, Signal processing, Sparseness

Permalink

https://hdl.handle.net/10161/5588

Citation

Chen, Minhua (2012). Bayesian and Information-Theoretic Learning of High Dimensional Data. Dissertation, Duke University. Retrieved from https://hdl.handle.net/10161/5588.

Collections

Dissertations

Except where otherwise noted, student scholarship that was shared on DukeSpace after 2009 is made available to the public under a Creative Commons Attribution / Non-commercial / No derivatives (CC-BY-NC-ND) license. All rights in student work shared on DukeSpace before 2009 remain with the author and/or their designee, whose permission may be required for reuse.
