Bayesian and Information-Theoretic Learning of High Dimensional Data

Carin, Lawrence; Chen, Minhua
2012-05-25
https://hdl.handle.net/10161/5588

Sparseness is harnessed to learn a low-dimensional representation of high-dimensional data, and the assumption is exploited in several ways. In the Bayesian Elastic Net, a small number of correlated features is identified for the response variable. In sparse Factor Analysis for biomarker trajectories, high-dimensional gene expression data is reduced to a small number of latent factors, each with a prototypical dynamic trajectory. In the Bayesian Graphical LASSO, the inverse covariance matrix of the data distribution is assumed to be sparse, which induces a sparsely connected Gaussian graph. In the nonparametric Mixture of Factor Analyzers, the covariance matrices in the Gaussian Mixture Model are constrained to be low-rank, which is closely related to the concept of block sparsity.

Finally, in the information-theoretic projection design, a linear projection matrix is explicitly sought for information-preserving dimensionality reduction. All of these methods prove effective on both simulated and real high-dimensional datasets.

Subjects: Electrical engineering; Statistics; Computer science; Bayesian statistics; High Dimensional Data Analysis; Information-Theoretic Learning; Machine learning; Signal processing; Sparseness

Type: Dissertation