Improved Genomic Selection using Vowpal Wabbit with Random Fourier Features

dc.contributor.advisor

Mukherjee, Sayan

dc.contributor.author

Zhang, Jaslyn

dc.date.accessioned

2017-05-08T22:25:23Z

dc.date.available

2017-05-08T22:25:23Z

dc.date.issued

2017-05-08

dc.department

Statistical Science

dc.description.abstract

Nonlinear regression models are often used in statistics and machine learning due to greater accuracy than linear models. In this work, we present a novel modeling framework that is both computationally efficient for high-dimensional datasets, and predicts more accurately than most of the classic state-of-the-art predictive models. Here, we couple a nonlinear random Fourier feature data transformation with an intrinsically fast learning algorithm called Vowpal Wabbit or VW. The key idea we develop is that by introducing nonlinear structure to an otherwise linear framework, we are able to consider all possible higher-order interactions between entries in a string. The utility of our nonlinear VW extension is examined, in some detail, under an important problem in statistical genetics: genomic selection (i.e.~the prediction of phenotype from genotype). We illustrate the benefits of our method and its robustness to underlying genetic architecture on a real dataset, which includes 129 quantitative heterogeneous stock mice traits from the Wellcome Trust Centre for Human Genetics.

dc.identifier.uri

https://hdl.handle.net/10161/14308

dc.language.iso

en_US

dc.subject

Fourier Transforms

dc.subject

Vowpal Wabbit

dc.subject

Genomic Selection

dc.title

Improved Genomic Selection using Vowpal Wabbit with Random Fourier Features

dc.type

Honors thesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
improved-genomic-selection.pdf
Size:
735.9 KB
Format:
Adobe Portable Document Format
Description:
Main article