Efficient Gaussian process regression for large datasets.

dc.contributor.author

Banerjee, Anjishnu

dc.contributor.author

Dunson, David B

dc.contributor.author

Tokdar, Surya T

dc.coverage.spatial

England

dc.date.accessioned

2017-10-01T21:14:04Z

dc.date.available

2017-10-01T21:14:04Z

dc.date.issued

2013-03

dc.description.abstract

Gaussian processes are widely used in nonparametric regression, classification and spatiotemporal modelling, facilitated in part by a rich literature on their theoretical properties. However, one of their practical limitations is expensive computation, typically on the order of n(3) where n is the number of data points, in performing the necessary matrix inversions. For large datasets, storage and processing also lead to computational bottlenecks, and numerical stability of the estimates and predicted values degrades with increasing n. Various methods have been proposed to address these problems, including predictive processes in spatial data analysis and the subset-of-regressors technique in machine learning. The idea underlying these approaches is to use a subset of the data, but this raises questions concerning sensitivity to the choice of subset and limitations in estimating fine-scale structure in regions that are not well covered by the subset. Motivated by the literature on compressive sensing, we propose an alternative approach that involves linear projection of all the data points onto a lower-dimensional subspace. We demonstrate the superiority of this approach from a theoretical perspective and through simulated and real data examples.

dc.identifier

https://www.ncbi.nlm.nih.gov/pubmed/23869109

dc.identifier.issn

0006-3444

dc.identifier.uri

https://hdl.handle.net/10161/15591

dc.language

eng

dc.publisher

Oxford University Press (OUP)

dc.relation.ispartof

Biometrika

dc.relation.isversionof

10.1093/biomet/ass068

dc.subject

Bayesian regression

dc.subject

Compressive sensing

dc.subject

Dimensionality reduction

dc.subject

Gaussian process

dc.subject

Random projection

dc.title

Efficient Gaussian process regression for large datasets.

dc.type

Journal article

duke.contributor.orcid

Tokdar, Surya T|0000-0001-5162-1155

pubs.author-url

https://www.ncbi.nlm.nih.gov/pubmed/23869109

pubs.begin-page

75

pubs.end-page

89

pubs.issue

1

pubs.organisational-group

Duke

pubs.organisational-group

Duke Institute for Brain Sciences

pubs.organisational-group

Electrical and Computer Engineering

pubs.organisational-group

Institutes and Provost's Academic Units

pubs.organisational-group

Pratt School of Engineering

pubs.organisational-group

Statistical Science

pubs.organisational-group

Trinity College of Arts & Sciences

pubs.organisational-group

University Institutes and Centers

pubs.publication-status

Published

pubs.volume

100

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
1106.5779v1.pdf
Size:
230.75 KB
Format:
Adobe Portable Document Format