Show simple item record

Estimating the Intrinsic Dimension of High-Dimensional Data Sets: A Multiscale, Geometric Approach

dc.contributor.advisor Maggioni, Mauro
dc.contributor.author Little, Anna Victoria
dc.date.accessioned 2011-05-20T19:35:35Z
dc.date.available 2011-11-15T05:30:16Z
dc.date.issued 2011
dc.identifier.uri https://hdl.handle.net/10161/3863
dc.description.abstract <p>This work deals with the problem of estimating the intrinsic dimension of noisy, high-dimensional point clouds. A general class of sets which are locally well-approximated by <italic>k</italic> dimensional planes but which are embedded in a <italic>D</italic>>><italic>k</italic> dimensional Euclidean space are considered. Assuming one has samples from such a set, possibly corrupted by high-dimensional noise, if the data is linear the dimension can be recovered using PCA. However, when the data is non-linear, PCA fails, overestimating the intrinsic dimension. A multiscale version of PCA is thus introduced which is robust to small sample size, noise, and non-linearities in the data.</p>
dc.subject Applied Mathematics
dc.subject dimension estimation
dc.subject geometric measure theory
dc.subject multiscale analysis
dc.subject point cloud data
dc.title Estimating the Intrinsic Dimension of High-Dimensional Data Sets: A Multiscale, Geometric Approach
dc.type Dissertation
dc.department Mathematics
duke.embargo.months 6


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record