Estimating the Intrinsic Dimension of High-Dimensional Data Sets: A Multiscale, Geometric Approach
dc.contributor.advisor | Maggioni, Mauro | |
dc.contributor.author | Little, Anna Victoria | |
dc.date.accessioned | 2011-05-20T19:35:35Z | |
dc.date.available | 2011-11-15T05:30:16Z | |
dc.date.issued | 2011 | |
dc.identifier.uri | https://hdl.handle.net/10161/3863 | |
dc.description.abstract | <p>This work deals with the problem of estimating the intrinsic dimension of noisy, high-dimensional point clouds. A general class of sets which are locally well-approximated by <italic>k</italic> dimensional planes but which are embedded in a <italic>D</italic>>><italic>k</italic> dimensional Euclidean space are considered. Assuming one has samples from such a set, possibly corrupted by high-dimensional noise, if the data is linear the dimension can be recovered using PCA. However, when the data is non-linear, PCA fails, overestimating the intrinsic dimension. A multiscale version of PCA is thus introduced which is robust to small sample size, noise, and non-linearities in the data.</p> | |
dc.subject | Applied Mathematics | |
dc.subject | dimension estimation | |
dc.subject | geometric measure theory | |
dc.subject | multiscale analysis | |
dc.subject | point cloud data | |
dc.title | Estimating the Intrinsic Dimension of High-Dimensional Data Sets: A Multiscale, Geometric Approach | |
dc.type | Dissertation | |
dc.department | Mathematics | |
duke.embargo.months | 6 |
Files in this item
This item appears in the following Collection(s)
- Duke Dissertations
Dissertations by Duke students