Estimating the Intrinsic Dimension of High-Dimensional Data Sets: A Multiscale, Geometric Approach

dc.contributor.advisor

Maggioni, Mauro

dc.contributor.author

Little, Anna Victoria

dc.date.accessioned

2011-05-20T19:35:35Z

dc.date.available

2011-11-15T05:30:16Z

dc.date.issued

2011

dc.department

Mathematics

dc.description.abstract

This work deals with the problem of estimating the intrinsic dimension of noisy, high-dimensional point clouds. A general class of sets which are locally well-approximated by k dimensional planes but which are embedded in a D>>k dimensional Euclidean space are considered. Assuming one has samples from such a set, possibly corrupted by high-dimensional noise, if the data is linear the dimension can be recovered using PCA. However, when the data is non-linear, PCA fails, overestimating the intrinsic dimension. A multiscale version of PCA is thus introduced which is robust to small sample size, noise, and non-linearities in the data.

dc.identifier.uri

https://hdl.handle.net/10161/3863

dc.subject

Applied mathematics

dc.subject

dimension estimation

dc.subject

geometric measure theory

dc.subject

Multiscale analysis

dc.subject

point cloud data

dc.title

Estimating the Intrinsic Dimension of High-Dimensional Data Sets: A Multiscale, Geometric Approach

dc.type

Dissertation

duke.embargo.months

6

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Little_duke_0066D_10913.pdf
Size:
1.33 MB
Format:
Adobe Portable Document Format

Collections