Robust and scalable unsupervised learning via landmark diffusion, from theory to medical application
Biomedical time series contain rich information about human systems, however, effective algorithms for analyzing long-term physiological time series have not yet been developed because of the huge volume size, high dimensionality and large noise nature of the data. Motivated by such challenging task, we proposed a novel spectral embedding algorithm, which we coined Robust and Scalable Embedding via Landmark Diffusion (ROSELAND). The solution is a generic and not limited to analyze physiological waveforms. In short, we measure the affinity between two points via a set of landmarks, which is composed of a small number of points, and ``diffuse'' on the dataset via the landmark set to achieve a spectral embedding. The algorithm is applied to study the arterial blood pressure waveform dynamics during a liver transplant operation lasting for 12 hours long. In addition, we show that Roseland is not only numerically scalable, but also preserves the geometric properties via its diffusion nature under the manifold setup; that is, we theoretically explore the asymptotical behavior of Roseland under the manifold setup, and provide a L-infinity spectral convergence with a rate. Moreover, we offer a high dimensional noise analysis with the help of Gaussian approximation, and show that Roseland is robust to noise.

This work is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 United States License.
Rights for Collection: Duke Dissertations
Works are deposited here by their authors, and represent their research and opinions, not that of Duke University. Some materials and descriptions may include offensive content. More info