Wavelet Regression using MapReduce and Analysis of Multiple Sclerosis Clinical Data

dc.contributor.advisor

Ma, Li

dc.contributor.advisor

Li, Meng

dc.contributor.author

Song, Hanyu

dc.date.accessioned

2017-08-16T18:26:12Z

dc.date.available

2017-08-16T18:26:12Z

dc.date.issued

2017

dc.department

Statistical Science

dc.description.abstract

Two problems, one related to scalable methods and the other on application of statistical methods to clinical data are addressed in this thesis. In the first chapter, motivated by growing numbers of ``large p'' datasets, we present a novel MapReduce framework for handling multivariate wavelet regression. We compare the time complexity of proposed and conventional methods and show the novel framework scales linearly in the dimension $p$ of the response matrix. Empirical results show consistency with our complexity analysis. This work has its potential application in analysing image data or genomic data where the dimensions are huge.

In the second chapter, we explore a clinical dataset of Multiple Sclerosis (MS) provided by Biogen, which comprises 579 actively managed MS patients enrolled at single center for up to 5 years. Since a therapy to curing MS is unknown, Biogen and we are developing statistical models to predict the progression of disability level as a therapeutic guide. Such disability can be roughly quantified by EDSS (Expanded Disability Status Scale), and as such we conduct predict modelling of EDSS. Before we arrive at these models, we perform explanatory data analysis, conduct predictive modelling of current EDSS based on measurements in the same year.

dc.identifier.uri

https://hdl.handle.net/10161/15265

dc.subject

Statistics

dc.title

Wavelet Regression using MapReduce and Analysis of Multiple Sclerosis Clinical Data

dc.type

Master's thesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Song_duke_0066N_14020.pdf
Size:
2.22 MB
Format:
Adobe Portable Document Format

Collections