Cumulon: Simplified Matrix-Based Data Analytics in the Cloud

dc.contributor.advisor

Yang, Jun

dc.contributor.advisor

Babu, Shivnath

dc.contributor.author

Huang, Botong

dc.date.accessioned

2016-06-06T14:35:57Z

dc.date.available

2017-04-15T04:30:03Z

dc.date.issued

2016

dc.department

Computer Science

dc.description.abstract

Cumulon is a system aimed at simplifying the development and deployment of statistical analysis of big data in public clouds. Cumulon allows users to program in their familiar language of matrices and linear algebra, without worrying about how to map data and computation to specific hardware and cloud software platforms. Given user-specified requirements in terms of time, monetary cost, and risk tolerance, Cumulon automatically makes intelligent decisions on implementation alternatives, execution parameters, as well as hardware provisioning and configuration settings -- such as what type of machines and how many of them to acquire. Cumulon also supports clouds with auction-based markets: it effectively utilizes computing resources whose availability varies according to market conditions, and suggests best bidding strategies for them. Cumulon explores two alternative approaches toward supporting such markets, with different trade-offs between system and optimization complexity. Experimental study is conducted to show the efficiency of Cumulon's execution engine, as well as the optimizer's effectiveness in finding the optimal plan in the vast plan space.

dc.identifier.uri

https://hdl.handle.net/10161/12116

dc.subject

Computer science

dc.subject

Computer engineering

dc.subject

cloud computing

dc.subject

matrix data analytics

dc.subject

optimization

dc.subject

parallel execution

dc.subject

spot instance

dc.title

Cumulon: Simplified Matrix-Based Data Analytics in the Cloud

dc.type

Dissertation

duke.embargo.months

10

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Huang_duke_0066D_13283.pdf
Size:
3.15 MB
Format:
Adobe Portable Document Format

Collections