Cumulon: Simplified Matrix-Based Data Analytics in the Cloud

Loading...
Thumbnail Image

Date

2016

Advisors

Babu, Shivnath

Journal Title

Journal ISSN

Volume Title

Repository Usage Stats

278
views
328
downloads

Abstract

Cumulon is a system aimed at simplifying the development and deployment of statistical analysis of big data in public clouds. Cumulon allows users to program in their familiar language of matrices and linear algebra, without worrying about how to map data and computation to specific hardware and cloud software platforms. Given user-specified requirements in terms of time, monetary cost, and risk tolerance, Cumulon automatically makes intelligent decisions on implementation alternatives, execution parameters, as well as hardware provisioning and configuration settings -- such as what type of machines and how many of them to acquire. Cumulon also supports clouds with auction-based markets: it effectively utilizes computing resources whose availability varies according to market conditions, and suggests best bidding strategies for them. Cumulon explores two alternative approaches toward supporting such markets, with different trade-offs between system and optimization complexity. Experimental study is conducted to show the efficiency of Cumulon's execution engine, as well as the optimizer's effectiveness in finding the optimal plan in the vast plan space.

Description

Provenance

Citation

Citation

Huang, Botong (2016). Cumulon: Simplified Matrix-Based Data Analytics in the Cloud. Dissertation, Duke University. Retrieved from https://hdl.handle.net/10161/12116.

Collections


Dukes student scholarship is made available to the public using a Creative Commons Attribution / Non-commercial / No derivative (CC-BY-NC-ND) license.