Home // CLOUD COMPUTING 2013, The Fourth International Conference on Cloud Computing, GRIDs, and Virtualization // View article
K Means of Cloud Computing: MapReduce, DVM, and Windows Azure
Authors:
Lin Gu
Zhonghua Sheng
Zhiqiang Ma
Xiang Gao
Charles Zhang
Yaohui Jin
Keywords: Cloud computing; k-means; parallel programming; MapReduce; DISA; big data processing
Abstract:
Cloud-based systems and the datacenter computing environment present a series of challenges to system designers for supporting massively concurrent computation on clusters with commodity hardware. The platform software should abstract the unreliable but highly provisioned hardware to provide a high-performance platform for a diversity of concurrent programs processing potentially very large data sets. Toward this goal, a number of solutions are designed or proposed. Among these products and systems, we elect three technologies, MapReduce/Hadoop, DVM, and Windows Azure, as representatives of three different approaches to constructing the infrastructure and instructing the programming in the cloud. We empirically study these technologies using a well-known and widely used application, k-means, and analyze their performance data in relation with the abstraction layers they establish. The implementations of k-means on the three platforms are presented with sufficient details to show the design patterns with these technologies. We analyze the evaluation results in the context of the design goals and constraints of the technologies, and show that the instruction-level abstraction can provide flexible programming capability as well as high performance.
Pages: 13 to 18
Copyright: Copyright (c) IARIA, 2013
Publication date: May 27, 2013
Published in: conference
ISSN: 2308-4294
ISBN: 978-1-61208-271-4
Location: Valencia, Spain
Dates: from May 27, 2013 to June 1, 2013