CLOUD COMPUTING 2016, The Seventh International Conference on Cloud Computing, GRIDs, and Virtualization


Data Locality via Coordinated Caching for Distributed Processing

Authors:
Max Fischer
Eileen Kuehn

Keywords: Cooperative caching; Coordinated caching; Distributed caching; Batch production systems; Distributed processing

Abstract:
Modern data analysis methods often rely on data locality. Processing applications are executed directly where data is stored. Frameworks enabling this require their own specialised environment and modifications. We propose an alternative approach using coordinated caching integrated into classic batch systems. A custom middleware layer provides relevant data locally on worker nodes. Most importantly, no modifications are needed to add data locality to existing workflows. However, considerably more factors must be addressed by distributed caches compared to isolated ones. We investigated our approach with theoretical modelling, simulations, and a prototype implementation. Our evaluations show promising results for both applicability and performance.

Pages: 113 to 118

Copyright: Copyright (c) IARIA, 2016

Publication date: March 20, 2016

Published in: conference

ISSN: 2308-4294

ISBN: 978-1-61208-460-2

Location: Rome, Italy

Dates: from March 20, 2016 to March 24, 2016