Home // CLOUD COMPUTING 2012, The Third International Conference on Cloud Computing, GRIDs, and Virtualization // View article
Datanode Optimization in Distributed Storage Systems
Authors:
Xiaokang Fan
Shanshan Li
Xiangke Liao
Lei Wang
Chenlin Huang
Jun Ma
Keywords: distributed storage system; file-level cache; co-location
Abstract:
Distributed storage systems designed for small files have been developing rapidly, like Facebook’s hayStack, Twitter’s Cassandra and so on. But, under our observation, there are still some drawbacks in these systems. For example, they do not have cache specified for files and have not taken the relationship inherent in application-specific knowledge between files into consideration. We propose a file-level cache on datanode and co-location of affinitive files based on application-specific knowledge. We use a synthetic data set and a real world trace to evaluate our optimization. The file-level cache and co-location of affinitive files together can improve system’s throughput by 20%-50%.
Pages: 247 to 252
Copyright: Copyright (c) IARIA, 2012
Publication date: July 22, 2012
Published in: conference
ISSN: 2308-4294
ISBN: 978-1-61208-216-5
Location: Nice, France
Dates: from July 22, 2012 to July 27, 2012