ICIW 2013, The Eighth International Conference on Internet and Web Applications and Services


Compressing Large Size Files on the Web in MapReduce

Authors:
Sergio De Agostino

Keywords: web computing, mapreduce framework, lossless compression, string factorization

Abstract:
Lempel-Ziv (LZ) techniques are the most widely used for lossless file compression. LZ compression basically comprises two methods, called LZ1 and LZ2. The LZ1 method is the one employed by the family of Zip compressors, while the LZW compressor implements the LZ2 method, which is slightly less effective but twice as fast. When the file size is large, both methods can be implemented on a distributed system, guaranteeing linear speed-up, scalability and robustness. With Web computing, the MapReduce model of distributed processing is emerging as the most widely used. In this framework, we present and make a comparative analysis of different implementations of LZ compression. An alternative to standard versions of the Lempel-Ziv method is proposed as the most efficient one for compressing large files.
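
As a rough illustration of the idea in the abstract, the sketch below splits a file into blocks, compresses each block independently with a textbook LZW coder (an LZ2-style method) in a map-like step, and joins the decoded blocks in a reduce-like step. It is not the implementation analysed in the paper: the function names, the `block_size` parameter and the use of a local thread pool in place of a real MapReduce cluster are all assumptions made for illustration.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import List


def lzw_compress(data: bytes) -> List[int]:
    """Textbook LZW: emit the dictionary index of each longest known phrase."""
    table = {bytes([i]): i for i in range(256)}
    current = b""
    codes = []
    for value in data:
        candidate = current + bytes([value])
        if candidate in table:
            current = candidate
        else:
            codes.append(table[current])
            table[candidate] = len(table)  # learn the new phrase
            current = bytes([value])
    if current:
        codes.append(table[current])
    return codes


def lzw_decompress(codes: List[int]) -> bytes:
    """Inverse of lzw_compress; rebuilds the same dictionary while decoding."""
    if not codes:
        return b""
    table = {i: bytes([i]) for i in range(256)}
    previous = table[codes[0]]
    out = [previous]
    for code in codes[1:]:
        if code in table:
            entry = table[code]
        elif code == len(table):
            # Phrase defined by the step being decoded right now.
            entry = previous + previous[:1]
        else:
            raise ValueError("invalid LZW code")
        out.append(entry)
        table[len(table)] = previous + entry[:1]
        previous = entry
    return b"".join(out)


def compress_blocks(data: bytes, block_size: int = 1 << 16) -> List[List[int]]:
    """Map-like step (hypothetical): compress fixed-size blocks independently."""
    blocks = [data[i:i + block_size] for i in range(0, len(data), block_size)]
    with ThreadPoolExecutor() as pool:
        return list(pool.map(lzw_compress, blocks))


def decompress_blocks(compressed: List[List[int]]) -> bytes:
    """Reduce-like step (hypothetical): decode each block and concatenate."""
    return b"".join(lzw_decompress(codes) for codes in compressed)
```

Because each block starts from a fresh dictionary, blocks can be processed on separate nodes with no communication, at the cost of some compression effectiveness compared with coding the whole file sequentially.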

Pages: 135 to 140

Copyright: Copyright (c) IARIA, 2013

Publication date: June 23, 2013

Published in: conference

ISSN: 2308-3972

ISBN: 978-1-61208-280-6

Location: Rome, Italy

Dates: from June 23, 2013 to June 28, 2013