Home // DEPEND 2015, The Eighth International Conference on Dependability // View article
On Handling Redundancy for Failure Log Analysis of Cluster Systems
Authors:
Nentawe Gurumdimma
Arshad Jhumka
Maria Liakata
Edward Chuah
James Browne
Keywords: Cluster Log Data; Unsupervised learning; Compres- sion; Levenshtein distance; filtering
Abstract:
System event logs contain information that capture the sequence of events occurring in the system. They are often the primary source of information from large-scale distributed systems, such as cluster systems, which enable system administrators to determine the causes and detect system failures. Due to the complex interactions between the system hardware and software components, the system event logs are typically huge in size, comprising streams of interleaved log messages. However, only a small fraction of those log messages are relevant for analysis. We thus develop a novel, generic log compression or filtering (i.e., redundancy removal) technique to address this problem. We apply the technique over three different log files obtained from two different production systems and validate the technique through the application of an unsupervised failure detection approach. Our results are positive: (i) our technique achieves good compression, (ii) log analysis yields better results for our filtering method than normal approach.
Pages: 7 to 14
Copyright: Copyright (c) IARIA, 2015
Publication date: August 23, 2015
Published in: conference
ISSN: 2308-4324
ISBN: 978-1-61208-429-9
Location: Venice, Italy
Dates: from August 23, 2015 to August 28, 2015