Home // PATTERNS 2019, The Eleventh International Conference on Pervasive Patterns and Applications // View article


Pattern Matching in the Era of Big Data: A Benchmark of Cluster Quality Metrics

Authors:
Ole Kristian Ekseth
Per-Jarle Furnes
Svein-Olaf Hvasshovd

Keywords: patterns; clustering; similarity metrics; data analysis.

Abstract:
In today’s quest for knowledge, there is a need for accurate and fast measures for pattern matching. While numerous new metrics and algorithms are published every year, researchers are unaware of which metric to choose. There does not exist an established strategy for pattern matching of cluster algorithms, which may explain why new hypothesis and algorithms are often forgotten. In this work, we address this issue. The paper presents a new micro-benchmark for automated evaluation of pattern matching algorithms. From key characteristics of training data, the micro-benchmark deduce fast and accurate cluster quality metrics, hence enabling pattern searches in big data. The micro-benchmark address key issues in pattern analysis: while recent algorithms improve prediction accuracy by less than 2x, there is a 5x+ inaccuracy in established pattern matching algorithms. The evaluation of 100+ real-life data-sets reveals how the micro-benchmark manages to identify patterns which are otherwise hidden, hence paving the ground for improved quality in the field of big-data pattern matching.

Pages: 23 to 28

Copyright: Copyright (c) IARIA, 2019

Publication date: May 5, 2019

Published in: conference

ISSN: 2308-3557

ISBN: 978-1-61208-708-5

Location: Venice, Italy

Dates: from May 5, 2019 to May 9, 2019