Home // DBKDA 2015, The Seventh International Conference on Advances in Databases, Knowledge, and Data Applications // View article


Comparison of Methods Hamming Distance, Jaro, and Monge-Elkan

Authors:
Maria del Pilar Angeles
Adrian Espino-Gamez

Keywords: data matching; de-duplication; record linkage

Abstract:
The present paper shows the implementation of a more strict comparison algorithm Hamming Distance, which has been enhanced because it not only determines similarity among substrings, but also takes into consideration their corresponding order. Furthermore, we have carried out an evaluation of quality data matching through the string similarity functions Hamming distance, Jaro distance, and Monge-Elkan distance in terms of precision-recall, f-measure, and execution time.

Pages: 63 to 69

Copyright: Copyright (c) IARIA, 2015

Publication date: May 24, 2015

Published in: conference

ISSN: 2308-4332

ISBN: 978-1-61208-408-4

Location: Rome, Italy

Dates: from May 24, 2015 to May 29, 2015