DBKDA 2015, The Seventh International Conference on Advances in Databases, Knowledge, and Data Applications
Comparison of Methods Hamming Distance, Jaro, and Monge-Elkan
Authors:
Maria del Pilar Angeles
Adrian Espino-Gamez
Keywords: data matching; de-duplication; record linkage
Abstract:
This paper presents an implementation of a stricter comparison algorithm based on Hamming distance. The algorithm has been enhanced so that it not only determines similarity among substrings but also takes their order into account. Furthermore, we evaluate the quality of data matching achieved with three string similarity functions (Hamming distance, Jaro distance, and Monge-Elkan distance) in terms of precision, recall, F-measure, and execution time.
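For readers unfamiliar with the three similarity functions named in the abstract, the following is a minimal Python sketch of their textbook definitions. It is not the authors' enhanced Hamming algorithm, which the abstract does not specify. The function names, the treatment of unequal-length strings as mismatches in the Hamming variant, whitespace tokenization, and the choice of Jaro as the inner measure for Monge-Elkan are all assumptions made for illustration.

```python
def hamming_similarity(a: str, b: str) -> float:
    """Share of positions holding the same character (textbook Hamming).

    Assumption: positions beyond the shorter string count as mismatches,
    so strings of unequal length can still be compared.
    """
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    matches = sum(x == y for x, y in zip(a, b))
    return matches / longest


def jaro_similarity(a: str, b: str) -> float:
    """Standard Jaro similarity in [0, 1]."""
    if a == b:
        return 1.0
    if not a or not b:
        return 0.0
    # Characters match only if they are equal and within this distance.
    window = max(max(len(a), len(b)) // 2 - 1, 0)
    a_hit = [False] * len(a)
    b_hit = [False] * len(b)
    matches = 0
    for i, ch in enumerate(a):
        lo = max(0, i - window)
        hi = min(len(b), i + window + 1)
        for j in range(lo, hi):
            if not b_hit[j] and b[j] == ch:
                a_hit[i] = b_hit[j] = True
                matches += 1
                break
    if matches == 0:
        return 0.0
    # Matched characters that appear in a different order are
    # counted in pairs as transpositions.
    a_seq = [a[i] for i in range(len(a)) if a_hit[i]]
    b_seq = [b[j] for j in range(len(b)) if b_hit[j]]
    transpositions = sum(x != y for x, y in zip(a_seq, b_seq)) // 2
    return (
        matches / len(a)
        + matches / len(b)
        + (matches - transpositions) / matches
    ) / 3


def monge_elkan(a: str, b: str, inner=jaro_similarity) -> float:
    """Average, over tokens of a, of the best inner score against tokens of b.

    Assumption: tokens are whitespace-separated. The measure is asymmetric.
    """
    tokens_a = a.split()
    tokens_b = b.split()
    if not tokens_a or not tokens_b:
        return 0.0
    best_scores = [max(inner(ta, tb) for tb in tokens_b) for ta in tokens_a]
    return sum(best_scores) / len(tokens_a)
```

As a usage example, `monge_elkan("john smith", "smith john")` scores the reordered name as a perfect match because each token finds an identical partner, whereas `hamming_similarity` on the same pair compares characters position by position and is sensitive to the reordering.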
Pages: 63 to 69
Copyright: Copyright (c) IARIA, 2015
Publication date: May 24, 2015
Published in: conference
ISSN: 2308-4332
ISBN: 978-1-61208-408-4
Location: Rome, Italy
Dates: from May 24, 2015 to May 29, 2015