DBKDA 2015, The Seventh International Conference on Advances in Databases, Knowledge, and Data Applications


Analysis of String Comparison Methods During De-Duplication Process

Authors:
Maria del Pilar Angeles
Francisco García-Ugalde
Ricardo Valencia
Arturo Nava

Keywords: data matching; de-duplication; record linkage.

Abstract:
This paper compares three string comparison algorithms in terms of the computational resources they use during the record linkage process. The algorithms are Monge-Elkan, Bag Distance, and Edit Distance. The Monge-Elkan method meets all the requirements for implementation and produces reliable results. In addition, its execution time falls within the average of the methods evaluated.
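
To make the three methods named in the abstract concrete, the following is a minimal Python sketch of each one. It is not the authors' implementation. The function names, the whitespace tokenization, and the choice of a normalized edit-distance similarity as the inner measure for Monge-Elkan are all assumptions made for illustration.

```python
from collections import Counter


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum number of single-character
    insertions, deletions, and substitutions turning a into b."""
    prev = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, 1):
        curr = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, 1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def bag_distance(a: str, b: str) -> int:
    """Bag distance: compares the multisets of characters and ignores
    their order. It never exceeds the edit distance."""
    bag_a, bag_b = Counter(a), Counter(b)
    only_a = sum((bag_a - bag_b).values())  # characters left over in a
    only_b = sum((bag_b - bag_a).values())  # characters left over in b
    return max(only_a, only_b)


def edit_similarity(a: str, b: str) -> float:
    """Assumed normalization: maps edit distance to the range [0, 1]."""
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1.0 - edit_distance(a, b) / longest


def monge_elkan(a: str, b: str, sim=edit_similarity) -> float:
    """Monge-Elkan similarity: for each token of a, take its best match
    among the tokens of b, then average those best scores.
    Whitespace tokenization is an assumption of this sketch."""
    tokens_a, tokens_b = a.split(), b.split()
    if not tokens_a or not tokens_b:
        return 0.0
    best = (max(sim(x, y) for y in tokens_b) for x in tokens_a)
    return sum(best) / len(tokens_a)
```

In this sketch, Monge-Elkan is insensitive to token order, so two name fields whose words are swapped still score as a perfect match, while the character-level edit distance would penalize the swap. Note that Monge-Elkan as written here is not symmetric: the score is averaged over the tokens of the first argument only.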

Pages: 57 to 62

Copyright: (c) IARIA, 2015

Publication date: May 24, 2015

Published in: conference

ISSN: 2308-4332

ISBN: 978-1-61208-408-4

Location: Rome, Italy

Dates: from May 24, 2015 to May 29, 2015