Home // International Journal On Advances in Software, volume 12, numbers 3 and 4, 2019 // View article
Creating and Evaluating Data-Driven Ontologies
Authors:
Maaike H.T. de Boer
Jack P.C. Verhoosel
Keywords: Ontologies; Machine Learning; NLP; Word2vec; Ontology Learning; F1 score
Abstract:
Automatically creating data-driven ontologies is a challenge but it can save time and resources. In this paper, eight data-driven algorithms are compared to create ontologies, four ontologies based on documents and four based on keywords, on three different document sets. We evaluate the performance using three different evaluation metrics based on nodes, weights and relations. Results show that 1) keyword-based methods are in general better than document-based methods; 2) a co-occurrence-based algorithm is the best document-based method; 3) the evaluation metrics give useful insight, but need to be enhanced in future work. It is advised to a) use the created ontologies as a head start in an ontology creation session, but not use the ontologies as created; b) use word2vec to generate an ontology in a generic domain, whereas the co-occurrences algorithm should be used in specific domain.
Pages: 300 to 309
Copyright: Copyright (c) to authors, 2019. Used with permission.
Publication date: December 30, 2019
Published in: journal
ISSN: 1942-2628