International Journal On Advances in Software, volume 6, numbers 3 and 4, 2013
A New Representation of WordNet® using Graph Databases On-Disk and In-Memory
Authors:
Khaled Nagi
Keywords: WordNet®; semantic relationships; graph databases; storage models; Neo4j; on-disk and in-memory DBMS; performance analysis
Abstract:
WordNet® is one of the most important resources in computational linguistics. This semantically related database of English terms is widely used in text analysis and retrieval, which are typical features of social networks and other modern Web 2.0 applications. Under the hood, WordNet® can be seen as a kind of read-only social network relating its language terms. In our work, we implement a new storage technique for WordNet® based on graph databases. Graph databases are a major pillar of the NoSQL movement, with many emerging products such as Neo4j. In this extended paper, we present two new graph data models for the WordNet® dictionary. We use the graph database management system Neo4j and deploy the models both on-disk and in-memory. We analyze their performance and compare them to traditional storage models based on native file systems and relational database management systems. With this contribution, we also validate the applicability of modern graph databases in new areas besides the typical large-scale social networks with several hundreds of millions of nodes.
Pages: 298 to 308
Copyright: Copyright (c) to authors, 2013. Used with permission.
Publication date: December 31, 2013
Published in: journal
ISSN: 1942-2628