Home // SEMAPRO 2013, The Seventh International Conference on Advances in Semantic Processing // View article


Parallel Search Through Statistical Semantic Spaces for Querying Big RDF Data

Authors:
Alexey Cheptsov
Axel Tenschert

Keywords: Statistical Semantics; Random Indexing; Parallelization; High Performance Computing; Message-Passing Interface;JUNIPER

Abstract:
With billions of triples in the Linked Open Data cloud, which continues to grow exponentially, challenging tasks start to emerge related to the exploitation and reasoning of Web data. A considerable amount of work has been done in the area of using Information Retrieval (IR) methods to address these problems. However, although applied models work on the Web scale, they downgrade the semantics contained in an RDF graph by observing each physical resource as a ’bag of words (URIs/literals)’. Distributional statistic methods can address this problem by capturing the structure of the graph more efficiently. However, these methods are computationally expensive. In this paper, we describe the parallelization algorithm of one such method (Random Indexing) based on the Message-Passing Interface technology. Our evaluation results show super linear improvement.

Pages: 1 to 6

Copyright: Copyright (c) IARIA, 2013

Publication date: September 29, 2013

Published in: conference

ISSN: 2308-4510

ISBN: 978-1-61208-293-6

Location: Porto, Portugal

Dates: from September 29, 2013 to October 3, 2013