IMMM 2011, The First International Conference on Advances in Information Mining and Management


Mining Cross-document Relationships from Text

Authors:
Petr Knoth
Zdenek Zdrahal

Keywords: text mining; automatic link generation and typing; semantic similarity; digital libraries

Abstract:
The paper argues that automatic link generation and typing methods are needed to find and maintain cross-document links in large and growing textual collections. Such links are important to organise information and to support search and navigation. We present an experimental study on mining cross-document links from a collection of 5000 documents. We identify a set of link types and show that the value of semantic similarity can be used as a distinguishing indicator.

Pages: 55 to 60

Copyright: Copyright (c) IARIA, 2011

Publication date: October 23, 2011

Published in: conference

ISSN: 2326-9332

ISBN: 978-1-61208-162-5

Location: Barcelona, Spain

Dates: from October 23, 2011 to October 29, 2011