Home // MMEDIA 2011, The Third International Conferences on Advances in Multimedia // View article


A Clustering-based Approach to Web Image Context Extraction

Authors:
Sadet Alcic
Stefan Conrad

Keywords: Image Context Extraction, Web Content Mining

Abstract:
Images on the Web come along with textual descriptions that are valuable for different applications, such as image annotation, clustering of images, image categorization, etc. But usually Web pages are poorly structured and cluttered with contents of different topics, which hinder the accurate detection of the image context. Existing approaches are based on heuristic rules and thus cannot handle the variety of documents on the Web. In this paper, we introduce a novel approach to image context extraction, building on a Web content distance measure. Utilizing this distance measure, the addressed problem can be reduced to a content clustering problem where an image is associated with the textual contents of the cluster it belongs to. Our evaluation studies confirm the validity and quality of the proposed method and demonstrate its applicability to the Web.

Pages: 74 to 79

Copyright: Copyright (c) IARIA, 2011

Publication date: April 17, 2011

Published in: conference

ISSN: 2308-4448

ISBN: 978-1-61208-129-8

Location: Budapest, Hungary

Dates: from April 17, 2011 to April 22, 2011