SEMAPRO 2010, The Fourth International Conference on Advances in Semantic Processing
On the Way towards Standardized Semantic Corpora for Development of Semantic Analysis Systems
Authors:
Ivan Habernal
Miloslav Konopík
Keywords: semantic analysis; semantic corpus; ATIS
Abstract:
One of the main means of achieving progress in science is cooperation. It is advantageous if the cooperation is carried out among teams at different institutions. In semantics, the basic necessity for cooperation is a standardized annotated corpus. Such a corpus allows the whole research community to share individual findings, because different systems can then be tested under the same conditions. Unfortunately, there is no standardized semantic corpus for the Czech language, and many other languages suffer from the same problem. Moreover, the ATIS corpus set is more than ten years old and does not meet today's trends in semantic annotation. In this article we summarize the problems of the ATIS corpus set as well as the problems encountered during our research. As a result, we provide a methodology to avoid such problems. For practical deployment of the methodology we offer a set of annotation tools. The purpose of this article is to discuss the issues of semantic annotation and to bring together other teams to create standardized shared semantic corpora.
Pages: 96 to 99
Copyright: Copyright (c) IARIA, 2010
Publication date: October 25, 2010
Published in: conference
ISSN: 2308-4510
ISBN: 978-1-61208-104-5
Location: Florence, Italy
Dates: from October 25, 2010 to October 30, 2010