ICDS 2013, The Seventh International Conference on Digital Society
Extracting Occupational Therapy Concepts to Develop Domain Ontology
Authors:
Ahlam Sawsaa
Joan Lu
Christopher Newman
Helen Ribchester
Keywords: Ontology; Information extraction; Regular expression; Natural Language Processing.
Abstract:
Unstructured data on the World Wide Web has recently generated significant interest in extracting information from text, emails, web pages, reports and research papers in their raw form. More importantly, extracting information for a specific domain from distributed corpora on the World Wide Web is a vital step towards corpus annotation. This paper describes an annotation method, based on Occupational Therapy (OT) concepts, for building a domain ontology using Natural Language Processing (NLP) technology. We used Java Annotation Patterns Engine (JAPE) grammar to support regular expression matching and thus annotate OT concepts using the GATE Developer tool. This speeds up the otherwise time-consuming development of the ontology, which is important for domain experts facing time constraints and high workloads. The rules produce significant results: pattern matching of OT concepts based on the lookup list yielded 403 correct concepts, and accuracy was generally high. Using NLP techniques is a good approach to reducing the domain expert’s workload, and the results can be evaluated.
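The lookup-list matching mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' JAPE grammar and does not use GATE; it is a plain Python analogue that assumes a small, hypothetical lookup list (`OT_LOOKUP`) and a made-up sample sentence, and it only shows the general idea of annotating text spans that match listed concepts.

```python
import re

# Hypothetical lookup list for illustration only; the paper's actual
# list of OT concepts is not reproduced here.
OT_LOOKUP = [
    "occupational therapy",
    "occupational therapist",
    "activities of daily living",
    "splint",
]


def build_pattern(terms):
    """Compile one case-insensitive regex that matches any listed term."""
    # Try longer terms first so multi-word concepts are preferred
    # over shorter terms that share a prefix.
    ordered = sorted(terms, key=len, reverse=True)
    alternation = "|".join(re.escape(term) for term in ordered)
    return re.compile(r"\b(?:" + alternation + r")\b", re.IGNORECASE)


def annotate(text, pattern):
    """Return (start, end, matched_text) for every concept found in text."""
    return [(m.start(), m.end(), m.group(0)) for m in pattern.finditer(text)]


PATTERN = build_pattern(OT_LOOKUP)
SAMPLE = (
    "An occupational therapist may fit a splint to support "
    "activities of daily living."
)
ANNOTATIONS = annotate(SAMPLE, PATTERN)

for start, end, concept in ANNOTATIONS:
    print(start, end, concept)
```

Each tuple plays the role of an annotation: character offsets plus the matched concept, which a domain expert could then review before adding the concept to the ontology.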
Pages: 65 to 73
Copyright: Copyright (c) IARIA, 2013
Publication date: February 24, 2013
Published in: conference
ISSN: 2308-3956
ISBN: 978-1-61208-249-3
Location: Nice, France
Dates: from February 24, 2013 to March 1, 2013