ICDS 2013, The Seventh International Conference on Digital Society


Extracting Occupational Therapy Concepts to Develop Domain Ontology

Authors:
Ahlam Sawsaa
Joan Lu
Christopher Newman
Helen Ribchester

Keywords: Ontology; Information extraction; Regular expression; Natural Language Processing.

Abstract:
Recently, unstructured data on the World Wide Web has generated significant interest in extracting information from text, emails, web pages, reports and research papers in their raw form. More importantly, extracting information from a specific domain using distributed corpora from the World Wide Web is a vital step towards creating corpus annotation. This paper describes a method of annotation, based on Occupational Therapy (OT) concepts, to build a domain ontology using Natural Language Processing (NLP) technology. We used Java Annotation Patterns Engine (JAPE) grammar to support regular expression matching and thus annotate OT concepts using the GATE Developer tool. This speeds up the otherwise time-consuming development of the ontology, which is important for domain experts facing time constraints and high workloads. The rules provide significant results: pattern matching of OT concepts based on the lookup list produced 403 correct concepts, and accuracy was generally high. Using NLP techniques is a good approach to reducing the domain expert’s work, and the results can be evaluated.

Pages: 65 to 73

Copyright: Copyright (c) IARIA, 2013

Publication date: February 24, 2013

Published in: conference

ISSN: 2308-3956

ISBN: 978-1-61208-249-3

Location: Nice, France

Dates: from February 24, 2013 to March 1, 2013