Home // IMMM 2015, The Fifth International Conference on Advances in Information Mining and Management // View article
An Extensible Conceptual Model for Tabular Scientific Datasets
Authors:
Javad Chamanara
Michael Owonibi
Alsayed Algergawy
Roman Gerlach
Keywords: Scientific data; Dataset structure; Biodiversity data
Abstract:
There is a proliferation of datasets generated by various scientists of different scientific disciplines. Therefore, there is a growing need to construct and develop platforms that enable scientists to capture, exchange, process, and interpret data for immediate use, as well as to store and manage data to support future reuse. Modeling and organizing data within such platforms are key challenges. To this end, in this paper, we introduce the dataset model of the BExIS 2 platform and how data can be organized inside the model. In particular, we describe the anatomy of a general purpose tabular dataset, which consists of data tuples to represent the table rows and data cells that are compound objects holding the obtained values and their auxiliary information. The structure of datasets is defined and applied separately in order to factor out shared concepts such as unit of measurement, methodology, data type, valid and missing values, processing functions and so on. The datasets are extensible in multiple ways and can be annotated on various levels utilizing taxonomies, ontologies, and custom metadata structures
Pages: 72 to 76
Copyright: Copyright (c) IARIA, 2015
Publication date: June 21, 2015
Published in: conference
ISSN: 2326-9332
ISBN: 978-1-61208-415-2
Location: Brussels, Belgium
Dates: from June 21, 2015 to June 26, 2015