Home // IMMM 2015, The Fifth International Conference on Advances in Information Mining and Management // View article


Augmenting Data Files with Semantics for Coherency, Extensibility, and Reproducibility

Authors:
John McCloud
Subhasish Mazumdar

Keywords: Knowledge Representation; Comprehension; Semantics; Bioinformatics

Abstract:
Data files have traditionally been thought of as the input and output of programs, as well as their intermediaries. When the need for usage of data files by a diverse set of consumers and needs was recognized, it was addressed primarily by the addition of metadata. This metadata is structured data, providing guidance regarding the use of the data. Unfortunately, this approach has proven inadequate for the myriad applications of today. We posed two questions of a very common and popular data file standard in bioinformatics. First, are the conclusions presented in such a file verifiable? Second, can one use the data to test for alternative conclusions? Our answers for both questions were negative. In this paper, we outline the problems we found and propose a remedy. While we have used bioinformatics as a case study, our results are more general.

Pages: 54 to 60

Copyright: Copyright (c) IARIA, 2015

Publication date: June 21, 2015

Published in: conference

ISSN: 2326-9332

ISBN: 978-1-61208-415-2

Location: Brussels, Belgium

Dates: from June 21, 2015 to June 26, 2015