Home // BIOTECHNO 2011, The Third International Conference on Bioinformatics, Biocomputational Systems and Biotechnologies // View article


Separating Drivers from Passengers in Whole Genome Analysis: Identification of Combinatorial Effects of Genes by Mining Knowledge Sources

Authors:
Stephen Anthony
Vitali Sintchenko
Enrico Coiera

Keywords: structural bioinformatics; whole genome analysis; text mining; infectious diseases; knowledge discovery

Abstract:
This study aimed to develop a new informatics platform for the discovery, recovery and multi-level analysis of the effects of individual genes and multiple gene combinations on pathophenotypes of bacteria. Natural language processing algorithms were employed to extract gene-disease associations from PubMed literature and annotated genomes of bacteria with epidemic potential. From these associations gene virulence profiles were generated enabling the comparison of gene signatures within and across genomes. It allowed the identification of virulence genes and construction of their association networks as well as the detection of knowledge gaps. This proof-of-concept study confirmed the feasibility of our original approach for integrating bacterial genome level knowledge with published observations from clinical settings.

Pages: 6 to 11

Copyright: Copyright (c) IARIA, 2011

Publication date: May 22, 2011

Published in: conference

ISSN: 2308-4383

ISBN: 978-1-61208-137-3

Location: Venice/Mestre, Italy

Dates: from May 22, 2011 to May 27, 2011