Home // ICONS 2013, The Eighth International Conference on Systems // View article
On the Estimation of Missing Data in Incomplete Databases: Autoregressive Bayesian Networks
Authors:
Pablo H. Ibargüengoytia
Uriel A. García
Javier Herrera Vega
Pablo Hernández Leal
Eduardo Morales
L. Enrique Sucar
Felipe Orihuela Espina
Keywords: dynamic probabilistic graphical models; incomplete data series; value estimation; knowledge discovery; autoregressive models
Abstract:
Missing data can be estimated by means of interpolation, time series modelling, or exploiting statistically dependent information. The limits of when one approach is preferable to the alternatives have not been explored, but are likely to be a compromise between a signal autoregressive information, availability of future observations, stationary behaviour and the strength of the dependence with concomitant information. This paper takes a first step towards clarifying dataset characteristics delimiting the realm of application for each technique. In addition, this paper introduces autoregressive Bayesian networks (AR-BN), a variant of Dynamic Bayesian Networks for completing databases which exploits latent variable relations while still benefitting from autoregressive information of the variable being filled. Using AR-BN, new estimated values are calculated using inference in the dynamic model. Our results unveil how the interplay between the variable autoregressive information and the variable relationship to others in the dataset is critical to selecting the optimal data estimation technique. AR-BN appears as a good candidate ensuring a consistent performance across scenarios, datasets and error metrics
Pages: 111 to 116
Copyright: Copyright (c) IARIA, 2013
Publication date: January 27, 2013
Published in: conference
ISSN: 2308-4243
ISBN: 978-1-61208-246-2
Location: Seville, Spain
Dates: from January 27, 2013 to February 1, 2013