Research Article Open Access

HEURISTIC DISCRETIZATION METHOD FOR BAYESIAN NETWORKS

Mariana D.C. Lima1, Silvia M. Nassar1, Pedro Ivo R.B.G. Rodrigues1, Paulo J. Freitas Filho1 and Carlos M.C. Jacinto2
  • 1 Universidade Federal de Santa Catarina, Brazil
  • 2 , Brazil

Abstract

Bayesian Network (BN) is a classification technique widely used in Artificial Intelligence. Its structure is a Direct Acyclic Graph (DAG) used to model the association of categorical variables. However, in cases where the variables are numerical, a previous discretization is necessary. Discretization methods are usually based on a statistical approach using the data distribution, such as division by quartiles. In this article we present a discretization using a heuristic that identifies events called peak and valley. Genetic Algorithm was used to identify these events having the minimization of the error between the estimated average for BN and the actual value of the numeric variable output as the objective function. The BN has been modeled from a database of Bit’s Rate of Penetration of the Brazilian pre-salt layer with 5 numerical variables and one categorical variable, using the proposed discretization and the division of the data by the quartiles. The results show that the proposed heuristic discretization has higher accuracy than the quartiles discretization.

Journal of Computer Science
Volume 10 No. 5, 2014, 869-878

DOI: https://doi.org/10.3844/jcssp.2014.869.878

Submitted On: 6 December 2013 Published On: 18 January 2014

How to Cite: Lima, M. D., Nassar, S. M., Rodrigues, P. I. R., Filho, P. J. F. & Jacinto, C. M. (2014). HEURISTIC DISCRETIZATION METHOD FOR BAYESIAN NETWORKS. Journal of Computer Science, 10(5), 869-878. https://doi.org/10.3844/jcssp.2014.869.878

  • 3,379 Views
  • 2,722 Downloads
  • 9 Citations

Download

Keywords

  • Bayesian Network
  • Discretization
  • Global Optimization
  • Genetic Algorithm
  • Heuristic