Obesity Level Estimation Software based on Decision Trees
- 1 Corporación Universitaria Americana, Colombia
- 2 Universidad de la Costa, Colombia
Abstract
Obesity has become a global epidemic whose prevalence has doubled since 1980, with serious health consequences for children, teenagers and adults. Because the problem has grown steadily, new studies on childhood obesity appear every day, especially studies that look for influencing factors and for ways to predict the onset of the condition from those factors. In this study, the authors applied the SEMMA data mining methodology to select, explore and model the data set. Three methods were then selected: Decision Trees (J48), Bayesian networks (Naïve Bayes) and Logistic Regression (Simple Logistic). J48 obtained the best results on the metrics precision, recall, TP rate and FP rate. Finally, software was built with the Weka library to train and use the selected method. The results confirmed that the Decision Trees technique has the best precision rate (97.4%), improving on the results of previous studies with a similar background.
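The four metrics named in the abstract can all be derived from a classifier's confusion matrix. The following is a minimal sketch in plain Java of how such per-class values could be computed; it is not the authors' software and does not use the Weka API. The class name `ConfusionMetrics`, the convention that rows are actual classes and columns are predicted classes, the choice to return 0.0 when a denominator is zero, and the 3-class matrix in `main` are all assumptions made for illustration, not data from the study.

```java
import java.util.Locale;

/**
 * Per-class metrics from a confusion matrix.
 * Assumed convention: matrix[actual][predicted] holds instance counts.
 */
final class ConfusionMetrics {
    private final int[][] matrix;
    private final int total;

    ConfusionMetrics(int[][] matrix) {
        this.matrix = matrix;
        int sum = 0;
        for (int[] row : matrix) {
            for (int cell : row) {
                sum += cell;
            }
        }
        this.total = sum;
    }

    /** Instances of class c that were predicted as c. */
    int truePositives(int c) {
        return matrix[c][c];
    }

    /** Instances of class c that were predicted as some other class. */
    int falseNegatives(int c) {
        int rowSum = 0;
        for (int cell : matrix[c]) {
            rowSum += cell;
        }
        return rowSum - matrix[c][c];
    }

    /** Instances of other classes that were predicted as c. */
    int falsePositives(int c) {
        int colSum = 0;
        for (int[] row : matrix) {
            colSum += row[c];
        }
        return colSum - matrix[c][c];
    }

    /** Instances of other classes that were not predicted as c. */
    int trueNegatives(int c) {
        return total - truePositives(c) - falseNegatives(c) - falsePositives(c);
    }

    /** Precision = TP / (TP + FP). */
    double precision(int c) {
        return ratio(truePositives(c), truePositives(c) + falsePositives(c));
    }

    /** Recall = TP / (TP + FN); the TP rate is the same quantity. */
    double recall(int c) {
        return ratio(truePositives(c), truePositives(c) + falseNegatives(c));
    }

    /** FP rate = FP / (FP + TN). */
    double fpRate(int c) {
        return ratio(falsePositives(c), falsePositives(c) + trueNegatives(c));
    }

    // Assumption: an undefined ratio (zero denominator) is reported as 0.0.
    private static double ratio(int numerator, int denominator) {
        return denominator == 0 ? 0.0 : (double) numerator / denominator;
    }

    public static void main(String[] args) {
        // Hypothetical counts for three classes; not taken from the study.
        int[][] counts = {
            {8, 1, 1},
            {2, 7, 1},
            {0, 2, 8}
        };
        ConfusionMetrics metrics = new ConfusionMetrics(counts);
        for (int c = 0; c < counts.length; c++) {
            System.out.printf(Locale.ROOT,
                "class %d: precision=%.3f recall/TP rate=%.3f FP rate=%.3f%n",
                c, metrics.precision(c), metrics.recall(c), metrics.fpRate(c));
        }
    }
}
```

Because the TP rate and recall are the same ratio, a comparison across classifiers on these four metrics effectively rests on three independent quantities per class.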
DOI: https://doi.org/10.3844/jcssp.2019.67.77
Copyright: © 2019 Eduardo De-La-Hoz-Correa, Fabio E. Mendoza-Palechor, Alexis De-La-Hoz-Manotas, Roberto C. Morales-Ortega and Sánchez Hernández Beatriz Adriana. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Keywords
- Obesity
- Data Mining
- SEMMA
- Decision Trees
- Naive Bayes
- Logistic Regression
- Weka
- Java