Development of Big Data Classifier for Biomedicine Early Diagnosis: An Experimental Approach Using Machine Learning Methods
- 1 Department of Information Systems, Faculty of College of Information and Communications Technology, West Visayas State University, La Paz, Iloilo City, Philippines
- 2 Division of Computer Science, Faculty of College of Information and Communications Technology, West Visayas State University, La Paz, Iloilo City, Philippines
- 3 Division of Information Technology, Faculty of College of Information and Communications Technology, West Visayas State University, La Paz, Iloilo City, Philippines
- 4 Division of Entertainment, Multimedia and Computing, Faculty of College of Information and Communications Technology, West Visayas State University, La Paz, Iloilo City, Philippines
- 5 Division of Computer Science, Faculty of College of Information and Communications Technology, West Visayas State University, La Paz, Iloilo City, Philippines
Abstract
In the fast-phase world, data availability is abundant due to a rapid adaptation increase of big data technologies. Large amounts of data have been generated and collected at an unprecedented speed and scale, introducing a revolution in medical research practices for biomedicine informatics. Thus, there is an immense demand for statistically rigorous approaches, especially in the medical diagnosis discipline. Therefore, this study utilized the Bayesian Belief Network (BBN) for feature selection, which identifies relevant features from a larger set of attributes and employs a stratification for the Stochastic Gradient Descent (SGD) classifier in the classifying of breast cancer on the publicly available machine learning repository at the University of California, Irvine (UCI) such, breast cancer Wisconsin and Coimbra breast cancer datasets. The experimental approach of using BBN as feature selection achieved 0.95% coincidence. Thus, a stratified Stochastic Gradient Descent (SGD) was employed to build a classification model to validate the coincidence. Our proposed modeling classifier approach reached novelty 98%, which improved by 7% compared to the previous works. Furthermore, this study presents a web-based application, a prototype type, to employ the proposed classifier model for breast cancer diagnosis. This study expects to provide a source of confidence and satisfaction for medical physicians to use decision-support tools.
DOI: https://doi.org/10.3844/jcssp.2024.379.388
Copyright: © 2024 Ma Beth Solas Concepcion, Bobby Dioquino Gerardo, Frank Ibañez Elijorde, Joel Traifalgar De Castro and Nerilou Bermudez Dela Cruz. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
- 1,345 Views
- 788 Downloads
- 0 Citations
Download
Keywords
- Biomedicine Diagnosis Application
- Big Data
- Data Mining
- Classification Algorithm
- Bayesian Belief Network