Publication: Random Forest Model Based on Machine Learning for Early Detection of Diabetes
| dc.contributor.author | Rubio-Paucar, Inoc | |
| dc.contributor.author | Yactayo-Arias, Cesar | |
| dc.contributor.author | Andrade-Arenas, Laberiano | |
| dc.date.accessioned | 2025-09-05T16:31:37Z | |
| dc.description.abstract | Diabetes mellitus is growing in prevalence worldwide and represents a significant public health challenge. Despite the availability of specific treatments, it is imperative to develop innovative strategies that optimize early detection and management of the disease. This research aims to develop a model for the early detection of diabetes using the Random Forest algorithm, following the Knowledge Discovery in Databases (KDD) methodology, which comprises the phases of selection, preprocessing, transformation, data mining, and interpretation and evaluation. The dataset includes 520 randomly selected patient records. The model achieved robust performance, with an accuracy of 85%, sensitivity of 75%, and an F1-score of 78%, indicating an adequate balance between precision and sensitivity. Specificity was 78%, while the area under the ROC curve (AUC) reached 86%, demonstrating a high discriminative ability between positive and negative cases. The balanced accuracy was 82%, and the Matthews correlation coefficient (MCC) registered a value of 0.72, confirming the strength and reliability of the model even in the presence of class imbalance. These results demonstrate the effectiveness of the machine learning-based approach for the early detection of diabetes mellitus, with potential application in clinical decision support systems. © 2025 Elsevier B.V., All rights reserved. | |
| dc.identifier.doi | 10.14569/IJACSA.2025.01606103 | |
| dc.identifier.scopus | 2-s2.0-105009687905 | |
| dc.identifier.uri | https://cris.uwiener.edu.pe/handle/001/78 | |
| dc.identifier.uuid | 4f53466f-a952-4d17-9c65-6aa857634388 | |
| dc.language.iso | en | |
| dc.publisher | Science and Information Organization | |
| dc.relation.citationissue | 6 | |
| dc.relation.citationvolume | 16 | |
| dc.relation.ispartofseries | International Journal of Advanced Computer Science and Applications | |
| dc.relation.issn | 21565570 | |
| dc.rights | http://purl.org/coar/access_right/c_14cb | |
| dc.title | Random Forest Model Based on Machine Learning for Early Detection of Diabetes | |
| dc.type | http://purl.org/coar/resource_type/c_2df8fbb1 | |
| dspace.entity.type | Publication | |
| oaire.citation.endPage | 1063 | |
| oaire.citation.startPage | 1051 |
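
The abstract reports several metrics that are all functions of a binary confusion matrix: accuracy, sensitivity, specificity, F1-score, balanced accuracy, and MCC. As a reading aid, the sketch below shows how these quantities are conventionally defined. It is not the authors' code. The function name `binary_metrics` and the counts passed to it are hypothetical, chosen only for illustration, and are not taken from the paper's data. AUC is omitted because it requires predicted scores rather than counts.

```python
import math


def binary_metrics(tp, fp, tn, fn):
    """Standard binary-classification metrics from confusion-matrix counts.

    tp, fp, tn, fn: true positives, false positives,
    true negatives, false negatives.
    """
    sensitivity = tp / (tp + fn)   # recall on the positive class
    specificity = tn / (tn + fp)   # recall on the negative class
    precision = tp / (tp + fp)
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    # F1 is the harmonic mean of precision and sensitivity.
    f1 = 2 * precision * sensitivity / (precision + sensitivity)
    # Balanced accuracy averages the two per-class recalls.
    balanced_accuracy = (sensitivity + specificity) / 2
    # Matthews correlation coefficient; defined as 0 if the denominator is 0.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "precision": precision,
        "f1": f1,
        "balanced_accuracy": balanced_accuracy,
        "mcc": mcc,
    }


# Hypothetical counts for illustration only (NOT the paper's results).
metrics = binary_metrics(tp=36, fp=4, tn=50, fn=10)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Balanced accuracy and MCC are the two metrics usually cited when classes are imbalanced, because both account for performance on the negative class as well as the positive one.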
