Publicación:
Random Forest Model Based on Machine Learning for Early Detection of Diabetes

dc.contributor.authorRubio-Paucar, Inoc
dc.contributor.authorYactayo-Arias, Cesar
dc.contributor.authorAndrade-Arenas, Laberiano
dc.date.accessioned2025-09-05T16:31:37Z
dc.description.abstractDiabetes mellitus presents a growing prevalence at the global level, representing a significant public health challenge. Despite the availability of specific treatments, it is imperative to develop innovative strategies that optimize early detection and management of the disease. The research aims to develop a model that allows for the early detection of diabetes using the Random Forest algorithm, using the Knowledge Discovery in Databases (KDD) methodology, which comprises the phases of selection, preprocessing, transformation, data mining, interpretation and evaluation. The dataset used include 520 randomly selected patient records. The model achieved robust performance, with an accuracy of 85%, sensitivity of 75%, and an F1-score of 78%, indicating an adequate balance between precision and sensitivity. Specificity was 78%, while the area under the ROC curve (AUC) reached 86%, demonstrating a high discriminative ability between positive and negative cases. The balanced accuracy was 82%, andthe Matthews correlation coefficient (MCC) registered a value of 0.72, confirming the strength and reliability of the model even in the presence of class imbalance. These results demonstrate the effectiveness of the machine learning-based approach for the early detection of diabetes mellitus, with potential application in clinical decision support systems. © 2025 Elsevier B.V., All rights reserved.
dc.identifier.doi10.14569/IJACSA.2025.01606103
dc.identifier.scopus2-s2.0-105009687905
dc.identifier.urihttps://cris.uwiener.edu.pe/handle/001/78
dc.identifier.uuid4f53466f-a952-4d17-9c65-6aa857634388
dc.language.isoen
dc.publisherScience and Information Organization
dc.relation.citationissue6
dc.relation.citationvolume16
dc.relation.ispartofseriesInternational Journal of Advanced Computer Science and Applications
dc.relation.issn21565570
dc.rightshttp://purl.org/coar/access_right/c_14cb
dc.titleRandom Forest Model Based on Machine Learning for Early Detection of Diabetes
dc.typehttp://purl.org/coar/resource_type/c_2df8fbb1
dspace.entity.typePublication
oaire.citation.endPage1063
oaire.citation.startPage1051

Archivos

Colecciones