Benchmarking machine learning algorithm for stunting risk prediction in Indonesia

Penulis: Novalina, Nadya; Tarigan, Ibrahim Amyas Aksar; Kameela, Fatimah Kayla; Rizkinia, Mia
Informasi
JurnalBulletin of Electrical Engineering and Informatics
PenerbitInstitute of Advanced Engineering and Science
Volume & EdisiVol. 14,Edisi 3
Halaman2252 - 2263
Tahun Publikasi2025
ISSN20893191
Jenis SumberScopus
Abstrak
Stunting is a condition caused by poor nutrition that results in below-average height development, potentially leading to long-term effects such as intellectual disability, low learning abilities, and an increased risk of developing chronic diseases. One effort to reduce stunting is to apply a machine learning algorithm with a data science approach to develop risk prediction models based on factors in stunting. The study used the current cross industry standard process for data mining (CRISP-DM) framework to gain insight and analyzed 1561 records of data collected from the Indonesia family life survey (IFLS) for the prediction models. Two sampling methods, random undersampling, and oversampling synthetic minority oversampling technique (SMOTE), were employed and compared to overcome the data imbalance problem. Four machine learning classifier algorithms were trained and tested to determine the best-performing model. The experiment results showed that the algorithms yielded an average accuracy of more than 75%. Using the undersampling technique, the accuracy obtained by logistic regression, k-nearest neighbor (KNN), support vector classifier (SVC), and decision tree classifier were 95.21%, 78.91%, 92.97%, and 86.26% respectively. Meanwhile, the oversampling technique reached 96.17%, 88.50%, 93.29%, and 95.21%, respectively. Logistic regression emerges as the best classification, with oversampling yielding superior performance. © 2025, Institute of Advanced Engineering and Science. All rights reserved.
Dokumen & Tautan

© 2025 Universitas Indonesia. Seluruh hak cipta dilindungi.