Benchmarking machine learning algorithm for stunting risk prediction in Indonesia

Penulis: Novalina, Nadya; Tarigan, Ibrahim Amyas Aksar; Kameela, Fatimah Kayla; Rizkinia, Mia

Informasi

JurnalBulletin of Electrical Engineering and Informatics

PenerbitInstitute of Advanced Engineering and Science

Volume & EdisiVol. 14,Edisi 3

Halaman2252 - 2263

Tahun Publikasi2025

DOI10.11591/eei.v14i3.8997

ISSN20893191

Jenis SumberScopus

Abstrak

Stunting is a condition caused by poor nutrition that results in below-average height development, potentially leading to long-term effects such as intellectual disability, low learning abilities, and an increased risk of developing chronic diseases. One effort to reduce stunting is to apply a machine learning algorithm with a data science approach to develop risk prediction models based on factors in stunting. The study used the current cross industry standard process for data mining (CRISP-DM) framework to gain insight and analyzed 1561 records of data collected from the Indonesia family life survey (IFLS) for the prediction models. Two sampling methods, random undersampling, and oversampling synthetic minority oversampling technique (SMOTE), were employed and compared to overcome the data imbalance problem. Four machine learning classifier algorithms were trained and tested to determine the best-performing model. The experiment results showed that the algorithms yielded an average accuracy of more than 75%. Using the undersampling technique, the accuracy obtained by logistic regression, k-nearest neighbor (KNN), support vector classifier (SVC), and decision tree classifier were 95.21%, 78.91%, 92.97%, and 86.26% respectively. Meanwhile, the oversampling technique reached 96.17%, 88.50%, 93.29%, and 95.21%, respectively. Logistic regression emerges as the best classification, with oversampling yielding superior performance. © 2025, Institute of Advanced Engineering and Science. All rights reserved.

Dokumen & Tautan

Scopus