Comparative Analysis of Random Forest, Logistic Regression and SVM for Stunting Prediction Using Anthropometric Data

Shalsa Bela Dwi Widyawati; Purwadi Purwadi; Ika Romadoni Yunita

doi:10.63158/journalisi.v7i4.1387

Shalsa Bela Dwi Widyawati Amikom Purwokerto University, Indonesia
Purwadi Purwadi Amikom Purwokerto University, Indonesia
Ika Romadoni Yunita Amikom Purwokerto University, Indonesia

Keywords: Stunting, Nutritional Status, Machine Learning, Random Forest, Logistic Regression, Support Vector Machine (SVM)

Abstract

Stunting remains a critical nutritional issue in Indonesia, significantly impacting the physical and cognitive development of children under five. Prompt and accurate detection of nutritional status is essential for early intervention. This study aims to predict toddlers' nutritional health using the Random Forest algorithm, based on age and height data. From an initial dataset of 120,998 anthropometric records, preprocessing steps—such as duplicate removal and nutritional status recategorization—resulted in a final dataset of 39,425 entries. The research methodology includes data collection, preprocessing, exploratory analysis, model training, handling class imbalance, and performance evaluation using accuracy, precision, recall, and F1-score. The study also compares the Random Forest model with Logistic Regression and Support Vector Machine (SVM). Results show that Random Forest outperforms the other models, achieving perfect classification metrics: Accuracy (1.00), Recall (1.00), F1-Score (1.00), and Cross-validation Accuracy (99.74%). These outcomes highlight Random Forest's robustness in classifying under-five nutrition data, making it an effective tool for rapid and reliable stunting risk detection. This research supports efforts to reduce Indonesia's stunting rate to below 20% by 2024, contributing to national health improvement strategies through technology-driven early diagnosis.

Downloads

Download data is not yet available.

References

R. Ratnasari, A. J. Wahidin, and T. H. Andika, “Early Detection of Stunting in Children Based on Anthropometric Indicators Using Machine Learning Algorithmsg,” J. Algoritm., vol. 21, no. 2, pp. 378–387, 2024, doi: 10.33364/algoritma/v.21-2.2122.

N. Rusliani, W. R. Hidayani, and H. Sulistyoningsih, “Literature Review: Factors Associated with Stunting in Toddlers,” Bul. Ilmu Kebidanan dan Keperawatan, vol. 1, no. 01, pp. 32–40, 2022, doi: 10.56741/bikk.v1i01.39.

S. N. Azizah and Z. Fatah, “Implementation of the K-Nearest Neighbor (K-NN) Method in the Classification of Stunting in Toddlers ,” Gudang J. Multidisiplin Ilmu, vol. 2, no. 10, pp. 282–288, 2024.

T. R. . Lestari, “Stunting in Indonesia: The Root of the Problem and Its Solution ,” Info Singk. Kaji. Singk. Terhadap Isu Aktual dan Strateg., vol. XV, no. 14, pp. 21–25, 2023.

F. M. Mulyaningrum, M. M. Susanti, and U. A. Nuur, “Factors Affecting Childhood Stunting,” pp. 74–84, 2021.

N. L. Rambe, “Indonesian Health Magazine,” J. Ilm. Kebidanan Imelda, vol. 1, no. 2, pp. 45–49, 2020.

J. Aurima, S. Susaldi, N. Agustina, A. Masturoh, R. Rahmawati, and M. Tresiana Monika Madhe, “Factors Associated with Stunting in Indonesian Toddlers,” Open Access Jakarta J. Heal. Sci., vol. 1, no. 2, pp. 43–48, 2021, doi: 10.53801/oajjhs.v1i3.23.

S. Marsya Finda and D. Wahyu Utomo, “Classification of Stunting in Toddlers using Ensemble Learning and Random Forest Methods,” Jl. Imam Bonjol No, vol. 15, no. 02, pp. 287–295, 2024, doi: 10.35970/infotekmesin.v15i2.2326.

A. T. Armando Sibuea, P. Harry Gunawan, and Indwiarti, “Classifying Stunting Status in Toddlers Using K-Nearest Neighbor and Logistic Regression Analysis,” in 2024 International Conference on Data Science and Its Applications (ICoDSA), 2024, pp. 6–11. doi: 10.1109/ICoDSA62899.2024.10652063.

A. Jalil, A. Homaidi, and Z. Fatah, “Implementation of Support Vector Machine Algorithm for Classification of Stunting Status in Toddlers,” G-Tech J. Teknol. Terap., vol. 8, no. 3, pp. 2070–2079, 2024, doi: 10.33379/gtech.v8i3.4811.

M. R. Akbar Ariyadi, S. Lestanti, and S. Kirom, “Classification of Stunted Toddlers Using Random Forest Classifier in Blita Districtr,” JATI (Jurnal Mhs. Tek. Inform., vol. 7, no. 6, pp. 3846–3851, 2024, doi: 10.36040/jati.v7i6.7822.

J. Juwariyem, S. Sriyanto, S. Lestari, and C. Chairani, “Prediction of Stunting in Toddlers Using Bagging and Random Forest Algorithms,” Sinkron, vol. 8, no. 2, pp. 947–955, 2024, doi: 10.33395/sinkron.v8i2.13448.

R. Fauzan and A. Rosita, “Comparison of KNN and Naïve Bayes Classification Algorithms for Predicting Stunting in Toddlers in Banjaran District,” Jurnal Teknologi dan Sistem Komputer, vol. 9, no. 5, pp. 2711–2717, 2025.

J. Han, J. Pei, and H. Tong, Data Mining: Concepts and Techniques, 4th ed., Elsevier, 2022.

A. Smote and D. A. N. Neighbor, “Classification of Unbalanced Data Using SMOTE,” Jurnal Data Science Indonesia (DSI), vol. 3, no. 1, pp. 44–49.

F. Azimah and K. Rizky Nova Wardani, “Early Covid-19 Symptom Detection System Using the Al Project Cycle Method,” J. Locus Penelit. dan Pengabdi., vol. 1, no. 6, pp. 405–418, 2022, doi: 10.36418/locus.v1i6.135.

P. K. Neighbors and D. A. N. LightGBM, “Combining K-Nearest Neighbors and LightGBM for Diabetes Prediction on the Pima Indians Dataset,” Jurnal Ilmu Komputer dan Aplikasi, vol. 9, no. 3, pp. 1133–1144, 2024.

F. Duran, F. Wijaya, Y. R. Hulu, M. Harahap, and A. Prabowo, “Performance Comparison of Random Forest Classifier and LightGBM Classifier Algorithms for Heart Disease Prediction,” Data Sci. Indones., vol. 3, no. 2, pp. 98–103, 2024, doi: 10.47709/dsi.v3i2.3831.

K. Science, E. Journal, and M. A. Engin, “Parkinson’s Disease Detection via Machine Learning Using Data Splitting and Validation Methods,” Karaelmas Fen ve Mühendislik Dergisi (KaraelmasFen), vol. 14, no. 2, pp. 134–147, 2024, doi: 10.7212/karaelmasfen.1484222.

F. O. Awalullaili et al., “Classification of Hypertension Using the SVM Grid Search Method and SVM Genetic Algorithm,” Jurnal Gaussian (J. Gauss), vol. 11, no. 4, pp. 488–498, 2023, doi: 10.14710/j.gauss.11.4.488-498.

T. Burzykowski, M. Geubbelmans, A. Rousseau, and D. Valkenborg, “Validation of machine learning algorithms,” Am. J. Orthod. Dentofac. Orthop., vol. 164, no. 2, pp. 295–297, doi: 10.1016/j.ajodo.2023.05.007.

N. R. Muntiari, K. H. Hanif, and L. Herawati, “Detection of Stunting in Toddlers Using Comparison Method,” Jurnal Teknologi Informasi dan Ilmu Komputer, vol. 13, no. 1, pp. 17–23, 2025.

N. Nurdiansyah et al., “Mental Health Analysis to Prevent Mental Disorders in Students Using K-Nearest Neighbor and Random Forest Algorithms,” Jurnal Ilmiah Teknologi dan Komputer, vol. 5, no. 1, pp. 1–9, 2025.

M. I. Elim and E. Utami, “Performance Comparison of Child Stunting Prediction: Support Vector Machine vs Random Forest with Grid Search Optimization,” Jurnal Teknologi dan Sistem Informasi, vol. 6, no. 5, pp. 5305–5319, 2025.

A. Syukron, “Application of the SMOTE Method to Overcome Class Imbalance in Heart Failure Prediction,” Jurnal Gaussian (J. Gauss), vol. 10, no. 1, pp. 47–50, 2023.