Predicting and identifying factors associated with undernutrition among children under five years in Ghana using machine learning algorithms

被引:6
作者
Anku, Eric Komla [1 ]
Duah, Henry Ofori [2 ]
机构
[1] Cape Coast Teaching Hosp, Dietherapy & Nutr, Cape Coast, Ghana
[2] Univ Cincinnati, Coll Nursing, Cincinnati, OH USA
关键词
D O I
10.1371/journal.pone.0296625
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background Undernutrition among children under the age of five is a major public health concern, especially in developing countries. This study aimed to use machine learning (ML) algorithms to predict undernutrition and identify its associated factors. Methods Secondary data analysis of the 2017 Multiple Indicator Cluster Survey (MICS) was performed using R and Python. The main outcomes of interest were undernutrition (stunting: height-for-age (HAZ) < -2 SD; wasting: weight-for-height (WHZ) < -2 SD; and underweight: weight-for-age (WAZ) < -2 SD). Seven ML algorithms were trained and tested: linear discriminant analysis (LDA), logistic model, support vector machine (SVM), random forest (RF), least absolute shrinkage and selection operator (LASSO), ridge regression, and extreme gradient boosting (XGBoost). The ML models were evaluated using the accuracy, confusion matrix, and area under the curve (AUC) receiver operating characteristics (ROC). Results In total, 8564 children were included in the final analysis. The average age of the children was 926 days, and the majority were females. The weighted prevalence rates of stunting, wasting, and underweight were 17%, 7%, and 12%, respectively. The accuracies of all the ML models for wasting were (LDA: 84%; Logistic: 95%; SVM: 92%; RF: 94%; LASSO: 96%; Ridge: 84%, XGBoost: 98%), stunting (LDA: 86%; Logistic: 86%; SVM: 98%; RF: 88%; LASSO: 86%; Ridge: 86%, XGBoost: 98%), and for underweight were (LDA: 90%; Logistic: 92%; SVM: 98%; RF: 89%; LASSO: 92%; Ridge: 88%, XGBoost: 98%). The AUC values of the wasting models were (LDA: 99%; Logistic: 100%; SVM: 72%; RF: 94%; LASSO: 99%; Ridge: 59%, XGBoost: 100%), for stunting were (LDA: 89%; Logistic: 90%; SVM: 100%; RF: 92%; LASSO: 90%; Ridge: 89%, XGBoost: 100%), and for underweight were (LDA: 95%; Logistic: 96%; SVM: 100%; RF: 94%; LASSO: 96%; Ridge: 82%, XGBoost: 82%). Age, weight, length/height, sex, region of residence and ethnicity were important predictors of wasting, stunting and underweight. Conclusion The XGBoost model was the best model for predicting wasting, stunting, and underweight. The findings showed that different ML algorithms could be useful for predicting undernutrition and identifying important predictors for targeted interventions among children under five years in Ghana.
引用
收藏
页数:16
相关论文
共 27 条
[1]  
[Anonymous], 2009, Ghana Maternal Health Survey 2007
[2]  
[Anonymous], 2021, Malnutrition
[3]  
[Anonymous], 2015, Demographic Health Survey, P530
[4]   Interpretable machine learning for demand modeling with high-dimensional data using Gradient Boosting Machines and Shapley values [J].
Antipov, Evgeny A. ;
Pokryshevskaya, Elena B. .
JOURNAL OF REVENUE AND PRICING MANAGEMENT, 2020, 19 (05) :355-364
[5]  
Bisong E., 2019, BUILDING MACHINE LEA
[6]   Machine learning algorithms for predicting undernutrition among under-five children in Ethiopia [J].
Bitew, Fikrewold H. ;
Sparks, Corey S. ;
Nyarko, Samuel H. .
PUBLIC HEALTH NUTRITION, 2022, 25 (02) :269-280
[7]  
Boah M., 2019, Internet], V14, P1, DOI [10.1371/journal.pone.0219665, DOI 10.1371/JOURNAL.PONE.0219665]
[8]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[9]   A machine learning classifier approach for identifying the determinants of under-five child undernutrition in Ethiopian administrative zones [J].
Fenta, Haile Mekonnen ;
Zewotir, Temesgen ;
Muluneh, Essey Kebede .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2021, 21 (01)
[10]  
Ghana Statistical Service, 2018, Snapshots of key findings: Ghana Multiple Indicator Cluster Survey 2017/2018