Predicting nutritional status for women of childbearing age from their economic, health, and demographic features: A supervised machine learning approach

被引:6
作者
Khudri, Md. Mohsan [1 ]
Rhee, Kang Keun [1 ]
Hasan, Mohammad Shabbir [2 ]
Ahsan, Karar Zunaid [3 ]
机构
[1] Univ Memphis, Fogelman Coll Business & Econ, Dept Econ, Memphis, TN USA
[2] Virginia Tech, Dept Comp Sci, Blacksburg, VA USA
[3] Univ North Carolina Chapel Hill, Gillings Sch Global Publ Hlth, Publ Hlth Leadership Program, Chapel Hill, NC 27599 USA
来源
PLOS ONE | 2023年 / 18卷 / 05期
关键词
BODY-MASS INDEX; CHI-SQUARED TESTS; PHYSICAL-ACTIVITY; WEIGHT CHANGE; DOUBLE BURDEN; BIG DATA; OBESITY; OVERWEIGHT; MALNUTRITION; IMPACT;
D O I
10.1371/journal.pone.0277738
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
BackgroundMalnutrition imposes enormous costs resulting from lost investments in human capital and increased healthcare expenditures. There is a dearth of research focusing on the prediction of women's body mass index (BMI) and malnutrition outcomes (underweight, overweight, and obesity) in developing countries. This paper attempts to fill out this knowledge gap by predicting the BMI and the risks of malnutrition outcomes for Bangladeshi women of childbearing age from their economic, health, and demographic features. MethodsData from the 2017-18 Bangladesh Demographic and Health Survey and a series of supervised machine learning (SML) techniques are used. Additionally, this study circumvents the imbalanced distribution problem in obesity classification by utilizing an oversampling approach. ResultsStudy findings demonstrate that the support vector machine and k-nearest neighbor are the two best-performing methods in BMI prediction based on the coefficient of determination (R2), root mean square error (RMSE), and mean absolute error (MAE). The combined predictor algorithms consistently yield top specificity, Cohen's kappa, F1-score, and AUC in classifying the malnutrition status, and their performance is robust to alternative standards. The feature importance ranking based on several nonparametric and combined predictors indicates that socioeconomic status, women's age, and breastfeeding status are the most important features in predicting women's nutritional outcomes. Furthermore, the conditional inference trees corroborate that those three features, along with the partner's educational attainment and employment status, significantly predict malnutrition risks. ConclusionTo the best of our knowledge, this is the first study that predicts BMI and one of the pioneer studies to classify all three malnutrition outcomes for women of childbearing age in Bangladesh, let alone in any lower-middle income country, using SML techniques. Moreover, in the context of Bangladesh, this paper is the first to identify and rank features that are critical in predicting nutritional outcomes using several feature selection algorithms. The estimators from this study predict the outcomes of interest most accurately and efficiently compared to other existing studies in the relevant literature. Therefore, study findings can aid policymakers in designing policy and programmatic approaches to address the double burden of malnutrition among Bangladeshi women, thereby reducing the country's economic burden.
引用
收藏
页数:31
相关论文
共 124 条
  • [1] Socio-economic characteristics and obesity in underdeveloped economies: does income really matter?
    Abdulai, Awudu
    [J]. APPLIED ECONOMICS, 2010, 42 (02) : 157 - 169
  • [2] New empirical nonparametric kernels for support vector machine classification
    Al Daoud, Essam
    Turabieh, Hamza
    [J]. APPLIED SOFT COMPUTING, 2013, 13 (04) : 1759 - 1765
  • [3] Nutrition transition - Pattern IV: Leads Bangladeshi youth to the increasing prevalence of overweight and obesity
    Al Muktadir, Mohammad Hamid
    Islam, Md Ashraful
    Amin, Mohammad Nurul
    Ghosh, Supriya
    Siddiqui, Shafayet Ahmed
    Debnath, Dipti
    Islam, Md Monirul
    Ahmed, Tufael
    Sultana, Farhana
    [J]. DIABETES & METABOLIC SYNDROME-CLINICAL RESEARCH & REVIEWS, 2019, 13 (03) : 1943 - 1947
  • [4] Machine Labor
    Angrist, Joshua D.
    Frandsen, Brigham
    [J]. JOURNAL OF LABOR ECONOMICS, 2022, 40 : S97 - S140
  • [5] [Anonymous], 2016, The double burden of malnutrition: Policy brief
  • [6] Machine Learning Methods That Economists Should Know About
    Athey, Susan
    Imbens, Guido W.
    [J]. ANNUAL REVIEW OF ECONOMICS, VOL 11, 2019, 2019, 11 : 685 - 725
  • [7] Beyond prediction: Using big data for policy problems
    Athey, Susan
    [J]. SCIENCE, 2017, 355 (6324) : 483 - 485
  • [8] Factors associated with duration of breastfeeding in Bangladesh: evidence from Bangladesh demographic and health survey 2014
    Ayesha, Ummay
    Mamun, A. S. M. A.
    Sayem, Md. Abu
    Hossain, Md. Golam
    [J]. BMC PUBLIC HEALTH, 2021, 21 (01)
  • [9] Machine Learning Methods for Demand Estimation
    Bajari, Patrick
    Nekipelov, Denis
    Ryan, Stephen P.
    Yang, Miaoyu
    [J]. AMERICAN ECONOMIC REVIEW, 2015, 105 (05) : 481 - 485
  • [10] THE EFFECTS OF CIGARETTE COSTS ON BMI AND OBESITY
    Baum, Charles L.
    [J]. HEALTH ECONOMICS, 2009, 18 (01) : 3 - 19