Machine learning study using 2020 SDHS data to determine poverty determinants in Somalia

Cited by: 8
Authors
Hassan, Abdirizak A. [1]
Muse, Abdisalam Hassan [1]
Chesneau, Christophe [2]
Affiliations
[1] Amoud Univ, Sch Postgrad Studies & Res, Amoud Valley, Borama 25263, Awdal, Somalia
[2] Univ Caen, Dept Math, LMNO, CNRS, Campus II,Sci 3, F-14032 Caen, France
Keywords
Machine learning; Somalia; Random forest; Model precision; Classical regression; Sustainability; Demography;
DOI
10.1038/s41598-024-56466-8
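Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification codes
07; 0710; 09
Abstract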
Extensive research has been conducted on poverty in developing countries using conventional regression analysis, which has limited prediction capability. This study aims to address this gap by applying advanced machine learning (ML) methods to predict poverty in Somalia. Utilizing data from the first-ever 2020 Somalia Demographic and Health Survey (SDHS), a cross-sectional study design is considered. ML methods, including random forest (RF), decision tree (DT), support vector machine (SVM), and logistic regression, are tested and applied using R software version 4.1.2, while conventional methods are analyzed using STATA version 17. Evaluation metrics, such as confusion matrix, accuracy, precision, sensitivity, specificity, recall, F1 score, and area under the receiver operating characteristic curve (AUROC), are employed to assess the performance of predictive models. The prevalence of poverty in Somalia is notable, with approximately seven out of ten Somalis living in poverty, making it one of the highest rates in the region. Among nomadic pastoralists, agro-pastoralists, and internally displaced persons (IDPs), the poverty average stands at 69%, while urban areas have a lower poverty rate of 60%. The accuracy of prediction ranged between 67.21% and 98.36% for the advanced ML methods, with the RF model demonstrating the best performance. The results reveal geographical region, household size, respondent age group, husband employment status, age of household head, and place of residence as the top six predictors of poverty in Somalia. The findings highlight the potential of ML methods to predict poverty and uncover hidden information that traditional statistical methods cannot detect, with the RF model identified as the best classifier for predicting poverty in Somalia.
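As an illustration of how the evaluation metrics named in the abstract relate to a confusion matrix, here is a minimal Python sketch. It is not the study's R code; the function name `binary_metrics` and the counts passed to it are hypothetical and chosen only to show the arithmetic, with "poor" treated as the positive class by assumption.

```python
def binary_metrics(tp, fp, fn, tn):
    """Derive common metrics from the four cells of a binary confusion matrix.

    tp/fp/fn/tn are counts of true positives, false positives,
    false negatives, and true negatives.
    """
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total
    # Guard each ratio against an empty denominator.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0  # identical to recall
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    denom = precision + sensitivity
    f1 = 2 * precision * sensitivity / denom if denom else 0.0
    return {
        "accuracy": accuracy,
        "precision": precision,
        "sensitivity": sensitivity,
        "recall": sensitivity,
        "specificity": specificity,
        "f1": f1,
    }


# Hypothetical counts, not taken from the SDHS data.
metrics = binary_metrics(tp=60, fp=10, fn=5, tn=25)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

Sensitivity and recall are the same quantity, so the sketch reports one value under both names. AUROC is omitted because it needs predicted scores across thresholds, not a single confusion matrix.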
Pages: 19