A comparative analysis of variants of machine learning and time series models in predicting women's participation in the labor force

被引:0
作者
Elstohy, Rasha [1 ,2 ]
Aneis, Nevein [3 ]
Ali, Eman Mounir [4 ]
机构
[1] Obour Inst, Dept Informat Syst, Al Sharqia, Egypt
[2] New Cairo Technol Univ, Dept Informat Commun Technol, New Cairo, Egypt
[3] Obour Inst, Basic Sci Dept, Al Sharqia, Egypt
[4] Benha Univ, Fac Comp & Artificial Intelligence, Sci Comp Dept, Al Qalyubia, Egypt
关键词
Machine learning; Time series; Women employment; Crisis times; Labor force; Employment rate; Forecasting; Analysis; UNEMPLOYMENT;
D O I
10.7717/peerj-cs.2430
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Labor force participation of Egyptian women has been a chronic economic problem in Egypt. Despite the improvement in the human capital front, whether on the education or health indicators, female labor force participation remains persistently low. This study proposes a hybrid machine-learning model that integrates principal component analysis (PCA) for feature extraction with various machine learning and time-series models to predict women's employment in times of crisis. Various machine learning (ML) algorithms, such as support vector machine (SVM), neural network, K-nearest neighbor (KNN), linear regression, random forest, and AdaBoost, in addition to popular time series algorithms, including autoregressive integrated moving average (ARIMA) and vector autoregressive (VAR) models, have been applied to an actual dataset from the public sector. The manpower dataset considered gender from different regions, ages, and educational levels. The dataset was then trained, tested, and evaluated. For performance validation, forecasting accuracy metrics were constructed using mean squared error (MSE), root mean squared error (RMSE), mean absolute error (MAE), mean absolute percent error (MAPE), R-squared (R2), and cross-validated root mean squared error (CVRMSE). Another Dickey-Fuller test was performed to evaluate and compare the accuracy of the applied models, and the results showed that AdaBoost outperforms the other methods by an accuracy of 100%. Compared to alternative works, our findings demonstrate a comprehensive comparative analysis for predicting women's participation in different regions during an economic crisis.
引用
收藏
页数:18
相关论文
共 34 条
[1]   Diabetes Mellitus Disease Prediction and Type Classification Involving Predictive Modeling Using Machine Learning Techniques and Classifiers [J].
Ahamed, B. Shamreen ;
Arya, Meenakshi S. ;
Sangeetha, S. K. B. ;
Auxilia Osvin, Nancy V. .
APPLIED COMPUTATIONAL INTELLIGENCE AND SOFT COMPUTING, 2022, 2022
[2]   Time Series Data Modeling Using Advanced Machine Learning and AutoML [J].
Alsharef, Ahmad ;
Sonia ;
Kumar, Karan ;
Iwendi, Celestine .
SUSTAINABILITY, 2022, 14 (22)
[4]   Performances of Machine Learning Models for Diagnosis of Alzheimer’s Disease [J].
Arjaria S.K. ;
Rathore A.S. ;
Bisen D. ;
Bhattacharyya S. .
Annals of Data Science, 2024, 11 (01) :307-335
[5]  
CAPMAS, 2018, Quarterly Bulltin Labour Force Survey, V4, P42
[6]  
CAPMAS, 2021, Quarterly Bulltin Labour Force Survey, V1, P38
[7]  
CAPMAS, 2022, Quarterly Bulletin Labour Force Survey, V3, P29
[8]  
CAPMAS, 2020, Quarterly Bulltin Labour Force Survey, V4, P36
[9]  
CAPMAS, 2019, Quarterly Bulltin Labour Force Survey, V4, P38
[10]   Unemployment in Rural Europe: A Machine Learning Perspective [J].
Celbis, Mehmet Guney .
APPLIED SPATIAL ANALYSIS AND POLICY, 2023, 16 (03) :1071-1095