Employee attrition prediction for imbalanced data using genetic algorithm-based parameter optimization of XGB Classifier

被引:1
作者
Konar, Karabi [1 ]
Das, Saptarshi [1 ]
Das, Samiran [1 ]
机构
[1] JIS Univ, Ctr Data Sci, JIS Inst Adv Studies & Res, Kolkata, India
来源
2023 INTERNATIONAL CONFERENCE ON COMPUTER, ELECTRICAL & COMMUNICATION ENGINEERING, ICCECE | 2023年
关键词
Machine learning; Imbalanced classification; XGBoost; Genetic Algorithm;
D O I
10.1109/ICCECE51049.2023.10085402
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Attrition of employees is vital for any organization as it significantly influences productivity and hampers the longterm growth strategies of the organization. Since employee attrition leads to loss of skills and experiences any organization always try to find a way to retain their employees to reduce training and recruiting cost as well as to achieve their business goal smoothly. Machine learning approaches, which predict the possibility of attrition based on the employee attributes avoid the tedious, and biased manual prediction, and help the organization take preventive measures. This paper presents a framework for attrition prediction that emphasizes imbalance classification and the adoption of genetic algorithms to optimize the model. First, we have adopted different oversampling methods like Synthetic Minority Over-sampling Technique (SMOTE), Adaptive Synthetic (ADASYN), and Borderline Synthetic Minority Over-sampling Technique to balance our data set. We have used XGBoost classifiers for classification with the data that are obtained from different over-sampling techniques. As the XGBoost classifier has many hyperparameter a genetic algorithm is used to optimize our model where the accuracy is chosen as the fitness function. The comparative performance analysis of different over-sampling methods as well as hyper-parameter tuning (Amongst Genetic algorithm, GridSearchCV, and with the default value of different hyper-parameter) on the real dataset suggests that SMOTE for oversampling techniques and genetic algorithm for optimization attains improved performance.
引用
收藏
页数:6
相关论文
共 23 条
[1]   Employee Attrition Prediction Using Deep Neural Networks [J].
Al-Darraji, Salah ;
Honi, Dhafer G. ;
Fallucchi, Francesca ;
Abdulsada, Ayad, I ;
Giuliano, Romeo ;
Abdulmalik, Husam A. .
COMPUTERS, 2021, 10 (11)
[2]  
Alduayj SS, 2018, IEEE INT CONF INNOV, P93, DOI 10.1109/INNOVATIONS.2018.8605976
[3]   Automated Prediction of Employee Attrition Using Ensemble Model Based on Machine Learning Algorithms [J].
Alsheref, Fahad Kamal ;
Fattoh, Ibrahim Eldesouky ;
Ead, Waleed M. .
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
[4]   From Big Data to Deep Data to Support People Analytics for Employee Attrition Prediction [J].
Ben Yahia, Nesrine ;
Hlel, Jihen ;
Colomo-Palacios, Ricardo .
IEEE ACCESS, 2021, 9 (09) :60447-60458
[5]   Machine Learning Approach to Predicting Attrition Among Employees at Work [J].
Bhatta, Sudipta ;
Zaman, Isfaf Uz ;
Raisa, Nuzhat ;
Fahim, Shazzadul Islam ;
Momen, Sifat .
ARTIFICIAL INTELLIGENCE TRENDS IN SYSTEMS, VOL 2, 2022, 502 :285-294
[6]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)
[7]   Identifying factors for employee retention using computational techniques: an approach to assist the decision-making process [J].
Halim, Zahid ;
Maria ;
Waqas, Muhammad ;
Edwin, Cedric A. ;
Shah, Ahsan .
SN APPLIED SCIENCES, 2020, 2 (09)
[8]   ADASYN: Adaptive Synthetic Sampling Approach for Imbalanced Learning [J].
He, Haibo ;
Bai, Yang ;
Garcia, Edwardo A. ;
Li, Shutao .
2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, :1322-1328
[9]  
ibm, ABOUT US
[10]   Explaining and predicting employees' attrition: a machine learning approach [J].
Jain, Praphula Kumar ;
Jain, Madhur ;
Pamula, Rajendra .
SN APPLIED SCIENCES, 2020, 2 (04)