Software defects prediction by metaheuristics tuned extreme gradient boosting and analysis based on Shapley Additive Explanations

Cited by: 27
Authors
Zivkovic, Tamara [1]
Nikolic, Bosko [1]
Simic, Vladimir [2, 3]
Pamucar, Dragan [4]
Bacanin, Nebojsa [5]
Affiliations
[1] Univ Belgrade, Sch Elect Engn, Bulevar Kralja Aleksandra 73, Belgrade 11000, Serbia
[2] Univ Belgrade, Fac Transport & Traff Engn, Vojvode Stepe 305, Belgrade 11000, Serbia
[3] Yuan Ze Univ, Coll Engn, Dept Ind Engn & Management, Yuandong Rd, Taoyuan City 320315, Taiwan
[4] Univ Belgrade, Fac Org Sci, Dept Operat Res & Stat, Jove Ilica 154, Belgrade 11000, Serbia
[5] Singidunum Univ, Fac Informat & Comp, Danijelova 32, Belgrade 11000, Serbia
Keywords
Software testing; Software defect prediction; XGBoost; Reptile search algorithm; Metaheuristics optimization;
DOI
10.1016/j.asoc.2023.110659
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Software testing is a crucial component of software development, and it often makes the difference between successful and failed projects. Despite its importance, the fast pace and short deadlines of contemporary projects mean that testing is often neglected or insufficiently detailed, which can lead to loss of reputation, private user data, money, and in some circumstances even lives. In such situations, it is valuable to predict which modules are error-prone from a collection of software metrics and to focus testing on them; this is a typical classification task. Machine learning models have been frequently employed on a wide range of classification problems with significant success, and this paper proposes an eXtreme gradient boosting (XGBoost) model for the defect prediction task. A modified variant of the well-known reptile search optimization algorithm is proposed to tune the XGBoost hyperparameters. The enhanced algorithm, named HARSA, was evaluated on the challenging CEC2019 benchmark functions, where it exhibited excellent performance. The XGBoost model tuned by the proposed algorithm was then evaluated on two benchmark software testing datasets, and the simulation outcomes were compared with those of other powerful swarm intelligence metaheuristics run in an identical experimental environment; the proposed approach attained superior classification accuracy on both datasets. Finally, Shapley Additive Explanations analysis was conducted to determine the impact of various software metrics on the classification results.
© 2023 Elsevier B.V. All rights reserved.
Pages: 32
Related papers
50 items in total
  • [31] Machine learning-based heat deflection temperature prediction and effect analysis in polypropylene composites using catboost and shapley additive explanations
    Joo, Chonghyo
    Park, Hyundo
    Lim, Jongkoo
    Cho, Hyungtae
    Kim, Junghwan
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
  • [32] A Satellite Fault Diagnosis and Analysis Method based on Extreme Gradient Boosting
    Liu, Xiaopeng
    Wang, Yuechuan
    Chai, Senchun
    Li, Zhaoyang
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 6694 - 6699
  • [33] Improvement of orbit prediction accuracy using extreme gradient boosting and principal component analysis
    Zhai, Min
    Huyan, Zongbo
    Hu, Yuanyuan
    Jiang, Yu
    Li, Hengnian
    OPEN ASTRONOMY, 2022, 31 (01) : 229 - 243
  • [34] BooDet: Gradient Boosting Object Detection With Additive Learning-Based Prediction Aggregation
    Li, Ya-Li
    Wang, Shengjin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 2620 - 2632
  • [35] Anode effect prediction based on a singular value thresholding and extreme gradient boosting approach
    Zhou, Kai-Bo
    Zhang, Zhi-Xin
    Liu, Jie
    Hu, Zhong-Xu
    Duan, Xiao-Kun
    Xu, Qi
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2019, 30 (01)
  • [36] Grid-based Urban Fire Prediction Using Extreme Gradient Boosting (XGBoost)
    Oh, Haeng Yeol
    Jeong, Meong-Hun
    SENSORS AND MATERIALS, 2022, 34 (12) : 4879 - 4890
  • [37] Business Failure Prediction Based on a Cost-Sensitive Extreme Gradient Boosting Machine
    Zou, Yao
    Gao, Changchun
    Gao, Han
    IEEE ACCESS, 2022, 10 : 42623 - 42639
  • [38] Prediction of voltage stability margin in power system based on extreme gradient boosting algorithm
    Wang H.-F.
    Zhang C.-Y.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2020, 54 (03) : 606 - 613
  • [39] Multiclassification Prediction of Clay Sensitivity Using Extreme Gradient Boosting Based on Imbalanced Dataset
    Ma, Tao
    Wu, Lizhou
    Zhu, Shuairun
    Zhu, Hongzhou
    APPLIED SCIENCES-BASEL, 2022, 12 (03)
  • [40] Clinical Prediction of Heart Failure in Hemodialysis Patients: Based on the Extreme Gradient Boosting Method
    Wang, Yanfeng
    Miao, Xisha
    Xiao, Gang
    Huang, Chun
    Sun, Junwei
    Wang, Ying
    Li, Panlong
    You, Xu
    FRONTIERS IN GENETICS, 2022, 13