Software defects prediction by metaheuristics tuned extreme gradient boosting and analysis based on Shapley Additive Explanations

被引:27
|
作者
Zivkovic, Tamara [1 ]
Nikolic, Bosko [1 ]
Simic, Vladimir [2 ,3 ]
Pamucar, Dragan [4 ]
Bacanin, Nebojsa [5 ]
机构
[1] Univ Belgrade, Sch Elect Engn, Bulevar Kralja Aleksandra 73, Belgrade 11000, Serbia
[2] Univ Belgrade, Fac Transport & Traff Engn, Vojvode Stepe 305, Belgrade 11000, Serbia
[3] Yuan Ze Univ, Coll Engn, Dept Ind Engn & Management, Yuandong Rd, Taoyuan City 320315, Taiwan
[4] Univ Belgrade, Fac Org Sci, Dept Operat Res & Stat, Jove Ilica 154, Belgrade 11000, Serbia
[5] Singidunum Univ, Fac Informat & Comp, Danijelova 32, Belgrade 11000, Serbia
关键词
Software testing; Software defect prediction; XGBoost; Reptile search algorithm; Metaheuristics optimization;
D O I
10.1016/j.asoc.2023.110659
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Software testing represents a crucial component of software development, and it is usually making the difference between successful and failed projects. Although it is extremely important, due to the fast pace and short deadlines of contemporary projects it is often neglected or not detailed enough due to the lack of available time, leading to the potential loss of reputation, private users' data, money, and even lives in some circumstances. In such situations, it would be vital to have the option to predict what modules are error-prone according to the collection of software metrics, and to focus testing on them, and that task is a typical classification task. Machine learning models have been frequently employed within a wide range of classification problems with significant success, and this paper proposes eXtreme gradient boosting (XGBoost) model to execute the defect prediction task. A modified variant of the well-known reptile search optimization algorithm has been suggested to carry out the calibrating of the XGBoost hyperparameters. The enhanced algorithm was named HARSA and evaluated on the collection of challenging CEC2019 benchmark functions, where it exhibited excellent performance. Later, the introduced XGBoost model that uses the proposed algorithm has been evaluated on two benchmark software testing datasets, and the simulation outcomes have been compared to other powerful swarm intelligence metaheuristics that were used in the identical experimental environment, where the proposed approach attained superior classification accuracy on both datasets. Finally, Shapley Additive Explanations analysis was conducted to discover the impact of various software metrics on the classification results.<br />& COPY; 2023 Elsevier B.V. All rights reserved.
引用
收藏
页数:32
相关论文
共 50 条
  • [1] Sensitivity analysis of slope stability based on eXtreme gradient boosting and SHapley Additive exPlanations: An exploratory study
    Lin, Hanjie
    Li, Li
    Qiang, Yue
    Zhang, Yi
    Liang, Siyu
    Xu, Xinlong
    Li, Hongjian
    Hu, Shengchao
    HELIYON, 2024, 10 (16)
  • [2] Using Shapley additive explanations to interpret extreme gradient boosting predictions of grassland degradation in Xilingol, China
    Batunacun
    Wieland, Ralf
    Lakes, Tobia
    Nendel, Claas
    GEOSCIENTIFIC MODEL DEVELOPMENT, 2021, 14 (03) : 1493 - 1510
  • [3] Gradient boosting and Shapley additive explanations for fraud detection in electricity distribution grids
    Santos, Ricardo N.
    Yamouni, Sami
    Albiero, Beatriz
    Vicente, Renato
    Silva, Juliano A.
    Souza, Tales F. B.
    Freitas Souza, Mario C. M.
    Lei, Zhili
    INTERNATIONAL TRANSACTIONS ON ELECTRICAL ENERGY SYSTEMS, 2021, 31 (09)
  • [4] Extreme gradient boosting model to assess risk of central cervical lymph node metastasis in patients with papillary thyroid carcinoma: Individual prediction using SHapley Additive exPlanations
    Zou, Ying
    Shi, Yan
    Sun, Fang
    Liu, Jihua
    Guo, Yu
    Zhang, Huanlei
    Lu, Xiudi
    Gong, Yan
    Xia, Shuang
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2022, 225
  • [5] Explainable Risk Assessment of Rockbolts' Failure in Underground Coal Mines Based on Categorical Gradient Boosting and SHapley Additive exPlanations (SHAP)
    Ibrahim, Bemah
    Ahenkorah, Isaac
    Ewusi, Anthony
    SUSTAINABILITY, 2022, 14 (19)
  • [6] Leading Predictors of COVID-19-Related Poor Mental Health in Adult Asian Indians: An Application of Extreme Gradient Boosting and Shapley Additive Explanations
    Ikram, Mohammad
    Shaikh, Nazneen Fatima
    Vishwanatha, Jamboor K.
    Sambamoorthi, Usha
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2023, 20 (01)
  • [7] Application of extreme gradient boosting and Shapley Additive explanations to predict temperature regimes inside forests from standard open-field meteorological data
    Ghafarian, Fatemeh
    Wieland, Ralf
    Luettschwager, Dietmar
    Nendel, Claas
    ENVIRONMENTAL MODELLING & SOFTWARE, 2022, 156
  • [8] Interpretable credit scoring based on an additive extreme gradient boosting
    Zou, Yao
    Xia, Meng
    Lan, Xingyu
    CHAOS SOLITONS & FRACTALS, 2025, 194
  • [9] Prediction of Cable Failures based on eXtreme Gradient Boosting
    Zhan, Huiyu
    Liu, Keyan
    Jia, Dongli
    2024 6TH ASIA ENERGY AND ELECTRICAL ENGINEERING SYMPOSIUM, AEEES 2024, 2024, : 610 - 614
  • [10] An Extreme Gradient Boosting-based Prediction for Depression
    Ibrahum, Ahmed
    Park, Kwang Ho
    Hong, Jang-Eui
    Van-Huy Pham
    Ryu, Keun Ho
    2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 1607 - 1613