Predicting Software Defects with Explainable Machine Learning

被引:10
|
作者
Santos, Geanderson [1 ]
Figueiredo, Eduardo [1 ]
Veloso, Adriano [1 ]
Viggiato, Markos [2 ]
Ziviani, Nivio [1 ]
机构
[1] Univ Fed Minas Gerais, Belo Horizonte, MG, Brazil
[2] Univ Alberta, Edmonton, AB, Canada
来源
PROCEEDINGS OF THE 19TH BRAZILIAN SYMPOSIUM ON SOFTWARE QUALITY, SBOS 2020 | 2020年
关键词
software defects; explainable models; NASA datasets; SHAP values;
D O I
10.1145/3439961.3439979
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Most software systems must evolve to cope with stakeholders' requirements and fix existing defects. Hence, software defect prediction represents an area of interest in both academia and the software industry. As a result, predicting software defects can help the development team to maintain substantial levels of software quality. For this reason, machine learning models have increased in popularity for software defect prediction and have demonstrated effectiveness in many scenarios. In this paper, we evaluate a machine learning approach for selecting features to predict software module defects. We use a tree boosting algorithm that receives as input a training set comprising records of software features encoding characteristics of each module and outputs whether the corresponding module is defective prone. For nine projects within the widely known NASA data program, we build prediction models from a set of easy-to-compute module features. We then sample this sizable model space by randomly selecting software features to compose each model. This significant number of models allows us to structure our work along model understandability and predictive accuracy. We argue that explaining model predictions is meaningful to provide information to developers on features related to each module defective-prone. We show that (i) features that contribute most to finding the best models may vary depending on the project, and (ii) effective models are highly understandable based on a survey with 40 developers.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Explainable Software Defects Classification Using SMOTE and Machine Learning
    Jude A.
    Uddin J.
    Annals of Emerging Technologies in Computing, 2024, 8 (01) : 35 - 49
  • [2] Software Defects Identification: Results Using Machine Learning and Explainable Artificial Intelligence Techniques
    Begum, Momotaz
    Shuvo, Mehedi Hasan
    Ashraf, Imran
    Al Mamun, Abdullah
    Uddin, Jia
    Samad, Md Abdus
    IEEE ACCESS, 2023, 11 : 132750 - 132765
  • [3] Predicting Dengue Outbreaks with Explainable Machine Learning
    Aleixo, Robson
    Kon, Fabio
    Rocha, Rudi
    Camargo, Marcela Santos
    de Camargo, Raphael Y.
    2022 22ND IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2022), 2022, : 940 - 947
  • [4] Predicting kidney allograft survival with explainable machine learning
    Fabreti-Oliveira, Raquel A.
    Nascimento, Evaldo
    de Melo Santosa, Luiz Henrique
    de Oliveira Santos, Marina Ribeiro
    Veloso, Adriano Alonso
    TRANSPLANT IMMUNOLOGY, 2024, 85
  • [5] Predicting Software Defects in Hybrid MPI and OpenMP Parallel Programs Using Machine Learning
    Althiban, Amani S.
    Alharbi, Hajar M.
    Al Khuzayem, Lama A.
    Eassa, Fathy Elbouraey
    ELECTRONICS, 2024, 13 (01)
  • [6] Explainable machine learning models for defects detection in industrial processes
    Oliveira, Rodrigo Marcel Araujo
    Sant'Anna, Angelo Marcio Oliveira
    da Silva, Paulo Henrique Ferreira
    COMPUTERS & INDUSTRIAL ENGINEERING, 2024, 192
  • [7] Explainable machine learning for predicting the mechanical properties in bainitic steels
    Ackermann, Marc
    Iren, Deniz
    Yao, Yao
    MATERIALS & DESIGN, 2023, 230
  • [8] Predicting Hospital Stay Length Using Explainable Machine Learning
    Alsinglawi, Belal S.
    Alnajjar, Fady
    Alorjani, Mohammed S.
    Al-Shari, Osama Mohammed
    Munoz, Mauricio Novoa
    Mubin, Omar
    IEEE ACCESS, 2024, 12 : 90571 - 90585
  • [9] Explainable Machine Learning Algorithms For Predicting Glass Transition Temperatures
    Alcobaca, Edesio
    Mastelini, Saulo Martiello
    Botari, Tiago
    Pimentel, Bruno Almeida
    Cassar, Daniel Roberto
    de Leon Ferreira de Carvalho, Andre Carlos Ponce
    Zanotto, Edgar Dutra
    ACTA MATERIALIA, 2020, 188 : 92 - 100
  • [10] Explainable machine learning for predicting lung metastasis of colorectal cancer
    Zhentian Guo
    Zongming Zhang
    Limin Liu
    Yue Zhao
    Zhuo Liu
    Chong Zhang
    Hui Qi
    Jinqiu Feng
    Peijie Yao
    Scientific Reports, 15 (1)