An Empirical Study of Model-Agnostic Techniques for Defect Prediction Models

Cited by: 90
Authors
Jiarpakdee, Jirayus [1]
Tantithamthavorn, Chakkrit [1]
Dam, Hoa Khanh [2]
Grundy, John [1]
Affiliations
[1] Monash Univ, Fac Informat Technol, Clayton, Vic 3800, Australia
[2] Univ Wollongong, Sch Comp & Informat Technol, Fac Engn & Informat Sci, Wollongong, NSW W2522, Australia
Funding
Australian Research Council
Keywords
Explainable software analytics; software quality assurance; defect prediction models; model-agnostic techniques; PRONE SOFTWARE MODULES; GLOBAL OPTIMIZATION;
DOI
10.1109/TSE.2020.2982385
Chinese Library Classification number
TP31 [Computer Software]
Discipline classification codes
081202; 0835
Abstract
Software analytics have empowered software organisations to support a wide range of improved decision-making and policy-making. However, the predictions made by software analytics to date have not been explained or justified. Specifically, current defect prediction models still fail to explain why they make a given prediction, and thus fail to uphold privacy laws that require an explanation of any decision made by an algorithm. In this paper, we empirically evaluate three model-agnostic techniques: two state-of-the-art techniques, Local Interpretable Model-agnostic Explanations (LIME) and BreakDown, and our improvement of LIME with Hyper-Parameter Optimisation (LIME-HPO). Through a case study of 32 highly-curated defect datasets spanning 9 open-source software systems, we conclude that (1) model-agnostic techniques are needed to explain individual predictions of defect models; (2) instance explanations generated by model-agnostic techniques mostly overlap with (but are not exactly the same as) the global explanation of defect models and are reliable when re-generated; (3) model-agnostic techniques take less than a minute to generate instance explanations; and (4) more than half of the practitioners perceive contrastive explanations as necessary and useful for understanding the predictions of defect models. Since implementations of the studied model-agnostic techniques are available in both Python and R, we recommend that model-agnostic techniques be used in the future.
Pages: 166-185
Page count: 20