An Empirical Study of Model-Agnostic Techniques for Defect Prediction Models

被引:90
作者
Jiarpakdee, Jirayus [1 ]
Tantithamthavorn, Chakkrit [1 ]
Dam, Hoa Khanh [2 ]
Grundy, John [1 ]
机构
[1] Monash Univ, Fac Informat Technol, Clayton, Vic 3800, Australia
[2] Univ Wollongong, Sch Comp & Informat Technol, Fac Engn & Informat Sci, Wollongong, NSW W2522, Australia
基金
澳大利亚研究理事会;
关键词
Explainable software analytics; software quality assurance; defect prediction models; model-agnostic techniques; PRONE SOFTWARE MODULES; GLOBAL OPTIMIZATION;
D O I
10.1109/TSE.2020.2982385
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Software analytics have empowered software organisations to support a wide range of improved decision-making and policy-making. However, such predictions made by software analytics to date have not been explained and justified. Specifically, current defect prediction models still fail to explain why models make such a prediction and fail to uphold the privacy laws in terms of the requirement to explain any decision made by an algorithm. In this paper, we empirically evaluate three model-agnostic techniques, i.e., two state-of-the-art Local Interpretability Model-agnostic Explanations technique (LIME) and BreakDown techniques, and our improvement of LIME with Hyper Parameter Optimisation (LIME-HPO). Through a case study of 32 highly-curated defect datasets that span across 9 open-source software systems, we conclude that (1) model-agnostic techniques are needed to explain individual predictions of defect models; (2) instance explanations generated by model-agnostic techniques are mostly overlapping (but not exactly the same) with the global explanation of defect models and reliable when they are re-generated; (3) model-agnostic techniques take less than a minute to generate instance explanations; and (4) more than half of the practitioners perceive that the contrastive explanations are necessary and useful to understand the predictions of defect models. Since the implementation of the studied model-agnostic techniques is available in both Python and R, we recommend model-agnostic techniques be used in the future.
引用
收藏
页码:166 / 185
页数:20
相关论文
共 111 条
[61]   Data mining static code attributes to learn defect predictors [J].
Menzies, Tim ;
Greenwald, Jeremy ;
Frank, Art .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2007, 33 (01) :2-13
[62]   Software Analytics: So What? [J].
Menzies, Tim ;
Zimmermann, Thomas .
IEEE SOFTWARE, 2013, 30 (04) :31-37
[63]   Local versus Global Lessons for Defect Prediction and Effort Estimation [J].
Menzies, Tim ;
Butcher, Andrew ;
Cok, David ;
Marcus, Andrian ;
Layman, Lucas ;
Shull, Forrest ;
Turhan, Burak ;
Zimmermann, Thomas .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2013, 39 (06) :822-834
[64]   Special issue on repeatable results in software engineering prediction [J].
Menzies, Tim ;
Shepperd, Martin .
EMPIRICAL SOFTWARE ENGINEERING, 2012, 17 (1-2) :1-17
[65]   Agile software development: Human values and culture [J].
Miller, KW ;
Larson, DK .
IEEE TECHNOLOGY AND SOCIETY MAGAZINE, 2005, 24 (04) :36-42
[66]   Explanation in artificial intelligence: Insights from the social sciences [J].
Miller, Tim .
ARTIFICIAL INTELLIGENCE, 2019, 267 :1-38
[67]  
Mullen KM, 2011, J STAT SOFTW, V40, P1
[68]  
Nagappan N., 2006, 28th International Conference on Software Engineering Proceedings, P452, DOI 10.1145/1134285.1134349
[69]   Heterogeneous Defect Prediction [J].
Nam, Jaechang ;
Fu, Wei ;
Kim, Sunghun ;
Menzies, Tim ;
Tan, Lin .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2018, 44 (09) :874-896
[70]   Gradient boosting machines, a tutorial [J].
Natekin, Alexey ;
Knoll, Alois .
FRONTIERS IN NEUROROBOTICS, 2013, 7