Assessing Practitioner Beliefs about Software Defect Prediction

被引:17
作者
Shrikanth, N. C. [1 ]
Menzies, Tim [1 ]
机构
[1] North Carolina State Univ, Raleigh, NC 27695 USA
来源
2020 IEEE/ACM 42ND INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING: SOFTWARE ENGINEERING IN PRACTICE (ICSE-SEIP) | 2020年
基金
美国国家科学基金会;
关键词
defects; beliefs; practitioner; empirical software engineering;
D O I
10.1145/3377813.3381367
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Just because software developers say they believe in "X", that does not necessarily mean that "X" is true. As shown here, there exist numerous beliefs listed in the recent Software Engineering literature which are only supported by small portions of the available data. Hence we ask what is the source of this disconnect between beliefs and evidence?. To answer this question we look for evidence for ten beliefs within 300,000+ changes seen in dozens of open-source projects. Some of those beliefs had strong support across all the projects; specifically, "A commit that involves more added and removed lines is more bug-prone" and "Files with fewer lines contributed by their owners (who contribute most changes) are bug-prone". Most of the widely-held beliefs studied are only sporadically supported in the data; i.e. large effects can appear in project data and then disappear in subsequent releases. Such sporadic support explains why developers believe things that were relevant to their prior work, but not necessarily their current work. Our conclusion will be that we need to change the nature of the debate with Software Engineering. Specifically, while it is important to report the effects that hold right now, it is also important to report on what effects change over time.
引用
收藏
页码:182 / 190
页数:9
相关论文
共 37 条
[21]   Ranking and Clustering Software Cost Estimation Models through a Multiple Comparisons Algorithm [J].
Mittas, Nikolaos ;
Angelis, Lefteris .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2013, 39 (04) :537-551
[22]   Curating GitHub for engineered software projects [J].
Munaiah, Nuthan ;
Kroh, Steven ;
Cabrey, Craig ;
Nagappan, Meiyappan .
EMPIRICAL SOFTWARE ENGINEERING, 2017, 22 (06) :3219-3253
[23]   An Empirical Study of Goto in C Code from GitHub Repositories [J].
Nagappan, Meiyappan ;
Robbes, Romain ;
Kamei, Yasutaka ;
Tanter, Eric ;
McIntosh, Shane ;
Mockus, Audris ;
Hassan, Ahmed E. .
2015 10TH JOINT MEETING OF THE EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND THE ACM SIGSOFT SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING (ESEC/FSE 2015) PROCEEDINGS, 2015, :404-414
[24]   Use of relative code churn measures to predict system defect density [J].
Nagappan, N ;
Ball, T .
ICSE 05: 27TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, PROCEEDINGS, 2005, :284-292
[25]  
Passos C., 2011, ESEM'11
[26]  
Petersen K, 2009, INT SYMP EMP SOFTWAR, P402
[27]   A Large Scale Study of Programming Languages and Code Quality in Github [J].
Ray, Baishakhi ;
Posnett, Daryl ;
Filkov, Vladimir ;
Devanbu, Premkumar .
22ND ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING (FSE 2014), 2014, :155-165
[28]  
Sliwerski J., 2005, MSR 05, V30, P1, DOI [DOI 10.1145/1083142.1083147, 10.1145/1083142.1083147]
[29]  
Thong Hoang, 2019, 2019 IEEE/ACM 16th International Conference on Mining Software Repositories (MSR), P34, DOI 10.1109/MSR.2019.00016
[30]   A critique and improvement of the CL common language effect size statistics of McGraw and Wong [J].
Vargha, A ;
Delaney, HD .
JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICS, 2000, 25 (02) :101-132