The Impact of Dormant Defects on Defect Prediction: A Study of 19 Apache Projects

Cited by: 13
Authors
Falessi, Davide [1 ]
Ahluwalia, Aalok [2 ]
Di Penta, Massimiliano [3 ]
Affiliations
[1] University of Rome Tor Vergata, Rome, Italy
[2] California Polytechnic State University, San Luis Obispo, CA 93407, USA
[3] University of Sannio, Benevento, Italy
Keywords
Defect prediction; fix-inducing changes; dataset bias; software; metrics; faults; validation; quality
DOI
10.1145/3467895
CLC Classification Number
TP31 [Computer Software]
Subject Classification Number
081202; 0835
Abstract
Defect prediction models can help prioritize testing, analysis, or code review activities; they have been the subject of substantial effort in academia and of some applications in industrial contexts. A necessary precondition for creating a defect prediction model is the availability of defect data from a project's history. If this data is noisy, the resulting defect prediction model can be unreliable. One cause of noise in defect datasets is the presence of "dormant defects," i.e., defects discovered several releases after their introduction. A dormant defect can cause a class to be labeled as defect-free while it is not; such a class is called "snoring." In this article, we investigate the impact of snoring on classifiers' accuracy and the effectiveness of a possible countermeasure, i.e., dropping too-recent data from the training set. We analyze the accuracy of 15 machine learning defect prediction classifiers on data from more than 4,000 defects and 600 releases of 19 open source projects from the Apache ecosystem. Our results show that, on average across projects, (i) the presence of dormant defects decreases the recall of defect prediction classifiers, and (ii) removing from the training set the classes that are labeled as not defective in the last release significantly improves the accuracy of the classifiers. In summary, this article provides insights on how to create defect datasets that mitigate the negative effect of dormant defects on defect prediction.
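The countermeasure described in the abstract can be illustrated with a minimal sketch, assuming a pandas DataFrame with one row per class per release and hypothetical columns `release` and `is_defective`; the column names and the pandas-based filtering are illustrative assumptions, not the authors' actual pipeline.

```python
# Hedged sketch of the snoring countermeasure: drop classes labeled
# not defective in the most recent training release, since a dormant
# defect may not have surfaced yet and the label may be "snoring".
# Column names ("class_name", "release", "is_defective") are assumed.
import pandas as pd

def drop_recent_nondefective(train: pd.DataFrame) -> pd.DataFrame:
    last_release = train["release"].max()
    # A class in the newest release with a "not defective" label is a
    # snoring suspect; defective labels are kept, as they are certain.
    snoring_suspects = (train["release"] == last_release) & ~train["is_defective"]
    return train[~snoring_suspects]

# Tiny illustrative dataset: one class per row, per release.
train = pd.DataFrame({
    "class_name":   ["A.java", "B.java", "A.java", "B.java"],
    "release":      [1, 1, 2, 2],
    "is_defective": [True, False, False, False],
})
clean = drop_recent_nondefective(train)
print(clean)  # release-2 rows labeled not defective are removed
```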
Pages: 26