Facilitating Feasibility Analysis: The Pilot Defects Prediction Dataset Maker

Cited by: 4
Authors
Falessi, Davide [1]
Moede, Max Jason [1]
Affiliations
[1] California Polytechnic State University, San Luis Obispo, CA 93407, USA
Source
Proceedings of the 4th ACM SIGSOFT International Workshop on Software Analytics (SWAN'18) | 2018
Keywords
Defects prediction; Empirical software
DOI
10.1145/3278142.3278147
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Our industrial experience in institutionalizing defect prediction models in the software industry shows that the first step is to measure prediction metrics and defects to assess the feasibility of the tool, i.e., whether the accuracy of the defect prediction tool is higher than that of a random predictor. However, computing prediction metrics is time-consuming and error-prone. Thus, the feasibility analysis has a cost that requires some initial investment by potential clients. This initial investment acts as a barrier to convincing potential clients of the benefits of institutionalizing a software prediction model. To reduce this barrier, in this paper we present the Pilot Defects Prediction Dataset Maker (PDPDM), a desktop application for measuring metrics to use for defect prediction. PDPDM receives as input the repository information of a software project and provides as output, in an easy and replicable way, a dataset containing a set of 17 well-defined product and process metrics, such as size and smells, that have been shown to be useful for defect prediction. PDPDM avoids the use of outdated datasets and allows researchers and practitioners to create defect datasets without writing any lines of code.
Pages: 15-18
Page count: 4