Deep Feature Learning to Quantitative Prediction of Software Defects

被引:6
作者
Qiao, Lei [1 ]
Li, Guangjie [2 ]
Yu, Daohua [1 ]
Liu, Hui [1 ]
机构
[1] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing, Peoples R China
[2] Natl Innovat Inst Def Technol, Management Dept Collaborat Innovat, Beijing, Peoples R China
来源
2021 IEEE 45TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2021) | 2021年
关键词
Software defect prediction; deep feature learning; convolutional neural network; software metrics; number of defects; regression model;
D O I
10.1109/COMPSAC51774.2021.00204
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Defect prediction forecasts defect proneness or the number of defects contained in software systems. It is frequently employed to efficiently prioritize and allocate the limited testing resources to the modules that are more likely to be defective during the process of software development and maintenance. Consequently, a number of defect prediction approaches have been proposed. Most of the existing approaches on defect prediction regard defect prediction as a classification problem in which programs are classified as buggy or non-buggy. However, identifying the defect proneness of a given software module is not sufficient in practical software testing. The research on predicting the number of defects is limited and the performances of these approaches are constantly being optimized and improved. Therefore, in this paper, we propose a novel approach that leverages a convolutional neural network to predict the number of defects in software systems automatically. First, we preprocess the PROMISE dataset, which involves performing natural logarithm transformation and data normalization. Second, we feed the preprocessed dataset to a specially designed convolutional neural network-based model to predict the number of defects. Third, we rank the software modules according to the corresponding predicted number of defects in descending order. We also evaluate the proposed approach on a well-known dataset by cross-validation. The evaluation results suggest that the proposed approach is both accurate and robust, and it improves the state of the art. On average, it significantly improves the Kendall correlation coefficient by 16% and the fault-percentile-average by 4%.
引用
收藏
页码:1401 / 1402
页数:2
相关论文
共 7 条
[1]  
Chen Mingming., 2015, SEKE, P397
[2]   Software defect number prediction: Unsupervised vs supervised methods [J].
Chen, Xiang ;
Zhang, Dun ;
Zhao, Yingquan ;
Cui, Zhanqi ;
Ni, Chao .
INFORMATION AND SOFTWARE TECHNOLOGY, 2019, 106 :161-181
[3]   A critique of software defect prediction models [J].
Fenton, NE ;
Neil, M .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1999, 25 (05) :675-689
[4]   A new measure of rank correlation [J].
Kendall, MG .
BIOMETRIKA, 1938, 30 :81-93
[5]   An empirical study of some software fault prediction techniques for the number of faults prediction [J].
Rathore, Santosh S. ;
Kumar, Sandeep .
SOFT COMPUTING, 2017, 21 (24) :7417-7434
[6]   The Impact of Automated Parameter Optimization on Defect Prediction Models [J].
Tantithamthavorn, Chakkrit ;
McIntosh, Shane ;
Hassan, Ahmed E. ;
Matsumoto, Kenichi .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2019, 45 (07) :683-711
[7]   Comparing the effectiveness of several modeling methods for fault prediction [J].
Weyuker, Elaine J. ;
Ostrand, Thomas J. ;
Bell, Robert M. .
EMPIRICAL SOFTWARE ENGINEERING, 2010, 15 (03) :277-295