Software fault prediction: A literature review and current trends

被引:188
作者
Catal, Cagatay [1 ]
机构
[1] Sci & Technol Res Council Turkey TUBITAK, Marmara Res Ctr, Inst Informat Technol, Kocaeli, Turkey
关键词
Machine learning; Automated fault prediction models; Expert systems; Software quality engineering; Software engineering; Statistical methods; STATIC CODE ATTRIBUTES; EMPIRICAL VALIDATION; QUALITY ESTIMATION; PRONE CLASSES; DEFECT; METRICS; MODELS;
D O I
10.1016/j.eswa.2010.10.024
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Software engineering discipline contains several prediction approaches such as test effort prediction, correction cost prediction, fault prediction, reusability prediction, security prediction, effort prediction, and quality prediction. However, most of these prediction approaches are still in preliminary phase and more research should be conducted to reach robust models. Software fault prediction is the most popular research area in these prediction approaches and recently several research centers started new projects on this area. In this study, we investigated 90 software fault prediction papers published between year 1990 and year 2009 and then we categorized these papers according to the publication year. This paper surveys the software engineering literature on software fault prediction and both machine learning based and statistical based approaches are included in this survey. Papers explained in this article reflect the outline of what was published so far, but naturally this is not a complete review of all the papers published so far. This paper will help researchers to investigate the previous studies from metrics, methods, datasets, performance evaluation metrics, and experimental results perspectives in an easy and effective manner. Furthermore, current trends are introduced and discussed. (C) 2010 Elsevier Ltd. All rights reserved.
引用
收藏
页码:4626 / 4636
页数:11
相关论文
共 89 条
[1]  
Alan O., 2009, P ISCIS 2009 C GUZ 1
[2]  
Amaral de Almeida M., 1999, Foundations of Intelligent Systems. 11th International Symposium, ISMIS'99. Proceedings, P565
[3]   A systematic and comprehensive investigation of methods to build and evaluate fault prediction models [J].
Arisholm, Erik ;
Briand, Lionel C. ;
Johannessen, Eivind B. .
JOURNAL OF SYSTEMS AND SOFTWARE, 2010, 83 (01) :2-17
[4]   Regression via Classification applied on software defect estimation [J].
Bibi, S. ;
Tsoumakas, G. ;
Stamelos, I. ;
Vlahavas, I. .
EXPERT SYSTEMS WITH APPLICATIONS, 2008, 34 (03) :2091-2101
[5]   Prediction of run-time failures using static product quality metrics [J].
Binkley, AB ;
Schach, SR .
SOFTWARE QUALITY JOURNAL, 1998, 7 (02) :141-147
[6]  
Binkley D., 2007, SOFTW FAULT PRED US
[7]  
Boetticher G., 2006, SERIES SOFTWARE ENG
[8]   DEVELOPING INTERPRETABLE MODELS WITH OPTIMIZED SET REDUCTION FOR IDENTIFYING HIGH-RISK SOFTWARE COMPONENTS [J].
BRIAND, LC ;
BASILI, VR ;
HETMANSKI, CJ .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1993, 19 (11) :1028-1044
[9]  
Catal C., 2009, P WORLD C ENG WCE 20, V1
[10]   Unlabelled extra data do not always mean extra performance for semi-supervised fault prediction [J].
Catal, Cagatay ;
Diri, Banu .
EXPERT SYSTEMS, 2009, 26 (05) :458-471