Software defect prediction: Heuristics for weighted Naive Bayes

Cited by: 0
Authors
Turhan, Burak [1]
Bener, Ayse [1]
Affiliations
[1] Bogazici Univ, Dept Comp Engn, TR-34342 Istanbul, Turkey
Source
ICSOFT 2007: PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON SOFTWARE AND DATA TECHNOLOGIES, VOL SE: SOFTWARE ENGINEERING | 2007
Keywords
empirical software engineering; software quality; defect prediction; software metrics; Naive Bayes; feature weighting
DOI
Not available
Chinese Library Classification
TP31 [Computer software]
Discipline classification codes
081202; 0835
Abstract
Defect prediction is an important topic in software quality research. Statistical models for defect prediction can be built from project repositories, which store software metrics and defect information that are then matched with software modules. Naive Bayes is a well-known, simple statistical technique that assumes the 'independence' and 'equal importance' of features, assumptions that do not hold in many problems. Nevertheless, Naive Bayes performs well on a wide spectrum of prediction problems. This paper addresses the 'equal importance' assumption of Naive Bayes. We propose that heuristics can be used to assign weights to features according to their importance, thereby improving defect prediction performance. We compare the performance of weighted Naive Bayes and standard Naive Bayes predictors on publicly available datasets. Our experimental results indicate that assigning weights to software metrics increases prediction performance significantly.
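The abstract does not say how the weights enter the classifier or which heuristics produce them. As a minimal sketch of one common formulation, assuming Gaussian class-conditional densities, each feature's log-likelihood can be scaled by its weight, so that equal weights of 1 recover standard Naive Bayes. The function names, the toy data, and the weight values below are hypothetical illustrations, not the authors' method.

```python
import math
from collections import defaultdict


def fit_gaussian_nb(X, y):
    """Estimate class priors and per-feature mean/variance for each class."""
    by_class = defaultdict(list)
    for row, label in zip(X, y):
        by_class[label].append(row)
    model = {}
    for label, rows in by_class.items():
        n = len(rows)
        columns = list(zip(*rows))
        means = [sum(col) / n for col in columns]
        # A small floor on the variance avoids division by zero.
        variances = [
            max(sum((v - m) ** 2 for v in col) / n, 1e-9)
            for col, m in zip(columns, means)
        ]
        model[label] = (n / len(X), means, variances)
    return model


def log_gaussian(x, mean, var):
    """Log-density of a univariate Gaussian."""
    return -0.5 * (math.log(2 * math.pi * var) + (x - mean) ** 2 / var)


def predict(model, x, weights=None):
    """Return the class with the highest weighted log-posterior score.

    Assumption: weights act as exponents on the feature likelihoods,
    i.e. score(c) = log P(c) + sum_i w_i * log p(x_i | c).
    """
    if weights is None:
        weights = [1.0] * len(x)  # equal importance: standard Naive Bayes
    scores = {}
    for label, (prior, means, variances) in model.items():
        scores[label] = math.log(prior) + sum(
            w * log_gaussian(v, m, s)
            for w, v, m, s in zip(weights, x, means, variances)
        )
    return max(scores, key=scores.get)
```

Calling `predict(model, x)` without weights gives the unweighted predictor, while passing a list such as `[1.0, 0.2]` down-weights the second metric. How such weights should be chosen is exactly the question the paper's heuristics address and is left open here.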
Pages: 244-249
Page count: 6
Related papers
26 in total
[1]  
ALPAYDIN E, 2004, INTO MACHINE LEARNIN
[2]  
[Anonymous], P C UNC ART INT
[3]  
[Anonymous], 2004, P WORKSH PRED SOFTW
[4]   Optimal project feature weights in analogy-based cost estimation: Improvement and limitations [J].
Auer, M ;
Trendowicz, A ;
Graser, B ;
Haunschmid, E ;
Biffl, S .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2006, 32 (02) :83-92
[5]   A validation of object-oriented design metrics as quality indicators [J].
Basili, VR ;
Briand, LC ;
Melo, WL .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1996, 22 (10) :751-761
[6]   On the optimality of the simple Bayesian classifier under zero-one loss [J].
Domingos, P ;
Pazzani, M .
MACHINE LEARNING, 1997, 29 (2-3) :103-130
[7]   Quantitative analysis of faults and failures in a complex software system [J].
Fenton, NE ;
Ohlsson, N .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2000, 26 (08) :797-814
[8]   A critique of software defect prediction models [J].
Fenton, NE ;
Neil, M .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1999, 25 (05) :675-689
[9]   A decision tree-based attribute weighting filter for naive Bayes [J].
Hall, Mark .
KNOWLEDGE-BASED SYSTEMS, 2007, 20 (02) :120-126
[10]  
Harrold Mary Jean, 2000, P C FUT SOFTW ENG, P61