Predicting defect-prone software modules using support vector machines

被引:305
作者
Elish, Karim O. [1 ]
Elish, Mahmoud O. [1 ]
机构
[1] King Fahd Univ Petr & Minerals, Dept Informat & Comp Sci, Dhahran 31261, Saudi Arabia
关键词
software metrics; defect-prone modules; support vector machines; predictive models;
D O I
10.1016/j.jss.2007.07.040
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Effective prediction of defect-prone software modules can enable software developers to focus quality assurance activities and allocate effort and resources more efficiently. Support vector machines (SVM) have been successfully applied for solving both classification and regression problems in many applications. This paper evaluates the capability of SVM in predicting defect-prone software modules and compares its prediction performance against eight statistical and machine learning models in the context of four NASA datasets. The results indicate that the prediction performance of SVM is generally better than, or at least, is competitive against the compared models. (C) 2007 Elsevier Inc. All rights reserved.
引用
收藏
页码:649 / 660
页数:12
相关论文
共 53 条
[1]  
Abe S., 2005, ADV PTRN RECOGNIT, P343
[2]  
[Anonymous], THESIS U YORK
[3]   Combining and adapting software quality predictive models by genetic algorithms [J].
Azar, D ;
Precup, D ;
Bouktif, S ;
Kégl, B ;
Sahraoui, H .
ASE 2002: 17TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, 2002, :285-288
[4]   Identifying genes related to drug anticancer mechanisms using support vector machine [J].
Bao, L ;
Sun, ZR .
FEBS LETTERS, 2002, 521 (1-3) :109-114
[5]   A validation of object-oriented design metrics as quality indicators [J].
Basili, VR ;
Briand, LC ;
Melo, WL .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1996, 22 (10) :751-761
[6]   THE TAME PROJECT - TOWARDS IMPROVEMENT-ORIENTED SOFTWARE ENVIRONMENTS [J].
BASILI, VR ;
ROMBACH, HD .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1988, 14 (06) :758-773
[7]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[8]   DEVELOPING INTERPRETABLE MODELS WITH OPTIMIZED SET REDUCTION FOR IDENTIFYING HIGH-RISK SOFTWARE COMPONENTS [J].
BRIAND, LC ;
BASILI, VR ;
HETMANSKI, CJ .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1993, 19 (11) :1028-1044
[9]   Drug design by machine learning: support vector machines for pharmaceutical data analysis [J].
Burbidge, R ;
Trotter, M ;
Buxton, B ;
Holden, S .
COMPUTERS & CHEMISTRY, 2001, 26 (01) :5-14
[10]   A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167