With-in-project defect prediction using bootstrap aggregation based diverse ensemble learning technique

被引:18
作者
Bhutamapuram, Umamaheswara Sharma [1 ]
Sadam, Ravichandra [1 ]
机构
[1] Natl Inst Technol Warangal, Dept Comp Sci & Engn, Warangal 506001, Telangana, India
关键词
Bagging; Diversity generation; Ensemble learning; Software defect prediction; With-in-project defect prediction; FEATURE-SELECTION; SOFTWARE; CLASSIFIERS; QUALITY;
D O I
10.1016/j.jksuci.2021.09.010
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Predicting the defect-proneness of a module can reduce the time, effort, manpower, and consequently the cost to develop a software project. Since the causes of software defects are difficult to identify, a wide range of machine learning models are still being developed to build a high performing prediction sys-tems. For this reason, an hybrid approach called - diverse ensemble learning technique (DELT), that adopts two diversity generation schemes such as bootstrap aggregation and multi-inducer concepts, is proposed for with-in-project defect prediction (WPDP) problem in order to mitigate the low classification rates of the prediction model. To predict the final class-label for any unlabeled test module, the proposed DELT employs the principle of majority voting. An extensive set of experiments are conducted on 43 pub-licly available PROMISE and NASA datasets. The experimental results are promising since it improves the generalization performance in classifying the defect proneness of the software module.(c) 2021 The Authors. Published by Elsevier B.V. on behalf of King Saud University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:8675 / 8691
页数:17
相关论文
共 57 条
[31]  
Lyu M. R., 1996, HDB SOFTWARE RELIABI
[32]  
Malhotra R, 2020, PROCEEDINGS OF THE CONFLUENCE 2020: 10TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING, P300, DOI [10.1109/Confluence47617.2020.9058124, 10.1109/confluence47617.2020.9058124]
[33]   Improving malware detection by applying multi-inducer ensemble [J].
Menahem, Eitan ;
Shabtai, Asaf ;
Rokach, Lior ;
Elovici, Yuval .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2009, 53 (04) :1483-1494
[34]   An industrial case study of classifier ensembles for locating software defects [J].
Misirli, Ayse Tosun ;
Bener, Ayse Basar ;
Turhan, Burak .
SOFTWARE QUALITY JOURNAL, 2011, 19 (03) :515-536
[35]   Intrusion detection using an ensemble of intelligent paradigms [J].
Mukkamala, S ;
Sung, AH ;
Abraham, A .
JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2005, 28 (02) :167-182
[36]   SOFTWARE-RELIABILITY ENGINEERING - TECHNOLOGY FOR THE 1990S [J].
MUSA, JD ;
EVERETT, WW .
IEEE SOFTWARE, 1990, 7 (06) :36-43
[37]   Heterogeneous Defect Prediction [J].
Nam, Jaechang ;
Fu, Wei ;
Kim, Sunghun ;
Menzies, Tim ;
Tan, Lin .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2018, 44 (09) :874-896
[38]   An empirical study on pareto based multi-objective feature selection for software defect prediction [J].
Ni, Chao ;
Chen, Xiang ;
Wu, Fangfang ;
Shen, Yuxiang ;
Gu, Qing .
JOURNAL OF SYSTEMS AND SOFTWARE, 2019, 152 :215-238
[39]  
Opitz D., 1999, J. Artif. Intell. Res., V11, P169, DOI DOI 10.1613/JAIR.614
[40]   ENSEMBLE OF SOFTWARE DEFECT PREDICTORS: AN AHP-BASED EVALUATION METHOD [J].
Peng, Yi ;
Kou, Gang ;
Wang, Guoxun ;
Wu, Wenshuai ;
Shi, Yong .
INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2011, 10 (01) :187-206