With-in-project defect prediction using bootstrap aggregation based diverse ensemble learning technique

被引:18
作者
Bhutamapuram, Umamaheswara Sharma [1 ]
Sadam, Ravichandra [1 ]
机构
[1] Natl Inst Technol Warangal, Dept Comp Sci & Engn, Warangal 506001, Telangana, India
关键词
Bagging; Diversity generation; Ensemble learning; Software defect prediction; With-in-project defect prediction; FEATURE-SELECTION; SOFTWARE; CLASSIFIERS; QUALITY;
D O I
10.1016/j.jksuci.2021.09.010
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Predicting the defect-proneness of a module can reduce the time, effort, manpower, and consequently the cost to develop a software project. Since the causes of software defects are difficult to identify, a wide range of machine learning models are still being developed to build a high performing prediction sys-tems. For this reason, an hybrid approach called - diverse ensemble learning technique (DELT), that adopts two diversity generation schemes such as bootstrap aggregation and multi-inducer concepts, is proposed for with-in-project defect prediction (WPDP) problem in order to mitigate the low classification rates of the prediction model. To predict the final class-label for any unlabeled test module, the proposed DELT employs the principle of majority voting. An extensive set of experiments are conducted on 43 pub-licly available PROMISE and NASA datasets. The experimental results are promising since it improves the generalization performance in classifying the defect proneness of the software module.(c) 2021 The Authors. Published by Elsevier B.V. on behalf of King Saud University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:8675 / 8691
页数:17
相关论文
共 57 条
[11]   An Empirical Study on Heterogeneous Defect Prediction Approaches [J].
Chen, Haowen ;
Jing, Xiao-Yuan ;
Li, Zhiqiang ;
Wu, Di ;
Peng, Yi ;
Huang, Zhiguo .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2021, 47 (12) :2803-2822
[12]   Software defect prediction using relational association rule mining [J].
Czibula, Gabriela ;
Marian, Zsuzsanna ;
Czibula, Istvan Gergely .
INFORMATION SCIENCES, 2014, 264 :260-278
[13]   Evaluating defect prediction approaches: a benchmark and an extensive comparison [J].
D'Ambros, Marco ;
Lanza, Michele ;
Robbes, Romain .
EMPIRICAL SOFTWARE ENGINEERING, 2012, 17 (4-5) :531-577
[14]   Dynamic Selection of Classifiers in Bug Prediction: An Adaptive Method [J].
Di Nucci, Dario ;
Palomba, Fabio ;
Oliveto, Rocco ;
De Lucia, Andrea .
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2017, 1 (03) :202-212
[15]   Predicting defect-prone software modules using support vector machines [J].
Elish, Karim O. ;
Elish, Mahmoud O. .
JOURNAL OF SYSTEMS AND SOFTWARE, 2008, 81 (05) :649-660
[16]   A critique of software defect prediction models [J].
Fenton, NE ;
Neil, M .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1999, 25 (05) :675-689
[17]   A Review on Ensembles for the Class Imbalance Problem: Bagging-, Boosting-, and Hybrid-Based Approaches [J].
Galar, Mikel ;
Fernandez, Alberto ;
Barrenechea, Edurne ;
Bustince, Humberto ;
Herrera, Francisco .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 42 (04) :463-484
[18]   A Large-Scale Study of the Impact of Feature Selection Techniques on Defect Classification Models [J].
Ghotra, Baljinder ;
McIntosh, Shane ;
Hassan, Ahmed E. .
2017 IEEE/ACM 14TH INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES (MSR 2017), 2017, :146-157
[19]   Revisiting the Impact of Classification Techniques on the Performance of Defect Prediction Models [J].
Ghotra, Baljinder ;
McIntosh, Shane ;
Hassan, Ahmed E. .
2015 IEEE/ACM 37TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, VOL 1, 2015, :789-800
[20]   Improving malware detection using big data and ensemble learning [J].
Gupta, Deepak ;
Rani, Rinkle .
COMPUTERS & ELECTRICAL ENGINEERING, 2020, 86