Prediction of software fault-prone classes using ensemble random forest with adaptive synthetic sampling algorithm

被引:24
作者
Balaram, A. [1 ]
Vasundra, S. [1 ]
机构
[1] JNTUA Univ, Dept CSE, Anantapur, Andhra Pradesh, India
关键词
Adaptive synthetic sampling; Butterfly optimization; Ensemble random forest; Imbalanced data; Software fault prediction; MACHINE;
D O I
10.1007/s10515-021-00311-z
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The process of predicting fault module in software is known as Software Fault Prediction (SFP) which is important for releasing software versions that are dependent on the predefined metrics due to historical faults in software. The fault prediction in software such as components, classes and modules, at an early stage in the development cycle, is important as it significantly contributes to time reduction and cost reduction. Therefore, the modules that are used for processing each step is reduced by the unnecessary efforts eliminated the faults during development process. However, the problem of imbalanced dataset becomes a significant challenge during SFP for software fault prediction at an early stage. The limitations such as inclusion of software metric for SFP models, cost effectiveness of the fault and the fault density prediction, are still few obstacles faced by research. The proposed Butterfly optimization performs feature selection that helps to predict meticulous and remarkable results by developing the applications of Machine Learning techniques. The present research uses Ensemble Random Forest with Adaptive Synthetic Sampling (E-RF-ADASYN) for fault prediction by using various classifiers which is mentioned in the proposed method section. The proposed E-RF-ADASYN obtained Area Under Curve (AUC) of 0.854767 better when compared with the existing method Rough-KNN Noise-Filtered Easy Ensemble (RKEE) of 0.771.
引用
收藏
页数:21
相关论文
共 23 条
[1]  
Ahmed Md Razu, 2020, ICCDE 2020: Proceedings of 2020 the 6th International Conference on Computing and Data Engineering, P247, DOI 10.1145/3379247.3379278
[2]   The Influence of Deep Learning Algorithms Factors in Software Fault Prediction [J].
Al Qasem, Osama ;
Akour, Mohammed ;
Alenezi, Mamdouh .
IEEE ACCESS, 2020, 8 :63945-63960
[3]   Software fault prediction using particle swarm algorithm with genetic algorithm and support vector machine classifier [J].
Alsghaier, Hiba ;
Akour, Mohammed .
SOFTWARE-PRACTICE & EXPERIENCE, 2020, 50 (04) :407-427
[4]   An improved petri net for fault analysis of an electronic system with hybrid fault of software and hardware [J].
Bai, Song ;
Li, Yan-Feng ;
Huang, Hong-Zhong ;
Yu, Aodi ;
Zeng, Ying .
ENGINEERING FAILURE ANALYSIS, 2021, 120
[5]  
Bakka K. V., 2016, INT J ENG RES, V5, P404
[6]   WR-ELM: Weighted Regularization Extreme Learning Machine for Imbalance Learning in Software Fault Prediction [J].
Bal, Pravas Ranjan ;
Kumar, Sandeep .
IEEE TRANSACTIONS ON RELIABILITY, 2020, 69 (04) :1355-1375
[7]   Improving Software Fault Localization by Combining Spectrum and Mutation [J].
Cui, Zhanqi ;
Jia, Minghua ;
Chen, Xiang ;
Zheng, Liwei ;
Liu, Xiulei .
IEEE ACCESS, 2020, 8 :172296-172307
[8]   An Artificial Intelligence paradigm for troubleshooting software bugs [J].
Elmishali, Amir ;
Stern, Roni ;
Kalech, Meir .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2018, 69 :147-156
[9]   A Fault Prediction Algorithm Based on Rough Sets and Back Propagation Neural Network for Vehicular Networks [J].
Geng, Rong ;
Wang, Xiaojie ;
Ye, Ning ;
Liu, Jun .
IEEE ACCESS, 2018, 6 :74984-74992
[10]   A dimensionality reduction-based efficient software fault prediction using Fisher linear discriminant analysis (FLDA) [J].
Kalsoom, Anum ;
Maqsood, Muazzam ;
Ghazanfar, Mustansar Ali ;
Aadil, Farhan ;
Rho, Seungmin .
JOURNAL OF SUPERCOMPUTING, 2018, 74 (09) :4568-4602