Software Defect Prediction Based on Fuzzy Cost Broad Learning System

Times cited: 0
Authors
Cao, Heling [1, 2, 3, 4]
Cui, Zhiying [1, 2, 3, 4]
Chu, Yonghe [1, 2, 3, 4]
Gong, Lina [5]
Liu, Guangen [1, 2, 3, 4]
Wang, Yun [1, 2, 3, 4]
Tian, Fangchao [1, 2, 3, 4]
Li, Peng [4]
Ge, Haoyang [1, 2, 3, 4]
Affiliations
[1] Henan Univ Technol, Key Lab Grain Informat Proc & Control, Minist Educ, Zhengzhou, Peoples R China
[2] Henan Univ Technol, Henan Key Lab Grain Photoelect Detect & Control, Zhengzhou, Peoples R China
[3] Henan Univ Technol, Coll Informat Sci & Engn, Zhengzhou, Peoples R China
[4] Henan Univ Technol, Ctr Complex Sci, Zhengzhou, Peoples R China
[5] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China
Keywords
broad learning system; cost matrix; feature space; fuzzy membership functions; software defect prediction; neural networks
DOI
10.1155/int/6463038
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Software defect prediction (SDP) is an effective approach to ensuring software reliability. Machine learning models have been widely employed in SDP, but many of them ignore the impact of class imbalance, noise and outliers on prediction performance. This study proposes a fuzzy cost broad learning system (FC-BLS), which not only handles class imbalance but also takes the specific sample distribution into account to address noise and outliers in software defect datasets. The approach introduces a cost matrix and fuzzy membership functions into the broad learning system (BLS). The cost matrix prioritises the training errors on minority-class samples, which places the classification hyperplane in a more reasonable position. The fuzzy membership functions calculate the membership degree of each sample in the feature mapping space to remove the prediction error caused by noise and outlier samples. The optimisation problem is then constructed so that minority-class and normal instances carry relatively high costs, whereas majority-class, noise and outlier instances carry relatively low costs. Experiments on nine NASA SDP datasets demonstrate the effectiveness of the proposed method on most datasets.
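As a rough illustration of the weighting idea in the abstract (not the authors' formulation), the sketch below gives each training sample a weight equal to a class cost multiplied by a fuzzy membership degree. The cost rule (inverse class frequency), the membership function (linear decay with distance from the class centre), the `delta` offset and the names `class_centres` and `sample_weights` are all assumptions made for this example. The paper computes memberships in the BLS feature mapping space, whereas this sketch works directly on raw feature vectors.

```python
import math
from collections import Counter


def class_centres(X, y):
    """Return the mean feature vector of each class label."""
    counts = Counter(y)
    sums = {}
    for x, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(x))
        for j, value in enumerate(x):
            acc[j] += value
    return {c: [s / counts[c] for s in sums[c]] for c in sums}


def sample_weights(X, y, delta=1e-3):
    """Per-sample weight = class cost * fuzzy membership (both assumed forms)."""
    counts = Counter(y)
    n = len(y)
    centres = class_centres(X, y)
    # Distance of every sample to the centre of its own class.
    dists = [math.dist(x, centres[label]) for x, label in zip(X, y)]
    # Class radius: the largest such distance within each class.
    radius = {c: max(d for d, lab in zip(dists, y) if lab == c) for c in counts}
    weights = []
    for d, label in zip(dists, y):
        # Assumed cost rule: rarer classes receive a higher cost.
        cost = n / (len(counts) * counts[label])
        # Assumed membership: 1 at the class centre, close to 0 at the class edge.
        membership = 1.0 - d / (radius[label] + delta)
        weights.append(cost * membership)
    return weights
```

Under these assumptions, a minority-class sample near its class centre receives the largest weight, while a majority-class outlier receives a weight close to zero. Such weights could then scale the per-sample squared errors when the output layer is fitted by ridge regression.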
Pages: 13