Iterated feature selection algorithms with layered recurrent neural network for software fault prediction

Cited by: 108
Authors
Turabieh, Hamza [1]
Mafarja, Majdi [2]
Li, Xiaodong [3]
Affiliations
[1] Taif Univ, Dept Informat Technol, At Taif, Saudi Arabia
[2] Birzeit Univ, Dept Comp Sci, Birzeit, Palestine
[3] RMIT Univ, Sch Sci, Melbourne, Vic, Australia
Keywords
Software fault prediction; Feature selection; Layered recurrent neural network; Object-oriented metrics; Defect prediction; Optimization; Models; System
DOI
10.1016/j.eswa.2018.12.033
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Software fault prediction (SFP) is typically used to predict faults in software components. Machine learning techniques (e.g., classification) are widely used to tackle this problem. With the huge amount of data that can be obtained by mining historical software repositories, some features (metrics) may turn out to be uncorrelated with the faults; such features mislead the learning algorithm and thus decrease its performance. One possible solution for eliminating those metrics is feature selection (FS). In this paper, a novel FS approach is proposed to enhance the performance of a layered recurrent neural network (L-RNN), which is used as a classification technique for the SFP problem. Three different wrapper FS algorithms (i.e., Binary Genetic Algorithm (BGA), Binary Particle Swarm Optimization (BPSO), and Binary Ant Colony Optimization (BACO)) were employed iteratively. To assess the performance of the proposed approach, 19 real-world software projects from the PROMISE repository are investigated and the experimental results are discussed. The area under the receiver operating characteristic curve (ROC-AUC) is used as the performance measure. The results are compared, in terms of AUC, with other state-of-the-art approaches including Naive Bayes (NB), Artificial Neural Network (ANN), logistic regression (LR), k-nearest neighbors (k-NN), and C4.5 decision trees. Our results demonstrate that the proposed approach can outperform other existing methods. (C) 2018 Elsevier Ltd. All rights reserved.
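As a rough illustration of the wrapper idea described in the abstract (a binary mask over the metrics, scored by the AUC of a classifier trained on the selected metrics), the following is a minimal sketch only. It is not the authors' method: it uses a single simple binary genetic algorithm rather than the iterated BGA/BPSO/BACO scheme, a nearest-centroid scorer as a stand-in for the L-RNN, and synthetic data instead of PROMISE projects. All function names, parameters, and data below are hypothetical.

```python
import random

def roc_auc(labels, scores):
    """AUC as the chance that a faulty item outscores a clean one (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def centroid_scores(train_X, train_y, test_X, mask):
    """Stand-in scorer (NOT an L-RNN): higher score means closer to the faulty centroid."""
    cols = [j for j, bit in enumerate(mask) if bit]
    def centroid(label):
        rows = [x for x, y in zip(train_X, train_y) if y == label]
        return [sum(r[j] for r in rows) / len(rows) for j in cols]
    c0, c1 = centroid(0), centroid(1)
    def dist(x, c):
        return sum((x[j] - cj) ** 2 for j, cj in zip(cols, c))
    return [dist(x, c0) - dist(x, c1) for x in test_X]

def fitness(mask, train, valid):
    """Wrapper fitness: validation AUC using only the masked-in features."""
    if not any(mask):
        return 0.0  # an empty feature subset cannot be scored
    scores = centroid_scores(train[0], train[1], valid[0], mask)
    return roc_auc(valid[1], scores)

def binary_ga(n_features, train, valid, pop_size=20, generations=30, p_mut=0.1, seed=0):
    """Toy binary GA: tournament selection, one-point crossover, bit-flip mutation, elitism."""
    rng = random.Random(seed)
    pop = [[rng.randint(0, 1) for _ in range(n_features)] for _ in range(pop_size)]
    best = max(pop, key=lambda m: fitness(m, train, valid))
    for _ in range(generations):
        def pick():
            a, b = rng.sample(pop, 2)
            return a if fitness(a, train, valid) >= fitness(b, train, valid) else b
        children = [best[:]]  # keep the best mask found so far
        while len(children) < pop_size:
            p1, p2 = pick(), pick()
            cut = rng.randint(1, n_features - 1)
            child = p1[:cut] + p2[cut:]
            children.append([1 - b if rng.random() < p_mut else b for b in child])
        pop = children
        best = max(pop, key=lambda m: fitness(m, train, valid))
    return best, fitness(best, train, valid)

def make_data(n, n_features, rng):
    """Synthetic data (assumption): features 0 and 1 carry signal, the rest are noise."""
    X, y = [], []
    for i in range(n):
        label = i % 2
        row = [rng.gauss(0, 1) for _ in range(n_features)]
        row[0] += 2.0 * label
        row[1] += 2.0 * label
        X.append(row)
        y.append(label)
    return X, y

rng = random.Random(1)
train, valid = make_data(60, 6, rng), make_data(60, 6, rng)
best_mask, best_auc = binary_ga(6, train, valid, pop_size=12, generations=15)
print("selected mask:", best_mask, "validation AUC:", round(best_auc, 3))
```

In a wrapper setup like this, the classifier is retrained (here, the centroids are recomputed) for every candidate mask, which is why wrapper methods tend to cost more than filter methods. Scoring the mask on the same validation split used for the search also risks an optimistic AUC, so a separate held-out test split would be needed for any real comparison.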
Pages: 27-42
Page count: 16
Related papers
50 in total
[41]   Fault Prediction Method for Distribution Network Outage based on Feature Selection and Ensemble Learning [J].
Zhang, Wen ;
Sheng, Wanxing ;
Liu, Keyan ;
Du, Songhuai ;
Jia, Dongli ;
Hu, Lijuan .
2018 5TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE 2018), 2018, :226-231
[42]   Correlation-Based Ensemble Feature Selection Using Bioinspired Algorithms and Classification Using Backpropagation Neural Network [J].
Christo, V. R. Elgin ;
Nehemiah, H. Khanna ;
Minu, B. ;
Kannan, A. .
COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2019, 2019
[43]   Impact of feature selection on neural network prediction of fused deposition modelling (FDM) print part properties [J].
Enemuoh, Emmanuel U. ;
Asante-Okyere, Solomon .
INTERNATIONAL JOURNAL OF INTERACTIVE DESIGN AND MANUFACTURING - IJIDEM, 2024, 18 (10) :7413-7427
[44]   FSBOA: feature selection using bat optimization algorithm for software fault detection [J].
Pethe, Y. S. ;
Gourisaria, M. K. ;
Singh, P. K. ;
Das, H. .
Discover Internet of Things, 2024, 4 (01)
[45]   Choosing software metrics for defect prediction: an investigation on feature selection techniques [J].
Gao, Kehan ;
Khoshgoftaar, Taghi M. ;
Wang, Huanjing ;
Seliya, Naeem .
SOFTWARE-PRACTICE & EXPERIENCE, 2011, 41 (05) :579-606
[46]   Training the Recurrent neural network by the Fuzzy Min-Max algorithm for fault prediction [J].
Zemouri, Ryad ;
Racoceanu, Daniel ;
Zerhouni, Noureddine ;
Minca, Eugenia ;
Filip, Florin .
INTELLIGENT SYSTEMS AND AUTOMATION, 2009, 1107 :85-+
[47]   CrDrcnn: Design and Development of Crow Optimization-Based Deep Recurrent Neural Network for Software Defect Prediction [J].
Reddy, S. Sai Satyanarayana ;
Kumar, Ashwani ;
Mounica, N. ;
Geetha, Donakanti .
ADVANCES IN COMPUTING AND DATA SCIENCES (ICACDS 2022), PT I, 2022, 1613 :13-25
[48]   Adaptive recurrent neural network for software defect prediction with the aid of quantum theory-particle swarm optimization [J].
Anju, A. J. ;
Judith, J. E. .
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (11) :16257-16278
[49]   Investigating the effect of dataset size, metrics sets, and feature selection techniques on software fault prediction problem [J].
Catal, Cagatay ;
Diri, Banu .
INFORMATION SCIENCES, 2009, 179 (08) :1040-1058
[50]   Dermatology Diagnosis with Feature Selection Methods and Artificial Neural Network [J].
Abdul-Rahman, Shuzlina ;
Norhan, Ahmad Khairil ;
Yusoff, Marina ;
Mohamed, Azlinah ;
Mutalib, Sofianita .
2012 IEEE EMBS CONFERENCE ON BIOMEDICAL ENGINEERING AND SCIENCES (IECBES), 2012,