Improving software effort estimation using bio-inspired algorithms to select relevant features: An empirical study

Cited by: 15
Authors
Ali, Asad [1]
Gravino, Carmine [1]
Affiliations
[1] Univ Salerno, Dept Comp Sci, Via Giovanni Paolo II,132, I-84084 Fisciano, SA, Italy
Keywords
Bio-inspired algorithms; Feature selection; Effort estimation; Estimation techniques; OPTIMIZATION; MODEL; PREDICTION; CLASSIFICATION; VALIDATION; REGRESSION
DOI
10.1016/j.scico.2021.102621
Chinese Library Classification (CLC) number
TP31 [Computer software]
Discipline classification code
081202; 0835
Abstract
Context: Bio-inspired feature selection algorithms have attracted the attention of researchers in the domain of Software Development Effort Estimation (SDEE) because they can improve the prediction accuracy of existing estimation techniques, such as machine learning methods. Objective: This paper analyzes different feature selection algorithms and assesses the role they can play in increasing the accuracy of software development effort predictions. Method: We performed an empirical study considering bio-inspired feature selection algorithms commonly used in the domain of SDEE, i.e., Genetic Algorithm (GA), Particle Swarm Optimization, Ant Colony Optimization, Tabu Search, Harmony Search (HS), and the Firefly algorithm, and four traditional non-bio-inspired algorithms, i.e., Best-First Search (BFS), Greedy Stepwise, Subset Forward Selection, and Random Search, used in combination with five widely used estimation techniques and applied to eight widely used SDEE datasets. Results: The analysis suggests that almost all (bio-inspired) feature selection algorithms outperformed the baseline estimation techniques (i.e., techniques employed without any feature selection algorithm) in the majority of the experiments; hence, we conclude that feature selection algorithms can help increase prediction accuracy in the domain of SDEE. HS and GA are considered the best-performing bio-inspired algorithms because they provided significantly better results than the non-bio-inspired algorithms in a greater number of experiments. Moreover, when we compared the bio-inspired algorithms with one another, GA and HS again came out as the best-performing bio-inspired feature selection algorithms.
Conclusion: Based on our results, if we had to pick one feature selection algorithm (bio-inspired or not) to recommend for future investigations, we would suggest HS, because it provided better effort predictions in more combinations of datasets and estimation techniques than the other considered bio- and non-bio-inspired algorithms. Among the non-bio-inspired algorithms, BFS provided better predictions than the others. © 2021 Elsevier B.V. All rights reserved.
Pages: 27