Using tabu search to configure support vector regression for effort estimation

被引:0
作者
A. Corazza
S. Di Martino
F. Ferrucci
C. Gravino
F. Sarro
E. Mendes
机构
[1] University of Napoli “Federico II”,
[2] University of Salerno,undefined
[3] Zayed University,undefined
来源
Empirical Software Engineering | 2013年 / 18卷
关键词
Effort estimation; Search based techniques; Support vector regression; Tabu search;
D O I
暂无
中图分类号
学科分类号
摘要
Recent studies have reported that Support Vector Regression (SVR) has the potential as a technique for software development effort estimation. However, its prediction accuracy is heavily influenced by the setting of parameters that needs to be done when employing it. No general guidelines are available to select these parameters, whose choice also depends on the characteristics of the dataset being used. This motivated the work described in (Corazza et al. 2010), extended herein. In order to automatically select suitable SVR parameters we proposed an approach based on the use of the meta-heuristics Tabu Search (TS). We designed TS to search for the parameters of both the support vector algorithm and of the employed kernel function, namely RBF. We empirically assessed the effectiveness of the approach using different types of datasets (single and cross-company datasets, Web and not Web projects) from the PROMISE repository and from the Tukutuku database. A total of 21 datasets were employed to perform a 10-fold or a leave-one-out cross-validation, depending on the size of the dataset. Several benchmarks were taken into account to assess both the effectiveness of TS to set SVR parameters and the prediction accuracy of the proposed approach with respect to widely used effort estimation techniques. The use of TS allowed us to automatically obtain suitable parameters’ choices required to run SVR. Moreover, the combination of TS and SVR significantly outperformed all the other techniques. The proposed approach represents a suitable technique for software development effort estimation.
引用
收藏
页码:506 / 546
页数:40
相关论文
共 80 条
  • [1] Albrecht AJ(1983)Software function, source lines of code, and development effort prediction: a software science validation IEEE Trans Softw Eng 9 639-648
  • [2] Gaffney JE(2001)Can genetic programming improve software effort estimation? A comparative evaluation Inf Softw Technol 43 863-873
  • [3] Burgess CJ(2004)Practical selection of SVM parameters and noise estimation for SVM Regression Neural Netw 17 113-126
  • [4] Lefley M(2007)The adjusted analogy-based software effort estimation based on similarity distances J Syst Software 80 628-640
  • [5] Cherkassky V(1977)Detection of influential observations in linear regression Technometrics 19 15-18
  • [6] Ma Y(2011)Investigating the use of Support Vector Regression for Web Effort Estimation Empir Softw Eng 16 211-243
  • [7] Chiu N-H(2009)The WEKA data mining software: an update SIGKDD Explorations 11 10-18
  • [8] Huang S-J(2008)Kernel methods in machine learning Ann Stat 36 1171-1220
  • [9] Cook RD(2000)A comparative study of two software development cost modeling techniques using multi-organizational and company-specific data Inf Softw Technol 42 1009-1016
  • [10] Corazza A(1987)An empirical validation of software cost estimation models Commun ACM 30 416-429