Improve Analogy-Based Software Effort Estimation using Principal Components Analysis and Correlation Weighting

Cited by: 27
Authors
Wen, Jianfeng [1]
Li, Shixian [1]
Tang, Linyan [2]
Affiliations
[1] Sun Yat Sen Univ, Dept Comp Sci, Guangzhou 510275, Guangdong, Peoples R China
[2] Guangdong Ind Tech Coll, Dept Comp Engn, Guangzhou, Guangdong, Peoples R China
Source
APSEC 09: SIXTEENTH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE, PROCEEDINGS | 2009
Keywords
Software Effort Estimation; Analogy; Principal Components Analysis; Correlation Weighting; COST ESTIMATION; SELECTION
DOI
10.1109/APSEC.2009.40
Chinese Library Classification number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
Software development cost overruns often induce project managers to cut manpower costs at the expense of software quality. Accurate effort estimation helps prevent cost overruns. Analogy-based effort estimation predicts the effort of a new project from information about similar historical projects, where similarity is measured via Euclidean distance. To calculate the Euclidean distance, traditional analogy-based effort estimation methods usually adopt the original project features and assign uniform weights to them. However, this leads to an inappropriate similarity measure and an inaccurate effort estimate if the original features are interdependent or have unequal impacts on the project effort. In this paper, we propose to use principal components analysis (PCA) to extract independent features, and then to use the Pearson correlation coefficients between the extracted features and the project effort as the weights for the Euclidean distance calculation in the similarity measure. Extensive experiments were conducted on three benchmark datasets: COCOMO, Desharnais, and NASA. The experimental results show that our approach significantly improves prediction accuracy and reliability over the traditional method, either by using correlation weighting alone or by using PCA combined with correlation weighting. A comparison of our approach with other approaches reported in the literature also suggests that our approach is competitive.
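The pipeline outlined in the abstract (PCA, correlation weights, weighted Euclidean distance, estimate from the closest analogues) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. Several details are assumptions the abstract does not specify: features are standardized before PCA, all principal components are kept, the absolute value of each Pearson coefficient is used as the weight, and the estimate is the mean effort of the `k` nearest projects. The function names `pca_transform`, `correlation_weights`, and `estimate_effort` are hypothetical.

```python
import numpy as np


def pca_transform(X_train, X_new):
    """Project training and new projects onto the principal components
    of the standardized training features (assumption: z-score scaling)."""
    mean = X_train.mean(axis=0)
    std = X_train.std(axis=0, ddof=1)
    std[std == 0] = 1.0  # avoid division by zero for constant features
    Z_train = (X_train - mean) / std
    Z_new = (X_new - mean) / std
    # Eigenvectors of the covariance matrix are the principal directions.
    cov = np.atleast_2d(np.cov(Z_train, rowvar=False))
    eigvals, eigvecs = np.linalg.eigh(cov)
    order = np.argsort(eigvals)[::-1]  # largest variance first
    components = eigvecs[:, order]
    return Z_train @ components, Z_new @ components


def correlation_weights(P_train, effort):
    """Weight each component by |Pearson r| with effort
    (assumption: absolute value, so weights are non-negative)."""
    weights = np.zeros(P_train.shape[1])
    for j in range(P_train.shape[1]):
        column = P_train[:, j]
        if column.std() > 1e-12:  # skip degenerate components
            weights[j] = abs(np.corrcoef(column, effort)[0, 1])
    return weights


def estimate_effort(X_train, effort, x_new, k=1):
    """Estimate effort as the mean effort of the k most similar
    historical projects under a weighted Euclidean distance."""
    P_train, P_new = pca_transform(X_train, x_new.reshape(1, -1))
    w = correlation_weights(P_train, effort)
    distances = np.sqrt((w * (P_train - P_new) ** 2).sum(axis=1))
    nearest = np.argsort(distances)[:k]
    return effort[nearest].mean()
```

Calling `estimate_effort(X_train, effort, x_new, k=3)` with a feature matrix of historical projects, their known efforts, and the feature vector of a new project returns the averaged effort of the three closest analogues. Replacing `correlation_weights` with a vector of ones and skipping the projection gives the uniform-weight baseline that the abstract calls the traditional method.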
Pages: 179 / +
Page count: 2