Using public domain metrics to estimate software development effort

被引:80
作者
Jeffery, R [1 ]
Ruhe, M [1 ]
Wieczorek, I [1 ]
机构
[1] Univ New S Wales, CAESAR, Sydney, NSW 2052, Australia
来源
SEVENTH INTERNATIONAL SOFTWARE METRICS SYMPOSIUM - METRICS 2001, PROCEEDINGS | 2000年
关键词
D O I
10.1109/METRIC.2001.915512
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper we investigate the accuracy of cost estimates when applying most commonly used modeling techniques to a large-scale industrial data set which is professionally maintained by the International Software Standards Benchmarking Group (ISBSG). The modeling techniques applied are ordinary least squares regression (OLS), Analogy-based estimation, stepwise ANOVA, CART, and robust regression. The questions we address in this study are related to important issues. The first is the appropriate selection of a technique in a given context The second is the assessment of the feasibility of using multi-organizational data compared to the benefits from company-specific data collection. We compare company-specific models with models based on multi-company data. This is done by using the estimates derived for one company that contributed to the ISBSG data set and estimates from using carefully marched data from the rest of the ISBSG data. When using the ISBSG data set to derive estimates for the company generally poor results were obtained. Robust regression and OLS performed most accurately. When using the company's own data as the basis for estimation OLS, a CART-variant, and Analogy performed best. In contrast to previous studies, the estimation accuracy when using the company's data is significantly higher than when using the rest of the ISBSG data set. Thus, from these results, the company that contributed to the ISBSG data set, would be better off when using ifs own data for cost estimation.
引用
收藏
页码:16 / 27
页数:12
相关论文
共 30 条
[1]   A simulation tool for efficient analogy based cost estimation [J].
Angelis L. ;
Stamelos I. .
Empirical Software Engineering, 2000, 5 (1) :35-68
[2]  
BISIO R, 1995, P 1 INT C CAS BAS RE, P11
[3]  
Breiman L., 1984, BIOMETRICS, DOI DOI 10.2307/2530946
[4]  
Briand L. C., 1999, Proceedings of the 1999 International Conference on Software Engineering (IEEE Cat. No.99CB37002), P313, DOI 10.1109/ICSE.1999.841022
[5]  
Briand L. C., 2000, Proceedings of the 2000 International Conference on Software Engineering. ICSE 2000 the New Millennium, P377, DOI 10.1109/ICSE.2000.870428
[6]   A PATTERN-RECOGNITION APPROACH FOR SOFTWARE ENGINEERING DATA-ANALYSIS [J].
BRIAND, LC ;
BASILI, VR ;
THOMAS, WM .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1992, 18 (11) :931-942
[7]  
BRIAND LC, UNPUB ENCY SOFTWARE
[8]  
BRIAND LC, P 20 INT C SOFTW ENG, P390
[9]  
Conte S.D., 1986, SOFTWARE ENG METRICS
[10]   A comparison of software effort estimation techniques: Using function points with neural networks, case-based reasoning and regression models [J].
Finnie, GR ;
Wittig, GE ;
Desharnais, JM .
JOURNAL OF SYSTEMS AND SOFTWARE, 1997, 39 (03) :281-289