How to Make Best Use of Cross-Company Data in Software Effort Estimation?

被引:36
作者
Minku, Leandro L. [1 ]
Yao, Xin [1 ]
机构
[1] Univ Birmingham, Sch Comp Sci, CERCIA, Birmingham B15 2TT, W Midlands, England
来源
36TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2014) | 2014年
关键词
Software effort estimation; cross-company learning; transfer learning; online learning; ensembles of learning machines; PREDICTION;
D O I
10.1145/2568225.2568228
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Previous works using Cross-Company (CC) data for making Within-Company (WC) Software Effort Estimation (SEE) try to use CC data or models directly to provide predictions in the WC context. So, these data or models are only helpful when they match the WC context well. When they do not, a fair amount of WC training data, which are usually expensive to acquire, are still necessary to achieve good performance. We investigate how to make best use of CC data, so that we can reduce the amount of WC data while maintaining or improving performance in comparison to WCSEE models. This is done by proposing a new framework to learn the relationship between CC and WC projects explicitly, allowing CC models to be mapped to the WC context. Such mapped models can be useful even when the CC models themselves do not match the WC context directly. Our study shows that a new approach instantiating this frame work is able not only to use substantially less WC data than a corresponding WC model, but also to achieve similar/better performance. This approach can also be used to provide insight into the behaviour of a company in comparison to others.
引用
收藏
页码:446 / 456
页数:11
相关论文
共 28 条
[1]  
[Anonymous], 2013, P 9 INT C PRED MOD S, DOI [DOI 10.1145/2499393.2499394, 10.1145/2499393.2499394]
[2]  
[Anonymous], 1981, Software Engineering Economics
[3]  
[Anonymous], 2012, P PROMISE 12
[4]  
Briand L, 2000, ICSE, P377
[5]   Data Mining Techniques for Software Effort Estimation: A Comparative Study [J].
Dejaeger, Karel ;
Verbeke, Wouter ;
Martens, David ;
Baesens, Bart .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2012, 38 (02) :375-397
[6]   A simulation study of the model evaluation criterion MMRE [J].
Foss, T ;
Stensrud, E ;
Kitchenham, B ;
Myrtveit, I .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2003, 29 (11) :985-995
[7]  
Hall M., 2009, SIGKDD Explorations, V11, P10, DOI DOI 10.1145/1656274.1656278
[8]   A comparative study of two software development cost modeling techniques using multi-organizational and company-specific data [J].
Jeffery, R ;
Ruhe, M ;
Wieczorek, I .
INFORMATION AND SOFTWARE TECHNOLOGY, 2000, 42 (14) :1009-1016
[9]   An empirical study of maintenance and development estimation accuracy [J].
Kitchenham, B ;
Pfleeger, SL ;
McColl, B ;
Eagan, S .
JOURNAL OF SYSTEMS AND SOFTWARE, 2002, 64 (01) :57-77
[10]  
Kitchenham BA, 2004, EMPIRICAL ASSESSMENT, P47