Systematic literature review of machine learning based software development effort estimation models

Cited by: 322
Authors
Wen, Jianfeng [1]
Li, Shixian [1]
Lin, Zhiyong [2]
Hu, Yong [3]
Huang, Changqin [4]
Affiliations
[1] Sun Yat Sen Univ, Dept Comp Sci, Guangzhou 510275, Guangdong, Peoples R China
[2] Guangdong Polytech Normal Univ, Dept Comp Sci, Guangzhou, Guangdong, Peoples R China
[3] Sun Yat Sen Univ, Inst Business Intelligence & Knowledge Discovery, Dept Commerce E, Guangdong Univ Foreign Studies, Guangzhou 510275, Guangdong, Peoples R China
[4] S China Normal Univ, Engn Res Ctr Comp Network & Informat Syst, Guangzhou, Guangdong, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Software effort estimation; Machine learning; Systematic literature review; DEVELOPMENT COST ESTIMATION; ARTIFICIAL NEURAL-NETWORKS; EFFORT PREDICTION; EMPIRICAL VALIDATION; GENETIC ALGORITHM; PROJECT EFFORT; ANALOGY; REGRESSION; INFORMATION; SELECTION;
DOI
10.1016/j.infsof.2011.09.002
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Context: Software development effort estimation (SDEE) is the process of predicting the effort required to develop a software system. To improve estimation accuracy, many researchers have proposed machine learning (ML) based SDEE models (ML models) since the 1990s. However, there has been no attempt to analyze the empirical evidence on ML models in a systematic way. Objective: This research aims to systematically analyze ML models from four aspects: type of ML technique, estimation accuracy, model comparison, and estimation context. Method: We performed a systematic literature review of empirical studies on ML models published in the last two decades (1991-2010). Results: We identified 84 primary studies relevant to the objective of this research. After investigating these studies, we found that eight types of ML techniques have been employed in SDEE models. Overall, the estimation accuracy of these ML models is close to the acceptable level and is better than that of non-ML models. Furthermore, different ML models have different strengths and weaknesses and thus favor different estimation contexts. Conclusion: ML models are promising in the field of SDEE. However, the application of ML models in industry is still limited, so more effort and incentives are needed to facilitate their application. To this end, based on the findings of this review, we provide recommendations for researchers as well as guidelines for practitioners. (C) 2011 Elsevier B.V. All rights reserved.
Pages: 41-59
Page count: 19
Related papers
126 in total
[71]   Software development cost estimation: Integrating neural network with cluster analysis [J].
Lee, A ;
Cheng, CH ;
Balakrishnan, J .
INFORMATION & MANAGEMENT, 1998, 34 (01) :1-9
[72]   Analysis of attribute weighting heuristics for analogy-based software effort estimation method AQUA+ [J].
Li, Jingzhou ;
Ruhe, Guenther .
EMPIRICAL SOFTWARE ENGINEERING, 2008, 13 (01) :63-96
[73]   A flexible method for software effort estimation by analogy [J].
Li, Jingzhou ;
Ruhe, Guenther ;
Al-Emran, Ahmed ;
Richter, Michael M. .
EMPIRICAL SOFTWARE ENGINEERING, 2007, 12 (01) :65-106
[74]   A study of the non-linear adjustment for analogy based software cost estimation [J].
Li, Y. F. ;
Xie, M. ;
Goh, T. N. .
EMPIRICAL SOFTWARE ENGINEERING, 2009, 14 (06) :603-643
[75]   A study of mutual information based feature selection for case based reasoning in software cost estimation [J].
Li, Y. F. ;
Xie, M. ;
Goh, T. N. .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (03) :5921-5931
[76]   A study of project selection and feature weighting for analogy based software cost estimation [J].
Li, Y. F. ;
Xie, M. ;
Goh, T. N. .
JOURNAL OF SYSTEMS AND SOFTWARE, 2009, 82 (02) :241-252
[77]   Combining techniques to optimize effort predictions in software project management [J].
MacDonell, SG ;
Shepperd, MJ .
JOURNAL OF SYSTEMS AND SOFTWARE, 2003, 66 (02) :91-98
[78]  
MACDONELL SG, 2007, P EMP SOFTW ENG MEAS, P401
[79]   An investigation of machine learning based prediction systems [J].
Mair, C ;
Kadoda, G ;
Lefley, M ;
Phalp, K ;
Schofield, C ;
Shepperd, M ;
Webster, S .
JOURNAL OF SYSTEMS AND SOFTWARE, 2000, 53 (01) :23-29
[80]   The consistency of empirical comparisons of regression and analogy-based software project cost prediction [J].
Mair, C ;
Shepperd, M .
2005 INTERNATIONAL SYMPOSIUM ON EMPIRICAL SOFTWARE ENGINEERING (ISESE), PROCEEDINGS, 2005, :491-500