Source code size prediction using use case metrics: an empirical comparison with use case points

被引:6
作者
Badri M. [1 ]
Badri L. [1 ]
Flageol W. [1 ]
Toure F. [1 ]
机构
[1] Software Engineering Research Laboratory, Department of Mathematics and Computer Science, University of Quebec, Trois-Rivières, QC
基金
加拿大自然科学与工程研究理事会;
关键词
C4.5; k-NN; Linear regression; Logistic regression; LOO cross validation; Multilayer perceptron neural network; Naïve Bayes; Prediction models; Random forest; ROC and AUC analysis; Source code size; Use case metrics; Use case points; Use cases;
D O I
10.1007/s11334-016-0285-7
中图分类号
学科分类号
摘要
Software source code size, in terms of source lines of code (SLOC), is an important parameter of many parametric software development effort estimation methods. In this paper, we investigate empirically the early prediction of SLOC for object-oriented software using use case metrics. We used different modeling techniques to build the prediction models. We used the univariate logistic regression and the simple linear regression methods to evaluate the individual effect of each use case metric on SLOC, and the multivariate logistic regression and the multiple linear regression methods to explore the combined effect of the use case metrics on SLOC. We also used in the study different machine learning methods (k-NN, naïve Bayes, C4.5, random forest, and multilayer perceptron neural network). The prediction models were evaluated using the receiver operating characteristic analysis, particularly the area under the curve measure, and leave-one-out cross validation. An empirical study, using data collected from five open source Java projects, is reported in the paper. The use case metrics have been compared to the well-known use case points method. Results provide evidence that the use case metrics-based approach gives a more accurate prediction of SLOC than the use case points-based approach. © 2016, Springer-Verlag London.
引用
收藏
页码:143 / 159
页数:16
相关论文
共 73 条
[11]  
Robiolo G., Orosco R., An alternative method employing use cases for early effort estimation, Software Engineering Workshop SEW ‘07, (2007)
[12]  
Robiolo G., Orosco R., Employing use cases to early estimate effort with simpler metrics, Innov Syst Softw Eng, 4, pp. 31-43, (2008)
[13]  
Robiolo G., Badano C., Orosco R., Transactions and paths: two use case based metrics which improve the early effort estimation. In: Proceedings of the 3rd international symposium on empirical software engineering and measurement, IEEE Computer Society, (2009)
[14]  
Fan W., Xiaohu Y., Xiaochun Z., Lu C., Extended use case points method for software cost estimation, (2009)
[15]  
Albrecht A., Measuring application development productivity, (1979)
[16]  
Albrecht A.J., Gaffney J.E., Software function, source lines of code and development effort prediction: a software science validation, IEEE Trans Softw Eng, 9, 6, pp. 639-648, (1983)
[17]  
Anda B., Benestad H.C., Hove S.E., A multiple case study of software effort estimation based on use case points. In: Proceedings of the international symposium on empirical software engineering (ISESE’05), (2005)
[18]  
Boehm B.W., Software engineering economics, (1981)
[19]  
Kemerer C.F., An empirical validation of software cost estimation, Commun ACM, 30, 5, pp. 416-429, (1987)
[20]  
Bourque P., Cote V., An experiment in software sizing with structured analysis metrics, J Syst Softw, 15, 2, pp. 159-172, (1991)