Predicting Software Effort Estimation Using Machine Learning Techniques

被引:0
作者
BaniMustafa, Ahmed [1 ]
机构
[1] Amer Univ, Madaba, Jordan
来源
2018 8TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (CSIT) | 2018年
关键词
Software Effort Estimation; COCOMO Data Mining; Mac hine Learning; Naive Bayes; Logistic Regression; Random Forests; ARTIFICIAL NEURAL-NETWORK; REGRESSION-MODELS; ANALOGY;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In software engineering, estimation plays a vital r ole in software development. Thus, affecting its cost and required effort and consequently influencing the overall success of sof tware development. The error margin in Expert-Based, Anal ogy-Based and algorithmic based methods including: COCOMO, Fu nction Point Analysis and Use-Case-Points is quite signifi cant, which exposes software projects to the danger of delays a nd running over-budget. To obtain better estimation, we propos e an alternative method through performing data mining o n historical data. This paper suggests performing this predictio n using three machine learning techniques that were applied to a preprocessed COCOMO NASA benchmark data which covered 93 project s: Naive Bayes, Logistic Regression and Random Forests. The generated models were tested using five folds cross -validation and were evaluated using Classification Accuracy, Preci sion, Recall, and AUC. The estimation results were then compared to COCOMO estimation. All the applied techniques were successful in achieving better results than the compared COCOM O model. However, the best performance was obtained using bo th Naive Bayes and Random Forests. Despite the fact that Nai ve Bayes outperformed both of the other two techniques in it s ROC curve and Recall score, Random Forests has a better Confu sion Matrix and scored better in both Classification Accuracy, and Precision measures. The results of this work confirm the vali mining in general and the applied technique in part software estimation.
引用
收藏
页码:249 / 256
页数:8
相关论文
共 65 条
[1]  
Albrech A., 1979, PROC JOINT SHAREGUID, P83
[2]   SOFTWARE FUNCTION, SOURCE LINES OF CODE, AND DEVELOPMENT EFFORT PREDICTION - A SOFTWARE SCIENCE VALIDATION [J].
ALBRECHT, AJ ;
GAFFNEY, JE .
IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1983, 9 (06) :639-648
[3]  
[Anonymous], 2004, P 10 ACM SIGKDD INT
[4]  
[Anonymous], 7 INT C FUZZ THEOR T
[5]  
Azzeh M, 2008, ICSOFT 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON SOFTWARE AND DATA TECHNOLOGIES, VOL SE/GSDCA/MUSE, P127
[6]   Fuzzy Model Tree For Early Effort Estimation [J].
Azzeh, Mohammad ;
Nassif, Ali Bou .
2013 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2013), VOL 2, 2013, :117-121
[7]  
Banerjee G., 2004, Use Case Estimation
[8]  
Banimustafa Ahmed Hmaidan, 2012, Methods Mol Biol, V860, P317, DOI 10.1007/978-1-61779-594-7_18
[9]  
Bardsiri V. K., 2016, J ADV COMPUTER ENG T, V2, P11
[10]   Software Effort Estimation Using Data Mining Techniques [J].
Benala, Tirimula Rao ;
Mall, Rajib ;
Srikavya, P. ;
HariPriya, M. Vani .
ICT AND CRITICAL INFRASTRUCTURE: PROCEEDINGS OF THE 48TH ANNUAL CONVENTION OF COMPUTER SOCIETY OF INDIA - VOL I, 2014, 248 :85-92