Investigating the use of random forest in software effort estimation

被引:51
作者
Abdelali, Zakrani [1 ]
Mustapha, Hain [1 ]
Abdelwahed, Namir [2 ]
机构
[1] ENSAM, 150 Blvd Nile, Casablanca 20700, Morocco
[2] Fac Sci Ben Msik, Blvd Driss El Harti, Casablanca, Morocco
来源
SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING IN DATA SCIENCES (ICDS2018) | 2019年 / 148卷
关键词
Software effort estimation; random forest; regression trees; accuracy evaluation;
D O I
10.1016/j.procs.2019.01.042
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Over the last two decades, there has been an important increase in studies dealing with the software development effort estimation (SDEE) using machine learning (ML) techniques that aimed to improve the accuracy of the estimates and to understand the process used to generate these estimates. Among these ML techniques, decision tree-based models have received a considerable scholarly attention thanks to their generalization ability and understandability. However, very few studies have investigated the use of random forest (RF) in software effort estimation. In this paper, a RF model is designed and optimized empirically by varying the values of its key parameters. The performance of the RF is compared with that of classical regression tree (RT). The evaluation was performed through the 30% hold-out validation method using three datasets: ISBSG R8, Tukutuku and COCOMO. To identify the most accurate techniques, we used three widely known accuracy measures: Pred(0.25), MMRE and MdMRE. The results show that the optimized random forest outperforms the regression trees model on all evaluation criteria. (C) 2019 The Authors. Published by Elsevier B.V.
引用
收藏
页码:343 / 352
页数:10
相关论文
共 38 条
[1]   SOFTWARE DEVELOPMENT EFFORT ESTIMATION USING CLASSICAL AND FUZZY ANALOGY: A CROSS-VALIDATION COMPARATIVE STUDY [J].
Amazal, Fatima Azzahra ;
Idri, Ali ;
Abran, Alain .
INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2014, 13 (03)
[2]  
Andreou Andreas S., 2008, 2008 23rd IEEE/ACM International Conference on Automated Software Engineering, P371, DOI 10.1109/ASE.2008.51
[3]  
[Anonymous], 2013, P 28 ANN ACM S APPL
[4]  
[Anonymous], 2010, INT REV COMPUT SOFTW
[5]  
Azzeh M., 2011, 7 INT C PRED MOD SOF
[6]  
Basri S, 2016, ASIA PAC SOFWR ENG, P177, DOI [10.1109/APSEC.2016.57, 10.1109/APSEC.2016.034]
[7]   SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669
[8]  
Boehm B. W., 1981, SOFTWARE ENG EC
[9]  
Boehm Barry, 2000, SOFTWARE COST ESTIMA
[10]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32