Comparing Stacking Ensemble and Deep Learning for Software Project Effort Estimation

被引:3
作者
Hoc, Huynh Thai [1 ]
Silhavy, Radek [1 ]
Prokopova, Zdenka [1 ]
Silhavy, Petr [1 ]
机构
[1] Tomas Bata Univ Zlin, Fac Appl Informat, Zlin 76001, Czech Republic
关键词
Software effort estimation; ensemble; function point analysis; deep learning; inductive transfer learning; COST ESTIMATION; PREDICTION; STATISTICS; REGRESSION; SYSTEMS; POINTS;
D O I
10.1109/ACCESS.2023.3286372
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study focuses on improving the accuracy of effort estimation by employing ensemble, deep learning, and transfer learning techniques. An ensemble approach is utilized, incorporating XGBoost, Random Forest, and Histogram Gradient Boost as generators to enhance predictive capabilities. The performance of the ensemble method is compared against both the deep learning approach and the PFA-IFPUG technique. Statistical criteria including MAE, SA, MMRE, PRED(0.25), MBRE, MIBRE, and relevant information related to MMRE and PRED(0.25) are employed for evaluation. The results demonstrate that combining regression models with Random Forest as the final regressor and XGBoost and Histogram Gradient Boost as prior generators yields more accurate effort estimation than other combinations. Furthermore, the findings highlight the potential of transfer learning in the deep learning method, which exhibits superior performance over the ensemble approach. This approach leverages pre-trained models and continuously improves performance by training on new datasets, providing valuable insights for cross-company and cross-time effort estimation problems. The ISBSG dataset is used to build the pre-trained model, and the inductive transfer learning approach is verified based on the Desharnais, Albrecht, Kitchenham, and China datasets. The study underscores the significance of transfer learning and the integration of domain-specific knowledge from existing models to enhance the performance of new models, thereby improving accuracy, reducing errors, and enhancing predictive capabilities in effort estimation.
引用
收藏
页码:60590 / 60604
页数:15
相关论文
共 68 条
  • [1] Investigating the use of random forest in software effort estimation
    Abdelali, Zakrani
    Mustapha, Hain
    Abdelwahed, Namir
    [J]. SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING IN DATA SCIENCES (ICDS2018), 2019, 148 : 343 - 352
  • [2] Agarwal N, 2020, SMART INNOVATIONS CO, P145, DOI [10.1007/978-981-15-5345-513, DOI 10.1007/978-981-15-5345-5_13]
  • [3] Ahmad F. B., 2022, J ED SCI, V2, P23
  • [4] Albrech A., 1979, PROC JOINT SHAREGUID, P83
  • [5] SOFTWARE FUNCTION, SOURCE LINES OF CODE, AND DEVELOPMENT EFFORT PREDICTION - A SOFTWARE SCIENCE VALIDATION
    ALBRECHT, AJ
    GAFFNEY, JE
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 1983, 9 (06) : 639 - 648
  • [6] Software Defect Prediction using Tree-Based Ensembles
    Aljamaan, Hamoud
    Alazba, Amal
    [J]. PROCEEDINGS OF THE 16TH ACM INTERNATIONAL CONFERENCE ON PREDICTIVE MODELS AND DATA ANALYTICS IN SOFTWARE ENGINEERING, PROMISE 2020, 2020, : 1 - 10
  • [7] [Anonymous], 2010, FUNCT POINT COUNT PR
  • [8] [Anonymous], 1989, THESIS U MONTREAL MO
  • [9] Ardiansyah, 2019, INT C ADV COMP SCI I, P15, DOI [10.1109/ICACSIS47736.2019.8979851, 10.1109/icacsis47736.2019.8979851]
  • [10] Project productivity evaluation in early software effort estimation
    Azzeh, Mohammad
    Nassif, Ali Bou
    [J]. JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2018, 30 (12)