Heterogeneous Ensemble Imputation for Software Development Effort Estimation

被引:4
|
作者
Abnane, Ibtissam [1 ]
Idri, Ali [1 ,2 ]
Hosni, Mohamed [1 ,3 ]
Abran, Alain [4 ]
机构
[1] Mohammed V Univ, Software Project Management Res Team, ENSIAS, Rabat, Morocco
[2] Mohammed VI Polytech Univ, MSDA, Ben Guerir, Morocco
[3] Moulay Ismail Univ, MOSI, L2M3S, ENSAM Meknes, Meknes, Morocco
[4] Univ Quebec, Dept Software Engn & Informat Technol, ETS, Montreal, PQ, Canada
关键词
Software development effort estimation; missing data; imputation; heterogeneous ensemble; standardized accuracy; machine learning; MISSING DATA; INCOMPLETE DATA; COST ESTIMATION; ANALOGY; PREDICTION; REGRESSION; SYSTEMS;
D O I
10.1145/3475960.3475984
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Choosing the appropriate Missing Data (MD) imputation technique for a given Software development effort estimation (SDEE) technique is not a trivial task. In fact, the impact of the MD imputation on the estimation output depends on the dataset and the SDEE technique used and there is no best imputation technique in all contexts. Thus, an attractive solution is to use more than one single imputation technique and combine their results for a final imputation outcome. This concept is called ensemble imputation and can help to significantly improve the estimation accuracy. This paper develops and evaluates a heterogeneous ensemble imputation whose members were the four single imputation techniques: K-Nearest Neighbors (KNN), Expectation Maximization (EM), Support Vector Regression (SVR), and Decision Trees (DT). The impact of the ensemble imputation was evaluated and compared with those of the four single imputation techniques on the accuracy measured in terms of the standardized accuracy criterion of four SDEE techniques: Case Based Reasoning (CBR), Multi-Layers Perceptron (MLP), Support Vector Regression (SVR) and Reduced Error Pruning Tree (REPTree). The Wilcoxon statistical test was also performed in order to assess whether the results are significant. All the empirical evaluations were carried out over the six datasets, namely, ISBSG, China, COCOMO81, Desharnais, Kemerer, and Miyazaki. Results show that the use of heterogeneous ensemble-based imputation instead single imputation significantly improved the accuracy of the four SDEE techniques. Indeed, the ensemble imputation technique was ranked either first or second in all contexts.
引用
收藏
页码:1 / 10
页数:10
相关论文
共 50 条
  • [41] Analyzing the Effect of Variables in the Software Development Effort Estimation
    Velarde, H.
    Santiesteban, C.
    Garcia, A.
    Casillas, J.
    IEEE LATIN AMERICA TRANSACTIONS, 2016, 14 (08) : 3797 - 3803
  • [42] Effort Estimation in Agile Software Development: An Updated Review
    Dantas, Emanuel
    Perkusich, Mirko
    Dilorenzo, Ednaldo
    Santos, Danilo F. S.
    Almeida, Hyggo
    Perkusich, Angelo
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2018, 28 (11-12) : 1811 - 1831
  • [43] The state-of-the-art in software development effort estimation
    Gautam, Swarnima Singh
    Singh, Vrijendra
    JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2018, 30 (12)
  • [44] Estimation of Software Development Effort: A Differential Evolution Approach
    Singal, Prerna
    Kumari, A. Charan
    Sharma, Prabha
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA SCIENCE, 2020, 167 : 2643 - 2652
  • [45] Effort, Duration and Cost Estimation in Agile Software Development
    Owais, Mohd.
    Ramakishore, R.
    2016 NINTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2016, : 150 - 154
  • [46] Software development effort estimation: a systematic mapping study
    Eduardo Carbonera, Carlos
    Farias, Kleinner
    Bischoff, Vinicius
    IET SOFTWARE, 2020, 14 (04) : 328 - 344
  • [47] Influence of Outliers on Estimation Accuracy of Software Development Effort
    Ono, Kenichi
    Tsunoda, Masateru
    Monden, Akito
    Matsumoto, Kenichi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (01) : 91 - 105
  • [48] Effort estimation in agile global software development context
    Britto, Ricardo, 1600, Springer Verlag (199):
  • [49] Measuring software complexity for early estimation of development effort
    Tran-Cao, D
    Lévsque, G
    Meunier, JG
    COMPUTATIONAL METHODS AND EXPERIMENTAL MEASUREMENTS XII, 2005, 41 : 27 - 39
  • [50] Effort Estimation in Agile Software Development Using Autoencoders
    Rodriguez Sanchez, Eduardo
    Vazquez Santacruz, Eduardo
    Cervantes Maceda, Humberto
    2023 12TH INTERNATIONAL CONFERENCE ON SOFTWARE PROCESS IMPROVEMENT, CIMPS 2023, 2023, : 1 - 7