Heterogeneous Ensemble Imputation for Software Development Effort Estimation

被引:4
|
作者
Abnane, Ibtissam [1 ]
Idri, Ali [1 ,2 ]
Hosni, Mohamed [1 ,3 ]
Abran, Alain [4 ]
机构
[1] Mohammed V Univ, Software Project Management Res Team, ENSIAS, Rabat, Morocco
[2] Mohammed VI Polytech Univ, MSDA, Ben Guerir, Morocco
[3] Moulay Ismail Univ, MOSI, L2M3S, ENSAM Meknes, Meknes, Morocco
[4] Univ Quebec, Dept Software Engn & Informat Technol, ETS, Montreal, PQ, Canada
关键词
Software development effort estimation; missing data; imputation; heterogeneous ensemble; standardized accuracy; machine learning; MISSING DATA; INCOMPLETE DATA; COST ESTIMATION; ANALOGY; PREDICTION; REGRESSION; SYSTEMS;
D O I
10.1145/3475960.3475984
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Choosing the appropriate Missing Data (MD) imputation technique for a given Software development effort estimation (SDEE) technique is not a trivial task. In fact, the impact of the MD imputation on the estimation output depends on the dataset and the SDEE technique used and there is no best imputation technique in all contexts. Thus, an attractive solution is to use more than one single imputation technique and combine their results for a final imputation outcome. This concept is called ensemble imputation and can help to significantly improve the estimation accuracy. This paper develops and evaluates a heterogeneous ensemble imputation whose members were the four single imputation techniques: K-Nearest Neighbors (KNN), Expectation Maximization (EM), Support Vector Regression (SVR), and Decision Trees (DT). The impact of the ensemble imputation was evaluated and compared with those of the four single imputation techniques on the accuracy measured in terms of the standardized accuracy criterion of four SDEE techniques: Case Based Reasoning (CBR), Multi-Layers Perceptron (MLP), Support Vector Regression (SVR) and Reduced Error Pruning Tree (REPTree). The Wilcoxon statistical test was also performed in order to assess whether the results are significant. All the empirical evaluations were carried out over the six datasets, namely, ISBSG, China, COCOMO81, Desharnais, Kemerer, and Miyazaki. Results show that the use of heterogeneous ensemble-based imputation instead single imputation significantly improved the accuracy of the four SDEE techniques. Indeed, the ensemble imputation technique was ranked either first or second in all contexts.
引用
收藏
页码:1 / 10
页数:10
相关论文
共 50 条
  • [1] Evaluating ensemble imputation in software effort estimation
    Ibtissam Abnane
    Ali Idri
    Imane Chlioui
    Alain Abran
    Empirical Software Engineering, 2023, 28
  • [2] Evaluating ensemble imputation in software effort estimation
    Abnane, Ibtissam
    Idri, Ali
    Chlioui, Imane
    Abran, Alain
    EMPIRICAL SOFTWARE ENGINEERING, 2023, 28 (02)
  • [3] Heterogeneous Ensemble Dynamic Selection for Software Development Effort Estimation
    Cabral, Jose Thiago H. de A.
    Araujo, Ricardo de A.
    Nobrega, Jarley P.
    de Oliveira, Adriano L., I
    2017 IEEE 29TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2017), 2017, : 210 - 217
  • [4] Analogy Software Effort Estimation Using Ensemble KNN Imputation
    Abnane, Ibtissam
    Hosni, Mohamed
    Idri, Ali
    Abran, Alain
    2019 45TH EUROMICRO CONFERENCE ON SOFTWARE ENGINEERING AND ADVANCED APPLICATIONS (SEAA 2019), 2019, : 228 - 235
  • [5] Empirical Study of Homogeneous and Heterogeneous Ensemble Models for Software Development Effort Estimation
    Elish, Mahmoud O.
    Helmy, Tarek
    Hussain, Muhammad Imtiaz
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2013, 2013
  • [6] Heterogeneous Ensemble Model to Optimize Software Effort Estimation Accuracy
    Ali, Syed Sarmad
    Ren, Jian
    Zhang, Kui
    Wu, Ji
    Liu, Chao
    IEEE ACCESS, 2023, 11 : 27759 - 27792
  • [7] Heterogeneous Ensembles for Software Development Effort Estimation
    Hosni, Mohamed
    Idri, Ali
    Nassif, Ali Bou
    Abran, Alain
    2016 3RD INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE (ISCMI 2016), 2016, : 174 - 178
  • [8] Ensemble Learning Approach for Effective Software Development Effort Estimation with Future Ranking
    Rao, K. Eswara
    Pydi, Balamurali
    Naidu, P. Annan
    Prasann, U. D.
    Anjaneyulu, P.
    ADCAIJ-ADVANCES IN DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE JOURNAL, 2023, 12 (01):
  • [9] SOFTWARE EFFORT ESTIMATION USING A NEURAL NETWORK ENSEMBLE
    Pai, Dinesh R.
    McFall, Kevin S.
    Subramanian, Girish H.
    JOURNAL OF COMPUTER INFORMATION SYSTEMS, 2013, 53 (04) : 49 - 58
  • [10] Support vector regression-based imputation in analogy-based software development effort estimation
    Idri, Ali
    Abnane, Ibtissam
    Abran, Alain
    JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2018, 30 (12)