Analogy Software Effort Estimation Using Ensemble KNN Imputation

Cited by: 16
Authors
Abnane, Ibtissam [1]
Hosni, Mohamed [1]
Idri, Ali [1]
Abran, Alain [2]
Affiliations
[1] Univ Mohammed 5, ENSIAS, Software Project Management Res Team, Rabat, Morocco
[2] Univ Quebec, Dept Software Engn & Informat Technol, ETS, Montreal, PQ, Canada
Keywords
Analogy-based software effort estimation; standardized accuracy; missing data; imputation; ensemble; grid search; parameter optimization; COST ESTIMATION; SYSTEMS
DOI
10.1109/SEAA.2019.00044
Chinese Library Classification number
TP31 [Computer software]
Discipline classification codes
081202; 0835
Abstract
Missing data are a serious issue that affects the prediction accuracy of software development effort estimation (SDEE) techniques, especially analogy-based software effort estimation (ASEE). Appropriate handling of missing data is therefore necessary to ensure the best performance. K-nearest neighbors (KNN) imputation has been widely used to deal with this issue. However, none of the studies investigating KNN imputation in SDEE has addressed the impact of parameter settings on the imputation process, even though parameter optimization techniques are often used at the prediction level, where they strongly affect the performance of SDEE techniques, including ASEE. This paper proposes and evaluates an ensemble KNN imputation technique for ASEE. We then compare ASEE performance using ensemble KNN imputation with that obtained using either a grid-search-based single KNN imputation or KNN imputation without parameter optimization. On the six datasets used for comparison, ensemble KNN imputation significantly improved ASEE performance compared with KNN imputation without optimization. Moreover, ensemble KNN imputation and grid-search-based imputation behaved similarly. Given that grid search is time-consuming, ensemble KNN imputation may be an alternative for dealing with missing data in the ASEE process.
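The abstract does not say how the ensemble is built, so the following is only a minimal sketch of one plausible reading: several single KNN imputers, each with a different number of neighbors, whose imputed values are averaged. The function names (`nan_distance`, `knn_impute`, `ensemble_knn_impute`), the choice of `ks`, the distance rescaling, and the toy project data are all assumptions made for illustration, not the authors' configuration.

```python
import math
from statistics import mean

def nan_distance(a, b):
    """Euclidean distance over features observed in both rows (None = missing),
    rescaled to the full feature count. Assumed metric, not from the paper."""
    shared = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
    if not shared:
        return math.inf
    squared = sum((x - y) ** 2 for x, y in shared)
    return math.sqrt(squared * len(a) / len(shared))

def knn_impute(rows, k):
    """Single KNN imputation: fill each missing value with the mean of that
    feature over the k nearest rows in which the feature is observed."""
    out = [list(r) for r in rows]
    for i, row in enumerate(rows):
        for j, value in enumerate(row):
            if value is not None:
                continue
            donors = [(nan_distance(row, other), other[j])
                      for m, other in enumerate(rows)
                      if m != i and other[j] is not None]
            donors = [d for d in donors if d[0] < math.inf]
            donors.sort(key=lambda d: d[0])
            nearest = [v for _, v in donors[:k]]
            if nearest:  # leave the gap if no usable donor exists
                out[i][j] = mean(nearest)
    return out

def ensemble_knn_impute(rows, ks=(1, 2, 3)):
    """Hypothetical ensemble: average the values imputed by several
    single KNN imputers that differ only in k."""
    imputations = [knn_impute(rows, k) for k in ks]
    out = [list(r) for r in rows]
    for i, row in enumerate(rows):
        for j, value in enumerate(row):
            if value is None:
                candidates = [imp[i][j] for imp in imputations
                              if imp[i][j] is not None]
                if candidates:
                    out[i][j] = mean(candidates)
    return out

# Toy data (invented): size, team size, duration; one team size is missing.
projects = [
    [10.0, 2.0, 5.0],
    [12.0, 2.0, 6.0],
    [50.0, 8.0, 20.0],
    [11.0, None, 5.5],
]
print(ensemble_knn_impute(projects)[3])
```

A grid search would instead try each candidate k, score the resulting estimates, and keep only the best one, which requires a full evaluation per setting; the averaging above avoids that selection step. In practice the features would normally be scaled before computing distances, which this sketch omits.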
Pages: 228-235
Number of pages: 8
Related papers
50 in total
  • [11] Software effort estimation by analogy and "regression toward the mean"
    Jorgensen, M
    Indahl, U
    Sjoberg, DIK
    JOURNAL OF SYSTEMS AND SOFTWARE, 2003, 68 (03) : 253 - 262
  • [13] Improved estimation of software development effort using Classical and Fuzzy Analogy ensembles
    Idri, Ali
    Hosni, Mohamed
    Abran, Alain
    APPLIED SOFT COMPUTING, 2016, 49 : 990 - 1019
  • [14] SENSE: software effort estimation using novel stacking ensemble learning
    Kaushik, Anupama
    Sheoran, Kavita
    Kapur, Ritvik
    Bhutani, Nikhil
    Singh, Bhavesh
    Sharma, Harsh
    INNOVATIONS IN SYSTEMS AND SOFTWARE ENGINEERING, 2024,
  • [15] MINN: A Missing Data Imputation Technique for Analogy-based Effort Estimation
    Shah, Muhammad Arif
    Jawawi, Dayang N. A.
    Isa, Mohd Adham
    Wakil, Karzan
    Younas, Muhammad
    Mustafa, Ahmed
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (02) : 222 - 232
  • [16] Influence of Outliers on Analogy Based Software Development Effort Estimation
    Ono, Kenichi
    Monden, Akito
    Tsunoda, Masateru
    Matsumoto, Kenichi
    2016 IEEE/ACIS 15TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2016, : 849 - 854
  • [18] Software effort estimation by analogy using attribute selection based on rough set analysis
    Li, Jingzhou
    Ruhe, Guenther
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2008, 18 (01) : 1 - 23
  • [19] Empirical study of analogy-based software effort estimation
    Walkerden, F.
    Jeffery, R.
    EMPIRICAL SOFTWARE ENGINEERING, 1999, 4 (02) : 135 - 158
  • [20] Stacking regularization in analogy-based software effort estimation
    Kaushik, Anupama
    Kaur, Prabhjot
    Choudhary, Nisha
    Priyanka
    SOFT COMPUTING, 2022, 26 (03) : 1197 - 1216