MINN: A Missing Data Imputation Technique for Analogy-based Effort Estimation

Citations: 0
Authors
Shah, Muhammad Arif [1 ,2 ]
Jawawi, Dayang N. A. [1 ]
Isa, Mohd Adham [1 ]
Wakil, Karzan [3 ]
Younas, Muhammad [1 ,4 ]
Mustafa, Ahmed [1 ]
Affiliations
[1] Univ Teknol Malaysia, Fac Engn, Sch Comp, Dept Software Engn, Johor Baharu, Malaysia
[2] City Univ Sci & Informat Technol, Peshawar, Pakistan
[3] Sulaimani Polytech Univ, Res Ctr, Sulaimani 46001, Kurdistan Region, Iraq
[4] Govt Coll Univ Faisalabad, Dept Comp Sci, Faisalabad, Pakistan
Keywords
Analogy-based estimation; effort estimation; missing data imputation; software development; SOFTWARE COST ESTIMATION; ACCURACY; IMPROVE;
DOI
10.14569/ijacsa.2019.0100230
Chinese Library Classification
TP301 [Theory and methods];
Discipline classification code
081202;
Abstract
The success or failure of a complex software project is strongly associated with accurate estimation of its development effort. Numerous estimation models have been developed, and the most widely used among them is Analogy-Based Estimation (ABE). The ABE model mirrors human reasoning: it estimates the effort of a future project by drawing analogies with data from past projects. Because ABE relies on historical datasets, the quality of those datasets affects the accuracy of the estimate. Most software engineering datasets contain missing values. Researchers either delete the projects that contain missing values or leave the missing values untreated, and both practices reduce ABE performance. In this study, Numeric Cleansing (NC), K-Nearest Neighbor Imputation (KNNI), and Median Imputation of the Nearest Neighbor (MINN) are used to impute the missing values in the Desharnais and DesMiss datasets for ABE. The MINN technique is introduced in this study. The imputation methods are compared to identify the one best suited to ABE. The results suggest that MINN imputes more realistic values into the incomplete datasets than NC and KNNI do. Imputation treatment was also found to improve the prediction of software development effort with the ABE model.
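The abstract names the ingredients (nearest neighbors, a median, and analogy-based estimation) without giving the exact procedure, so the following Python is only a minimal sketch under stated assumptions, not the authors' MINN implementation. It assumes numeric features with missing values encoded as None, a Euclidean distance computed over the features both projects have observed, and k = 3 neighbors; the function names median_impute and estimate_effort are hypothetical.

```python
import math
from statistics import median


def distance(a, b):
    """Euclidean distance over the features observed (not None) in both projects.

    Assumption: the squared sum is averaged over the shared features so that
    pairs with different numbers of shared features remain comparable.
    """
    shared = [(x, y) for x, y in zip(a, b) if x is not None and y is not None]
    if not shared:
        return math.inf
    return math.sqrt(sum((x - y) ** 2 for x, y in shared) / len(shared))


def median_impute(rows, k=3):
    """Fill each missing value with the median of that feature among the
    k nearest projects that have the feature observed (illustrative only)."""
    filled = [list(r) for r in rows]
    for i, row in enumerate(rows):
        for j, value in enumerate(row):
            if value is not None:
                continue
            # Candidate donors: every other project with feature j observed.
            donors = [r for idx, r in enumerate(rows)
                      if idx != i and r[j] is not None]
            donors.sort(key=lambda r: distance(row, r))
            nearest = donors[:k]
            if nearest:
                filled[i][j] = median(r[j] for r in nearest)
    return filled


def estimate_effort(target, history, efforts, k=3):
    """Analogy-based estimate: mean effort of the k most similar past projects."""
    ranked = sorted(range(len(history)),
                    key=lambda i: distance(target, history[i]))
    chosen = ranked[:k]
    return sum(efforts[i] for i in chosen) / len(chosen)


if __name__ == "__main__":
    # Toy data, not taken from the Desharnais or DesMiss datasets.
    projects = [[1.0, 10.0], [2.0, 20.0], [3.0, None], [10.0, 100.0], [4.0, 40.0]]
    completed = median_impute(projects, k=3)
    print(completed[2])
    print(estimate_effort([2.0, 20.0], completed, [100, 200, 300, 1000, 400], k=2))
```

Using the median rather than the mean of the neighbors makes the imputed value less sensitive to a single atypical neighbor, which is one plausible reading of why a median-based variant might yield more realistic values than plain KNNI.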
Pages: 222-232
Page count: 11
Related papers
50 in total
  • [1] Fuzzy C-mean Missing Data Imputation for Analogy-based Effort Estimation
    AlMutlaq, Ayman Jalal
    Jawawi, Dayang N. A.
    Arbain, Adila Firdaus Binti
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (08) : 628 - 640
  • [2] Missing data techniques in analogy-based software development effort estimation
    Idri, Ali
    Abnane, Ibtissam
    Abran, Alain
    JOURNAL OF SYSTEMS AND SOFTWARE, 2016, 117 : 595 - 611
  • [3] Improved Analogy-based Effort Estimation with Incomplete Mixed Data
    Abnane, Ibtissam
    Idri, Ali
    PROCEEDINGS OF THE 2018 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2018, : 1015 - 1024
  • [4] Support vector regression-based imputation in analogy-based software development effort estimation
    Idri, Ali
    Abnane, Ibtissam
    Abran, Alain
    JOURNAL OF SOFTWARE-EVOLUTION AND PROCESS, 2018, 30 (12)
  • [5] An Optimized Analogy-Based Project Effort Estimation
    Azzeh, Mohammad
    Elsheikh, Yousef
    Alseid, Marwan
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2014, 5 (04) : 6 - 11
  • [7] Exploiting the Essential Assumptions of Analogy-Based Effort Estimation
    Kocaguneli, Ekrem
    Menzies, Tim
    Bener, Ayse Basar
    Keung, Jacky W.
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2012, 38 (02) : 425 - 438
  • [8] Empirical study of analogy-based software effort estimation
    Walkerden F.
    Jeffery R.
    Empirical Software Engineering, 1999, 4 (2) : 135 - 158
  • [9] Stacking regularization in analogy-based software effort estimation
    Kaushik, Anupama
    Kaur, Prabhjot
    Choudhary, Nisha
    Priyanka
    SOFT COMPUTING, 2022, 26 (03) : 1197 - 1216
  • [10] An evolutionary ensemble analogy-based software effort estimation
    Shahpar, Zahra
    Bardsiri, Vahid Khatibi
    Bardsiri, Amid Khatibi
    SOFTWARE-PRACTICE & EXPERIENCE, 2022, 52 (04) : 929 - 946