Estimation of Incomplete Data in Mixed Dataset

被引:7
|
作者
Sen, Suhani [1 ]
Das, Madhabananda [1 ]
Chatterjee, Rajdeep [1 ]
机构
[1] KIIT Univ, Sch Comp Engn, Bhubaneswar, Odisha, India
关键词
Fuzzy sets; Fuzzy knn; Kernel functions; Partial distance strategy; Hellinger distance; MULTIPLE IMPUTATION; ALGORITHM;
D O I
10.1007/978-981-10-3373-5_48
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper puts forward a fresh approach which is a modification of original fuzzy kNN for dealing with categorical missing values in categorical and mixed attribute datasets. We have removed the irrelevant missing samples through list-wise deletion. Then, rest of the missing samples is estimated using kernel-based fuzzy kNN technique and partial distance strategy. We have calculated the errors at different percentage of missing values. Results highlight that mixture kernel gives minimum average of MAE, MAPE and RMSE at different missing percentage when implemented on lenses, SPECT heart and abalone dataset.
引用
收藏
页码:483 / 492
页数:10
相关论文
共 50 条
  • [1] Improved Analogy-based Effort Estimation with Incomplete Mixed Data
    Abnane, Ibtissam
    Idri, Ali
    PROCEEDINGS OF THE 2018 FEDERATED CONFERENCE ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (FEDCSIS), 2018, : 1015 - 1024
  • [2] Identification and estimation with incomplete data
    Horowitz, JL
    Manski, CF
    FOUNDATIONS OF STATISTICAL INFERENCE, 2003, : 17 - 29
  • [3] A Method to Identify Missing Data Mechanism in Incomplete Dataset
    Tshering, Sonam
    Okazaki, Takeo
    Endo, Satoshi
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2013, 13 (03): : 14 - 22
  • [4] Selecting prototypes in mixed incomplete data
    García-Borroto, M
    Ruiz-Shulcloper, J
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS, 2005, 3773 : 450 - 459
  • [5] Pattern recognition with mixed and incomplete data
    Ruiz-Shulcloper J.
    Pattern Recogn. Image Anal., 2008, 4 (563-576): : 563 - 576
  • [6] Software cost estimation with incomplete data
    Strike, K
    El Emam, K
    Madhavji, N
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2001, 27 (10) : 890 - 908
  • [7] On density and regression estimation with incomplete data
    Mojirsheibani, Majid
    Manley, Kevin
    Pouliot, William
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2017, 46 (23) : 11688 - 11711
  • [8] MINIMAX ESTIMATION IN PROBLEMS WITH INCOMPLETE DATA
    DODUNEKOVA, RD
    RUSSIAN MATHEMATICAL SURVEYS, 1984, 39 (01) : 145 - 146
  • [9] Efficient GMM Estimation with Incomplete Data
    Muris, Chris
    REVIEW OF ECONOMICS AND STATISTICS, 2020, 102 (03) : 518 - 530