A fuzzy K-nearest neighbor classifier to deal with imperfect data

被引:17
|
作者
Cadenas, Jose M. [1 ]
Carmen Garrido, M. [1 ]
Martinez, Raquel [2 ]
Munoz, Enrique [3 ]
Bonissone, Piero P. [4 ]
机构
[1] Univ Murcia, Dept Informat & Commun Engn, Murcia, Spain
[2] Catholic Univ Murcia, Dept Comp Engn, Murcia, Spain
[3] Univ Milan, Dept Comp Sci, Crema, Italy
[4] Piero P Bonissone Analyt LLC, San Diego, CA USA
关键词
k-nearest neighbors; Classification; Imperfect data; Distance/dissimilarity measures; Combination methods; PERFORMANCE; RULES; ALGORITHMS;
D O I
10.1007/s00500-017-2567-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The k-nearest neighbors method (kNN) is a nonparametric, instance-based method used for regression and classification. To classify a new instance, the kNN method computes its k nearest neighbors and generates a class value from them. Usually, this method requires that the information available in the datasets be precise and accurate, except for the existence of missing values. However, data imperfection is inevitable when dealing with real-world scenarios. In this paper, we present the kNN(imp) classifier, a k-nearest neighbors method to perform classification from datasets with imperfect value. The importance of each neighbor in the output decision is based on relative distance and its degree of imperfection. Furthermore, by using external parameters, the classifier enables us to define the maximum allowed imperfection, and to decide if the final output could be derived solely from the greatest weight class (the best class) or from the best class and a weighted combination of the closest classes to the best one. To test the proposed method, we performed several experiments with both synthetic and real-world datasets with imperfect data. The results, validated through statistical tests, show that the kNN(imp) classifier is robust when working with imperfect data and maintains a good performance when compared with other methods in the literature, applied to datasets with or without imperfection.
引用
收藏
页码:3313 / 3330
页数:18
相关论文
共 50 条
  • [31] Validation of k-Nearest Neighbor Classifiers
    Bax, Eric
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2012, 58 (05) : 3225 - 3234
  • [32] Analysis of the k-nearest neighbor classification
    Li, Jing
    Cheng, Ming
    INFORMATION SCIENCE AND MANAGEMENT ENGINEERING, VOLS 1-3, 2014, 46 : 1911 - 1917
  • [33] Weighted K-Nearest Neighbor Revisited
    Bicego, M.
    Loog, M.
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 1642 - 1647
  • [34] A Modified K-Nearest Neighbor Algorithm to Handle Uncertain Data
    Agrawal, Rashmi
    Ram, Babu
    2015 5TH INTERNATIONAL CONFERENCE ON IT CONVERGENCE AND SECURITY (ICITCS), 2015,
  • [35] A New K-Nearest Neighbors Classifier for Big Data Based on Efficient Data Pruning
    Saadatfar, Hamid
    Khosravi, Samiyeh
    Joloudari, Javad Hassannataj
    Mosavi, Amir
    Shamshirband, Shahaboddin
    MATHEMATICS, 2020, 8 (02)
  • [36] K-Nearest Neighbor Search by Random Projection Forests
    Yan, Donghui
    Wang, Yingjie
    Wang, Jin
    Wang, Honggang
    Li, Zhenpeng
    IEEE TRANSACTIONS ON BIG DATA, 2021, 7 (01) : 147 - 157
  • [37] A NOVEL INTERVAL TYPE-2 FUZZY K-NEAREST NEIGHBOR CLASSIFIER FOR REMOTELY SENSED HYPERSPECTRAL IMAGE CLASSIFICATION
    Yang, Jinn-Min
    2014 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2014, : 3718 - 3721
  • [38] K-nearest Neighbor Search by Random Projection Forests
    Yan, Donghui
    Wang, Yingjie
    Wang, Jin
    Wang, Honggang
    Li, Zhenpeng
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 4775 - 4781
  • [39] Classification of EEG Signals Using Dempster Shafer Theory and a K-Nearest Neighbor Classifier
    Yazdani, Ashkan
    Ebrahimi, Touradj
    Hoffmann, Utrich
    2009 4TH INTERNATIONAL IEEE/EMBS CONFERENCE ON NEURAL ENGINEERING, 2009, : 320 - +
  • [40] BAOA: Binary Arithmetic Optimization Algorithm With K-Nearest Neighbor Classifier for Feature Selection
    Khodadadi, Nima
    Khodadadi, Ehsan
    Al-Tashi, Qasem
    El-Kenawy, El-Sayed M.
    Abualigah, Laith
    Abdulkadir, Said Jadid
    Alqushaibi, Alawi
    Mirjalili, Seyedali
    IEEE ACCESS, 2023, 11 : 94094 - 94115