A fuzzy K-nearest neighbor classifier to deal with imperfect data

被引:17
|
作者
Cadenas, Jose M. [1 ]
Carmen Garrido, M. [1 ]
Martinez, Raquel [2 ]
Munoz, Enrique [3 ]
Bonissone, Piero P. [4 ]
机构
[1] Univ Murcia, Dept Informat & Commun Engn, Murcia, Spain
[2] Catholic Univ Murcia, Dept Comp Engn, Murcia, Spain
[3] Univ Milan, Dept Comp Sci, Crema, Italy
[4] Piero P Bonissone Analyt LLC, San Diego, CA USA
关键词
k-nearest neighbors; Classification; Imperfect data; Distance/dissimilarity measures; Combination methods; PERFORMANCE; RULES; ALGORITHMS;
D O I
10.1007/s00500-017-2567-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The k-nearest neighbors method (kNN) is a nonparametric, instance-based method used for regression and classification. To classify a new instance, the kNN method computes its k nearest neighbors and generates a class value from them. Usually, this method requires that the information available in the datasets be precise and accurate, except for the existence of missing values. However, data imperfection is inevitable when dealing with real-world scenarios. In this paper, we present the kNN(imp) classifier, a k-nearest neighbors method to perform classification from datasets with imperfect value. The importance of each neighbor in the output decision is based on relative distance and its degree of imperfection. Furthermore, by using external parameters, the classifier enables us to define the maximum allowed imperfection, and to decide if the final output could be derived solely from the greatest weight class (the best class) or from the best class and a weighted combination of the closest classes to the best one. To test the proposed method, we performed several experiments with both synthetic and real-world datasets with imperfect data. The results, validated through statistical tests, show that the kNN(imp) classifier is robust when working with imperfect data and maintains a good performance when compared with other methods in the literature, applied to datasets with or without imperfection.
引用
收藏
页码:3313 / 3330
页数:18
相关论文
共 50 条
  • [21] K-Nearest Neighbor Classifier for Signature Verification System
    Abdelrahaman, Ahmed A. A.
    Abdallah, Ahmed M. E.
    2013 INTERNATIONAL CONFERENCE ON COMPUTING, ELECTRICAL AND ELECTRONICS ENGINEERING (ICCEEE), 2013, : 58 - 62
  • [22] Feature-weighted k-nearest neighbor classifier
    Vivencio, Diego P.
    Hruschka, Estevarn R., Jr.
    Nicoletti, M. do Carmo
    dos Santos, Edimilson B.
    Galvao, Sebastian D. C. O.
    2007 IEEE SYMPOSIUM ON FOUNDATIONS OF COMPUTATIONAL INTELLIGENCE, VOLS 1 AND 2, 2007, : 481 - +
  • [23] An Improvement To The k-Nearest Neighbor Classifier For ECG Database
    Jaafar, Haryati
    Ramli, Nur Hidayah
    Nasir, Aimi Salihah Abdul
    MALAYSIAN TECHNICAL UNIVERSITIES CONFERENCE ON ENGINEERING AND TECHNOLOGY 2017 (MUCET 2017), 2018, 318
  • [24] Privacy-preserving k-Nearest Neighbor Classifier
    Xu J.
    Wang A.-D.
    Bi M.
    Zhou F.-C.
    Ruan Jian Xue Bao/Journal of Software, 2019, 30 (11): : 3503 - 3517
  • [25] A K-Nearest Neighbor Classifier for Ship Route Prediction
    Lo Duca, Angelica
    Bacciu, Clara
    Marchetti, Andrea
    OCEANS 2017 - ABERDEEN, 2017,
  • [26] Use of K-Nearest Neighbor classifier for intrusion detection
    Liao, YH
    Vemuri, VR
    COMPUTERS & SECURITY, 2002, 21 (05) : 439 - 448
  • [27] Comparative Analysis of K-Nearest Neighbor and Modified K-Nearest Neighbor Algorithm for Data Classification
    Okfalisa
    Mustakim
    Gazalba, Ikbal
    Reza, Nurul Gayatri Indah
    2017 2ND INTERNATIONAL CONFERENCES ON INFORMATION TECHNOLOGY, INFORMATION SYSTEMS AND ELECTRICAL ENGINEERING (ICITISEE): OPPORTUNITIES AND CHALLENGES ON BIG DATA FUTURE INNOVATION, 2017, : 294 - 298
  • [28] FUZZY K-NEAREST NEIGHBOR ALGORITHM.
    Keller, James M.
    Gray, Michael R.
    Givens, James A.
    IEEE Transactions on Systems, Man and Cybernetics, 1985, SMC-15 (04): : 580 - 585
  • [29] A New Fuzzy Rule-Based Initialization Method for K-Nearest Neighbor Classifier
    Chua, TeckWee
    Tan, WoeiWan
    2009 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, 2009, : 415 - 420
  • [30] Data Oriented Approximate K-Nearest Neighbor Classifier for Touch Modality Recognition
    Younes, Hamoud
    Ibrahim, Ali
    Rizk, Mostafa
    Valle, Maurizio
    2019 15TH CONFERENCE ON PHD RESEARCH IN MICROELECTRONICS AND ELECTRONICS (PRIME), 2019, : 241 - 244