An Empirical Analysis of Data Reduction Techniques for k-NN Classification

Cited by: 0
Authors
Eleftheriadis, Stylianos [1]
Evangelidis, Georgios [1]
Ougiaroglou, Stefanos [2]
Affiliations
[1] Univ Macedonia, Sch Informat Sci, Dept Appl Informat, Thessaloniki 54636, Greece
[2] Int Hellen Univ, Sch Engn, Dept Informat & Elect Engn, Thessaloniki 57400, Greece
Source
ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, PT IV, AIAI 2024 | 2024 / Vol. 714
Keywords
prototype generation; prototype selection; data reduction techniques; data mining; data cleaning; PROTOTYPE SELECTION; NEAREST;
DOI
10.1007/978-3-031-63223-5_7
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This study explores Data Reduction Techniques (DRTs) in the realm of lazy classification algorithms like k-NN, focusing on Prototype Selection (PS) and Prototype Generation (PG) methods. The research provides an in-depth examination of these methodologies, categorizing DRTs into two primary categories: PS and PG, and further dividing them into three sub-categories: condensation methods, edition methods, and hybrid methods. An experimental study compares a total of 20 new and state-of-the-art DRTs across 20 datasets. The objective is to draw performance conclusions within both the primary and subcategories, offering valuable insights into how these techniques enhance the effectiveness and robustness of the k-NN classifier. The paper provides a comprehensive overview of DRTs, clarifying their strategies and relative performances.
Pages: 83-97
Number of pages: 15