An Empirical Analysis of Data Reduction Techniques for k-NN Classification

被引:0
|
作者
Eleftheriadis, Stylianos [1 ]
Evangelidis, Georgios [1 ]
Ougiaroglou, Stefanos [2 ]
机构
[1] Univ Macedonia, Sch Informat Sci, Dept Appl Informat, Thessaloniki 54636, Greece
[2] Int Hellen Univ, Sch Engn, Dept Informat & Elect Engn, Thessaloniki 57400, Greece
来源
ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, PT IV, AIAI 2024 | 2024年 / 714卷
关键词
prototype generation; prototype selection; data reduction techniques; data mining; data cleaning; PROTOTYPE SELECTION; NEAREST;
D O I
10.1007/978-3-031-63223-5_7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This study explores Data Reduction Techniques (DRTs) in the realm of lazy classification algorithms like k-NN, focusing on Prototype Selection (PS) and Prototype Generation (PG) methods. The research provides an in-depth examination of these methodologies, categorizing DRTs into two primary categories: PS and PG, and further dividing them into three sub-categories: condensation methods, edition methods, and hybrid methods. An experimental study compares a total of 20 new and state-of-the-art DRTs across 20 datasets. The objective is to draw performance conclusions within both the primary and subcategories, offering valuable insights into how these techniques enhance the effectiveness and robustness of the k-NN classifier. The paper provides a comprehensive overview of DRTs, clarifying their strategies and relative performances.
引用
收藏
页码:83 / 97
页数:15
相关论文
共 50 条
  • [21] Hesitant k-Nearest Neighbor (HK-nn) Classifier for Document Classification and Numerical Result Analysis
    Sahu, Neeraj
    Thakur, R. S.
    Thakur, G. S.
    PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON SOFT COMPUTING FOR PROBLEM SOLVING (SOCPROS 2012), 2014, 236 : 631 - 638
  • [22] EPRENNID: An evolutionary prototype reduction based ensemble for nearest neighbor classification of imbalanced data
    Vluymans, Sarah
    Triguero, Isaac
    Cornelis, Chris
    Saeys, Yvan
    NEUROCOMPUTING, 2016, 216 : 596 - 610
  • [23] Comprehensive Analysis of Various Big Data Classification Techniques: A Challenging Overview
    Abdalla, Hemn Barzan
    Abuhaija, Belal
    JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2023, 22 (01)
  • [24] CHRONIC KIDNEY DISEASE ANALYSIS USING DATA MINING CLASSIFICATION TECHNIQUES
    Kunwar, Veenita
    Chandel, Khushboo
    Sabitha, A. Sai
    Bansal, Abhay
    2016 6TH INTERNATIONAL CONFERENCE - CLOUD SYSTEM AND BIG DATA ENGINEERING (CONFLUENCE), 2016, : 300 - 305
  • [25] Determining Consumer Default Risk with Data Mining Techniques: An Empirical Analysis on Turkey
    Cigsar, Begum
    Boga, Semra
    Unal, Deniz
    INTERNATIONAL JOURNAL OF CONTEMPORARY ECONOMICS AND ADMINISTRATIVE SCIENCES, 2023, 13 (01): : 85 - 100
  • [26] An Empirical Study on applying Data Mining Techniques for the Analysis and Prediction of Heart Disease
    Sivagowry, S.
    Durairaj, M.
    Persia, A.
    2013 INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS (ICICES), 2013, : 265 - 270
  • [27] An empirical analysis on auto corporation training program planning by data mining techniques
    Lin, W. T.
    Wang, S. J.
    Wu, Y. C.
    Ye, T. C.
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (05) : 5841 - 5850
  • [28] Data reduction techniques to support data mining
    Dullea, James
    Patel, Namrata
    Koka, Divya
    WMSCI 2005: 9th World Multi-Conference on Systemics, Cybernetics and Informatics, Vol 1, 2005, : 273 - 277
  • [29] A Study On Classification Techniques in Data Mining
    Kesavaraj, G.
    Sukumaran, S.
    2013 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATIONS AND NETWORKING TECHNOLOGIES (ICCCNT), 2013,
  • [30] A comprehensive review on data preprocessing techniques in data analysis
    Cetin, Volkan
    Yildiz, Oktay
    PAMUKKALE UNIVERSITY JOURNAL OF ENGINEERING SCIENCES-PAMUKKALE UNIVERSITESI MUHENDISLIK BILIMLERI DERGISI, 2022, 28 (02): : 299 - 312