A New Fuzzy Adaptive Algorithm to Classify Imbalanced Data

被引:10
作者
Patel, Harshita [1 ]
Rajput, Dharmendra Singh [1 ]
Stan, Ovidiu Petru [2 ]
Miclea, Liviu Cristian [2 ]
机构
[1] Vellore Inst Technol, Sch Informat Technol & Engn, Vellore 632014, Tamil Nadu, India
[2] Tech Univ Cluj Napoca, Fac Automat & Comp Sci, Cluj Napoca 400114, Romania
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2022年 / 70卷 / 01期
关键词
Machine learning; fuzzy classification; nearest neighbor; adaptive approach; optimal weights; K-NEAREST NEIGHBOR; CLASSIFICATION; PERFORMANCE; TRENDS;
D O I
10.32604/cmc.2022.017114
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Classification of imbalanced data is a well explored issue in the data mining and machine learning community where one class representation is overwhelmed by other classes. The Imbalanced distribution of data is a natural occurrence in real world datasets, so needed to be dealt with carefully to get important insights. In case of imbalance in data sets, traditional classifiers have to sacrifice their performances, therefore lead to misclassifications. This paper suggests a weighted nearest neighbor approach in a fuzzy manner to deal with this issue. We have adapted the 'existing algorithm modification solution' to learn from imbalanced datasets that classify data without manipulating the natural distribution of data unlike the other popular data balancing meth-ods. The K nearest neighbor is a non-parametric classification method that is mostly used in machine learning problems. Fuzzy classification with the nearest neighbor clears the belonging of an instance to classes and optimal weights with improved nearest neighbor concept helping to correctly classify imbalanced data. The proposed hybrid approach takes care of imbalance nature of data and reduces the inaccuracies appear in applications of original and traditional classifiers. Results show that it performs well over the exist-ing fuzzy nearest neighbor and weighted neighbor strategies for imbalanced learning.
引用
收藏
页码:73 / 89
页数:17
相关论文
共 50 条
  • [41] An Adaptive Oversampling Technique for Imbalanced Datasets
    Shahee, Shaukat Ali
    Ananthakumar, Usha
    ADVANCES IN DATA MINING: APPLICATIONS AND THEORETICAL ASPECTS (ICDM 2018), 2018, 10933 : 1 - 16
  • [42] Novel Algorithm to Classify Sleep Stages
    Pham Son Lam
    Nguyen Dinh Son
    Hoang Phuong Chi
    Nguyen Thi Phuoc Van
    Nguyen Duc Minh
    2019 13TH INTERNATIONAL CONFERENCE ON SENSING TECHNOLOGY (ICST), 2019,
  • [43] A Novel Region Adaptive SMOTE Algorithm for Intrusion Detection on Imbalanced Problem
    Yan, BingHao
    Han, GuoDong
    Sun, MeiDong
    Ye, ShengZhao
    PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2017, : 1281 - 1286
  • [44] Neighborhood repartition-based oversampling algorithm for multiclass imbalanced data with label noise
    Shen, Shiyi
    Li, Zhixin
    Huan, Zhan
    Shang, Fanqi
    Wang, Yongsong
    Chen, Ying
    NEUROCOMPUTING, 2024, 600
  • [45] Combined Cleaning and Resampling algorithm for multi-class imbalanced data with label noise
    Koziarski, Michal
    Wozniak, Michal
    Krawczyk, Bartosz
    KNOWLEDGE-BASED SYSTEMS, 2020, 204 (204)
  • [46] An Adjusted Nearest Neighbor Algorithm Maximizing the F-Measure from Imbalanced Data
    Viola, Remi
    Emonet, Remi
    Habrard, Amaury
    Metzler, Guillaume
    Riou, Sebastien
    Sebban, Marc
    2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 243 - 250
  • [47] Novel Oversampling Algorithm for Handling Imbalanced Data Classification Novel Oversampling Algorithm
    More, Anjali S.
    Rana, Dipti P.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (08) : 491 - 496
  • [48] A comparative analysis of machine learning techniques for imbalanced data
    Mrad, Ali Ben
    Lahiani, Amine
    Mefteh-Wali, Salma
    Mselmi, Nada
    ANNALS OF OPERATIONS RESEARCH, 2024,
  • [49] New Data Level Approach for Imbalanced Data Classification Improvement
    Borowska, Katarzyna
    Topczewska, Magdalena
    PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS, CORES 2015, 2016, 403 : 283 - 294
  • [50] Oversampling technique based on fuzzy representativeness difference for classifying imbalanced data
    Ren, Ruonan
    Yang, Youlong
    Sun, Liqin
    APPLIED INTELLIGENCE, 2020, 50 (08) : 2465 - 2487