A New Fuzzy Adaptive Algorithm to Classify Imbalanced Data

被引:10
作者
Patel, Harshita [1 ]
Rajput, Dharmendra Singh [1 ]
Stan, Ovidiu Petru [2 ]
Miclea, Liviu Cristian [2 ]
机构
[1] Vellore Inst Technol, Sch Informat Technol & Engn, Vellore 632014, Tamil Nadu, India
[2] Tech Univ Cluj Napoca, Fac Automat & Comp Sci, Cluj Napoca 400114, Romania
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2022年 / 70卷 / 01期
关键词
Machine learning; fuzzy classification; nearest neighbor; adaptive approach; optimal weights; K-NEAREST NEIGHBOR; CLASSIFICATION; PERFORMANCE; TRENDS;
D O I
10.32604/cmc.2022.017114
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Classification of imbalanced data is a well explored issue in the data mining and machine learning community where one class representation is overwhelmed by other classes. The Imbalanced distribution of data is a natural occurrence in real world datasets, so needed to be dealt with carefully to get important insights. In case of imbalance in data sets, traditional classifiers have to sacrifice their performances, therefore lead to misclassifications. This paper suggests a weighted nearest neighbor approach in a fuzzy manner to deal with this issue. We have adapted the 'existing algorithm modification solution' to learn from imbalanced datasets that classify data without manipulating the natural distribution of data unlike the other popular data balancing meth-ods. The K nearest neighbor is a non-parametric classification method that is mostly used in machine learning problems. Fuzzy classification with the nearest neighbor clears the belonging of an instance to classes and optimal weights with improved nearest neighbor concept helping to correctly classify imbalanced data. The proposed hybrid approach takes care of imbalance nature of data and reduces the inaccuracies appear in applications of original and traditional classifiers. Results show that it performs well over the exist-ing fuzzy nearest neighbor and weighted neighbor strategies for imbalanced learning.
引用
收藏
页码:73 / 89
页数:17
相关论文
共 50 条
  • [31] Multi-granularity relabeled under-sampling algorithm for imbalanced data
    Dai, Qi
    Liu, Jian-wei
    Liu, Yang
    APPLIED SOFT COMPUTING, 2022, 124
  • [32] Clustering Based Bagging Algorithm on Imbalanced Data Sets
    Sun, Xiao-Yan
    Zhang, Hua-Xiang
    Wang, Zhi-Chao
    INTEGRATED UNCERTAINTY IN KNOWLEDGE MODELLING AND DECISION MAKING, 2011, 7027 : 179 - 186
  • [33] The PerfSim Algorithm for Concept Drift Detection in Imbalanced Data
    Antwi, Daniel K.
    Viktor, Herna L.
    Japkowicz, Nathalie
    12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2012), 2012, : 619 - 628
  • [34] Resampling algorithm for imbalanced data based on their neighbor relationship
    Li R.-F.
    Li W.-H.
    Sun Y.-L.
    Wu Y.-Y.
    Gongcheng Kexue Xuebao/Chinese Journal of Engineering, 2021, 43 (06): : 862 - 869
  • [35] Exploring of clustering algorithm on class-imbalanced data
    Li Xuan
    Chen Zhigang
    Yang Fan
    PROCEEDINGS OF THE 2013 8TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION (ICCSE 2013), 2013, : 89 - 93
  • [36] Power distribution fault cause identification with imbalanced data using the data mining-based fuzzy classification E-algorithm
    Xu, Le
    Chow, Mo-Yuen
    Taylor, Leroy S.
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2007, 22 (01) : 164 - 171
  • [37] A MeanShift-guided oversampling with self-adaptive sizes for imbalanced data classification
    Tao, Xinmin
    Zhang, Xiaohan
    Zheng, Yujia
    Qi, Lin
    Fan, Zhiting
    Huang, Shan
    INFORMATION SCIENCES, 2024, 672
  • [38] Learning from class-imbalanced data: review of data driven methods and algorithm driven methods
    Huang, Cui Yin
    Dai, Hong Liang
    DATA SCIENCE IN FINANCE AND ECONOMICS, 2021, 1 (01): : 21 - 36
  • [39] A GEV-Based Classification Algorithm for Imbalanced Data
    Fu J.
    Liu G.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2018, 55 (11): : 2361 - 2371
  • [40] A resistance outlier sampling algorithm for imbalanced data prediction
    Pan, Xiaoying
    Jia, Rong
    Huang, Jiahao
    Wang, Hao
    INTELLIGENT DATA ANALYSIS, 2022, 26 (03) : 583 - 598