Undersampling Instance Selection for Hybrid and Incomplete Imbalanced Data

被引:0
|
作者
Camacho-Nieto, Oscar [1 ]
Yanez-Marquez, Cornelio [2 ]
Villuendas-Rey, Yenny [1 ]
机构
[1] Inst Politecn Nacl, CIDETEC, Cdmx, Mexico
[2] Inst Politecn Nacl, CIC, Cdmx, Mexico
关键词
undersampling; imbalanced data; hybrid and incomplete data; SOFTWARE TOOL; DATA-SETS; CLASSIFICATION; ALGORITHMS; ENSEMBLES; KEEL;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper proposes a novel undersampling method, for dealing with imbalanced datasets. The proposal is based on a novel instance importance measure (also introduced in this paper), and is able to balance hybrid and incomplete data. The numerical experiments carried out show the proposed undersampling algorithm outperforms others algorithms of the state of art, in well-known imbalanced datasets.
引用
收藏
页码:698 / 719
页数:22
相关论文
共 50 条
  • [31] A hybrid stacking classifier with feature selection for handling imbalanced data
    Abraham A.
    Kayalvizhi R.
    Mohideen H.S.
    Journal of Intelligent and Fuzzy Systems, 2024, 46 (04): : 9103 - 9117
  • [32] Undersampling with Support Vectors for Multi-Class Imbalanced Data Classification
    Krawczyk, Bartosz
    Bellinger, Colin
    Corizzo, Roberto
    Japkowicz, Nathalie
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [33] Neighbourhood-based undersampling approach for handling imbalanced and overlapped data
    Vuttipittayamongkol, Pattaramon
    Elyan, Eyad
    INFORMATION SCIENCES, 2020, 509 : 47 - 70
  • [34] PSU: Particle Stacking Undersampling Method for Highly Imbalanced Big Data
    Jeon, Yong-Seok
    Lim, Dong-Joon
    IEEE ACCESS, 2020, 8 : 131920 - 131927
  • [35] Fuzzy Distance-based Undersampling Technique for Imbalanced Flood Data
    Mahamud, Ku Ruhana Ku
    Zorkeflee, Maisarah
    Din, Aniza Mohamed
    PROCEEDINGS OF KNOWLEDGE MANAGEMENT INTERNATIONAL CONFERENCE (KMICE) 2016, 2016, : 509 - 513
  • [36] A fuzzy rough set-based undersampling approach for imbalanced data
    Zhang, Xiao
    He, Zhaoqian
    Yang, Yanyan
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (07) : 2799 - 2810
  • [37] CSMOUTE: Combined Synthetic Oversampling and Undersampling Technique for Imbalanced Data Classification
    Koziarski, Michal
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [38] A novel oversampling and feature selection hybrid algorithm for imbalanced data classification
    Feng, Fang
    Li, Kuan-Ching
    Yang, Erfu
    Zhou, Qingguo
    Han, Lihong
    Hussain, Amir
    Cai, Mingjiang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (03) : 3231 - 3267
  • [39] A novel oversampling and feature selection hybrid algorithm for imbalanced data classification
    Fang Feng
    Kuan-Ching Li
    Erfu Yang
    Qingguo Zhou
    Lihong Han
    Amir Hussain
    Mingjiang Cai
    Multimedia Tools and Applications, 2023, 82 : 3231 - 3267
  • [40] UFFDFR: Undersampling framework with denoising, fuzzy c-means clustering, and representative sample selection for imbalanced data classification
    Zheng, Ming
    Li, Tong
    Zheng, Xiaoyao
    Yu, Qingying
    Chen, Chuanming
    Zhou, Ding
    Lv, Changlong
    Yang, Weiyi
    INFORMATION SCIENCES, 2021, 576 (576) : 658 - 680