Undersampling Instance Selection for Hybrid and Incomplete Imbalanced Data

Cited by: 0
Authors
Camacho-Nieto, Oscar [1]
Yanez-Marquez, Cornelio [2]
Villuendas-Rey, Yenny [1]
Affiliations
[1] Inst Politecn Nacl, CIDETEC, Cdmx, Mexico
[2] Inst Politecn Nacl, CIC, Cdmx, Mexico
Keywords
undersampling; imbalanced data; hybrid and incomplete data; SOFTWARE TOOL; DATA-SETS; CLASSIFICATION; ALGORITHMS; ENSEMBLES; KEEL;
DOI
Not available
Chinese Library Classification number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
This paper proposes a novel undersampling method for dealing with imbalanced datasets. The proposal is based on a novel instance importance measure (also introduced in this paper) and is able to balance hybrid and incomplete data. The numerical experiments carried out show that the proposed undersampling algorithm outperforms other state-of-the-art algorithms on well-known imbalanced datasets.
Pages: 698-719
Page count: 22