Undersampling Instance Selection for Hybrid and Incomplete Imbalanced Data

被引:0
|
作者
Camacho-Nieto, Oscar [1 ]
Yanez-Marquez, Cornelio [2 ]
Villuendas-Rey, Yenny [1 ]
机构
[1] Inst Politecn Nacl, CIDETEC, Cdmx, Mexico
[2] Inst Politecn Nacl, CIC, Cdmx, Mexico
关键词
undersampling; imbalanced data; hybrid and incomplete data; SOFTWARE TOOL; DATA-SETS; CLASSIFICATION; ALGORITHMS; ENSEMBLES; KEEL;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper proposes a novel undersampling method, for dealing with imbalanced datasets. The proposal is based on a novel instance importance measure (also introduced in this paper), and is able to balance hybrid and incomplete data. The numerical experiments carried out show the proposed undersampling algorithm outperforms others algorithms of the state of art, in well-known imbalanced datasets.
引用
收藏
页码:698 / 719
页数:22
相关论文
共 50 条
  • [21] An approach for classification of highly imbalanced data using weighting and undersampling
    Ashish Anand
    Ganesan Pugalenthi
    Gary B. Fogel
    P. N. Suganthan
    Amino Acids, 2010, 39 : 1385 - 1391
  • [22] SELF-CONFIGURING HYBRID EVOLUTIONARY ALGORITHM FOR FUZZY IMBALANCED CLASSIFICATION WITH ADAPTIVE INSTANCE SELECTION
    Stanovov, Vladimir
    Semenkin, Eugene
    Semenkina, Olga
    JOURNAL OF ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING RESEARCH, 2016, 6 (03) : 173 - 188
  • [23] Overlap-Based Undersampling for Improving Imbalanced Data Classification
    Vuttipittayamongkol, Pattaramon
    Elyan, Eyad
    Petrovski, Andrei
    Jayne, Chrisina
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2018, PT I, 2018, 11314 : 689 - 697
  • [24] Threshold optimization and random undersampling for imbalanced credit card data
    Leevy, Joffrey L. L.
    Johnson, Justin M. M.
    Hancock, John
    Khoshgoftaar, Taghi M. M.
    JOURNAL OF BIG DATA, 2023, 10 (01)
  • [25] Threshold optimization and random undersampling for imbalanced credit card data
    Joffrey L. Leevy
    Justin M. Johnson
    John Hancock
    Taghi M. Khoshgoftaar
    Journal of Big Data, 10
  • [26] An approach for classification of highly imbalanced data using weighting and undersampling
    Anand, Ashish
    Pugalenthi, Ganesan
    Fogel, Gary B.
    Suganthan, P. N.
    AMINO ACIDS, 2010, 39 (05) : 1385 - 1391
  • [27] A Membership Probability-Based Undersampling Algorithm for Imbalanced Data
    Ahn, Gilseung
    Park, You-Jin
    Hur, Sun
    JOURNAL OF CLASSIFICATION, 2021, 38 (01) : 2 - 15
  • [28] A First Attempt on Global Evolutionary Undersampling for Imbalanced Big Data
    Triguero, I.
    Galar, M.
    Bustince, H.
    Herrera, F.
    2017 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2017, : 2054 - 2061
  • [29] Clustering-based undersampling in class-imbalanced data
    Lin, Wei-Chao
    Tsai, Chih-Fong
    Hu, Ya-Han
    Jhang, Jing-Shang
    INFORMATION SCIENCES, 2017, 409 : 17 - 26
  • [30] Undersampling method based on minority class density for imbalanced data
    Sun, Zhongqiang
    Ying, Wenhao
    Zhang, Wenjin
    Gong, Shengrong
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249