The class-imbalance problem for high-dimensional class prediction

被引:7
|
作者
Lusa, Lara [1 ]
Blagus, Rok [1 ]
机构
[1] Univ Ljubljana, Inst Biostat & Med Informat, Ljubljana, Slovenia
来源
2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 2 | 2012年
关键词
class-imbalance; high -dimensional data; classification;
D O I
10.1109/ICMLA.2012.223
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The goal of class prediction studies is to develop rules to accurately predict the class membership of new subjects. The classifiers differ in the way they combine the values of the variables available for each subject. Frequently the classifiers are developed using class-imbalanced data, where the number of samples in each class is not equal. Standard classification methods used on class-imbalanced data are often biased towards the majority class: they classify most new samples in the majority class and they do not accurately predict the minority class. Data are high-dimensional when the number of variables greatly exceeds the number of subjects. In this paper we show how the high-dimensionality poses additional challenges when dealing with class-imbalanced prediction. Here we present new simulation studies for five classifiers, where we expand our previous results to correlated variables, and briefly discuss the results.
引用
收藏
页码:123 / 126
页数:4
相关论文
共 50 条
  • [41] The class imbalance problem
    Fadel M. Megahed
    Ying-Ju Chen
    Aly Megahed
    Yuya Ong
    Naomi Altman
    Martin Krzywinski
    Nature Methods, 2021, 18 : 1270 - 1272
  • [42] MEASURING CLASS-IMBALANCE SENSITIVITY OF DETERMINISTIC PERFORMANCE EVALUATION METRICS
    Ahmadzadeh, Azim
    Angryk, Rafal A.
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 51 - 55
  • [43] Ss-InfoGAN for Class-Imbalance Classification of Bearing Faults
    Wu, Jingyao
    Zhao, Zhibin
    Sun, Chuang
    Yan, Ruqiang
    Chen, Xuefeng
    PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON THROUGH-LIFE ENGINEERING SERVICES (TESCONF 2019), 2020, 49 : 99 - 104
  • [44] Novel resampling algorithms with maximal cliques for class-imbalance problems☆
    Wang, Long-hui
    Dai, Qi
    Du, Tony
    Chen, Li-fang
    COMPUTERS & INDUSTRIAL ENGINEERING, 2025, 199
  • [45] Graph-Based Class-Imbalance Learning With Label Enhancement
    Du, Guodong
    Zhang, Jia
    Jiang, Min
    Long, Jinyi
    Lin, Yaojin
    Li, Shaozi
    Tan, Kay Chen
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (09) : 6081 - 6095
  • [46] A systematic review for class-imbalance in semi-supervised learning
    Willian Dihanster Gomes de Oliveira
    Lilian Berton
    Artificial Intelligence Review, 2023, 56 : 2349 - 2382
  • [47] Large-Scale Distributed Sparse Class-Imbalance Learning
    Maurya, Chandresh Kumar
    Toshniwal, Durga
    INFORMATION SCIENCES, 2018, 456 : 1 - 12
  • [48] Ensemble of Cost-Sensitive Hypernetworks for Class-Imbalance Learning
    Wang, Jin
    Huang, Ping-li
    Sun, Kai-wei
    Cao, Bao-lin
    Zhao, Rui
    2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, : 1883 - 1888
  • [49] Towards Class-Imbalance Aware Multi-Label Learning
    Zhang, Min-Ling
    Li, Yu-Kun
    Liu, Xu-Ying
    PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 4041 - 4047
  • [50] An Ensemble Learning Approach with Gradient Resampling for Class-Imbalance Problems
    Zhao, Hongke
    Zhao, Chuang
    Zhang, Xi
    Liu, Nanlin
    Zhu, Hengshu
    Liu, Qi
    Xiong, Hui
    INFORMS JOURNAL ON COMPUTING, 2023, 35 (04) : 747 - 763