Classifier Ensemble Based on Multiview Optimization for High-Dimensional Imbalanced Data Classification

被引:15
|
作者
Xu, Yuhong [1 ]
Yu, Zhiwen [1 ]
Chen, C. L. Philip [1 ]
机构
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
关键词
Feature extraction; Optimization; Costs; Learning systems; Diversity reception; Data mining; Convolutional neural networks; Class imbalanced data; classification; ensemble learning; high-dimensional data; subview optimization; DATA-SETS; FEATURE-SELECTION; SMOTE; PERFORMANCE; PREDICTION; DIVERSITY; MACHINE; IMPROVE;
D O I
10.1109/TNNLS.2022.3177695
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High-dimensional class imbalanced data have plagued the performance of classification algorithms seriously. Because of a large number of redundant/invalid features and the class imbalanced issue, it is difficult to construct an optimal classifier for high-dimensional imbalanced data. Classifier ensemble has attracted intensive attention since it can achieve better performance than an individual classifier. In this work, we propose a multiview optimization (MVO) to learn more effective and robust features from high-dimensional imbalanced data, based on which an accurate and robust ensemble system is designed. Specifically, an optimized subview generation (OSG) in MVO is first proposed to generate multiple optimized subviews from different scenarios, which can strengthen the classification ability of features and increase the diversity of ensemble members simultaneously. Second, a new evaluation criterion that considers the distribution of data in each optimized subview is developed based on which a selective ensemble of optimized subviews (SEOS) is designed to perform the subview selective ensemble. Finally, an oversampling approach is executed on the optimized view to obtain a new class rebalanced subset for the classifier. Experimental results on 25 high-dimensional class imbalanced datasets indicate that the proposed method outperforms other mainstream classifier ensemble methods.
引用
收藏
页码:870 / 883
页数:14
相关论文
共 50 条
  • [1] Observation points classifier ensemble for high-dimensional imbalanced classification
    He, Yulin
    Li, Xu
    Fournier-Viger, Philippe
    Huang, Joshua Zhexue
    Li, Mianjie
    Salloum, Salman
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2023, 8 (02) : 500 - 517
  • [2] Adaptive Subspace Optimization Ensemble Method for High-Dimensional Imbalanced Data Classification
    Xu, Yuhong
    Yu, Zhiwen
    Chen, C. L. Philip
    Liu, Zhulin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (05) : 2284 - 2297
  • [3] Adaptive Classifier Ensemble Method Based on Spatial Perception for High-Dimensional Data Classification
    Xu, Yuhong
    Yu, Zhiwen
    Cao, Wenming
    Chen, C. L. Philip
    You, Jane
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (07) : 2847 - 2862
  • [4] A Novel Classifier Ensemble Method Based on Subspace Enhancement for High-Dimensional Data Classification
    Xu, Yuhong
    Yu, Zhiwen
    Cao, Wenming
    Chen, C. L. Philip
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (01) : 16 - 30
  • [5] Improved Contraction-Expansion Subspace Ensemble for High-Dimensional Imbalanced Data Classification
    Xu, Yuhong
    Yu, Zhiwen
    Chen, C. L. Philip
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (10) : 5194 - 5205
  • [6] Discriminative Ridge Machine: A Classifier for High-Dimensional Data or Imbalanced Data
    Peng, Chong
    Cheng, Qiang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (06) : 2595 - 2609
  • [7] Ensemble of Trees for Classifying High-Dimensional Imbalanced Genomic Data
    Farid, Dewan Md.
    Nowe, Ann
    Manderick, Bernard
    PROCEEDINGS OF SAI INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS) 2016, VOL 1, 2018, 15 : 172 - 187
  • [8] Ensemble Method for Classification of High-Dimensional Data
    Piao, Yongjun
    Park, Hyun Woo
    Jin, Cheng Hao
    Ryu, Keun Ho
    2014 INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2014, : 245 - +
  • [9] HIBoost: A hubness-aware ensemble learning algorithm for high-dimensional imbalanced data classification
    Wu, Qin
    Lin, Yaping
    Zhu, Tuanfei
    Zhang, Yue
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 39 (01) : 133 - 144
  • [10] An Ensemble Tree Classifier for Highly Imbalanced Data Classification
    Shi, Peibei
    Wang, Zhong
    JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2021, 34 (06) : 2250 - 2266