Robust multiclass classification for learning from imbalanced biomedical data

被引:0
|
作者
Phoungphol, Piyaphol [1 ]
Zhang, Yanqing [1 ]
Zhao, Yichuan [2 ]
机构
[1] Department of Computer Science, Georgia State University, Atlanta, GA 30302-3994, United States
[2] Department of Mathematics and Statistics, Georgia State University, Atlanta, GA 30302-3994, USA, United States
关键词
Biomedical data - Classification performance - Classification tasks - Imbalanced data - Multi-class classification - Multiclass support vector machines - Research efforts - Research interests;
D O I
10.1109/TST.2012.6374363
中图分类号
学科分类号
摘要
Imbalanced data is a common and serious problem in many biomedical classification tasks. It causes a bias on the training of classifiers and results in lower accuracy of minority classes prediction. This problem has attracted a lot of research interests in the past decade. Unfortunately, most research efforts only concentrate on 2-class problems. In this paper, we study a new method of formulating a multiclass Support Vector Machine (SVM) problem for imbalanced biomedical data to improve the classification performance. The proposed method applies cost-sensitive approach and ramp loss function to the Crammer and Singer multiclass SVM formulation. Experimental results on multiple biomedical datasets show that the proposed solution can effectively cure the problem when the datasets are noisy and highly imbalanced. © 1996-2012 Tsinghua University Press.
引用
收藏
页码:619 / 628
相关论文
共 50 条
  • [31] Customized Instance Random Undersampling to Increase Knowledge Management for Multiclass Imbalanced Data Classification
    Tusell-Rey, Claudia C.
    Camacho-Nieto, Oscar
    Yanez-Marquez, Cornelio
    Villuendas-Rey, Yenny
    SUSTAINABILITY, 2022, 14 (21)
  • [32] A comprehensive active learning method for multiclass imbalanced data streams with concept drift
    Liu, Weike
    Zhang, Hang
    Ding, Zhaoyun
    Liu, Qingbao
    Zhu, Cheng
    KNOWLEDGE-BASED SYSTEMS, 2021, 215
  • [33] Learning from Imbalanced Data
    He, Haibo
    Garcia, Edwardo A.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (09) : 1263 - 1284
  • [34] ENSEMBLE CLASSIFIER AND RESAMPLING FOR IMBALANCED MULTICLASS LEARNING
    Sainin, Mohd Shamrie
    Ahmad, Faudziah
    Alfred, Rayner
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON COMPUTING & INFORMATICS, 2015, : 751 - 756
  • [35] A Direct Ensemble Classifier for Imbalanced Multiclass Learning
    Sainin, Mohd Shamrie
    Alfred, Rayner
    2012 4TH CONFERENCE ON DATA MINING AND OPTIMIZATION (DMO), 2012, : 59 - 66
  • [36] Robust semi-supervised classification for imbalanced and incomplete data
    Chen, Mengxing
    Dou, Jun
    Fan, Yali
    Song, Yan
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (02) : 2781 - 2797
  • [37] LICIC: Less Important Components for Imbalanced Multiclass Classification
    Dentamaro, Vincenzo
    Impedovo, Donato
    Pirlo, Giuseppe
    INFORMATION, 2018, 9 (12)
  • [38] On Validation Setup for Multiclass Imbalanced Data Sets
    Silva, Evandro J. R.
    Zanchettin, Cleber
    PROCEEDINGS OF 2016 5TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS 2016), 2016, : 468 - 473
  • [39] Imbalanced Data Classification Method Based on Ensemble Learning
    Xiang, Yu
    Xie, Yongping
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, CSPS 2018, VOL III: SYSTEMS, 2020, 517 : 18 - 24
  • [40] Multiset Feature Learning for Highly Imbalanced Data Classification
    Jing, Xiao-Yuan
    Zhang, Xinyu
    Zhu, Xiaoke
    Wu, Fei
    You, Xinge
    Gao, Yang
    Shan, Shiguang
    Yang, Jing-Yu
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (01) : 139 - 156