mCRF and mRD: Two Classification Methods Based on a Novel Multiclass Label Noise Filtering Learning Framework

被引:34
|
作者
Xia, Shuyin [1 ]
Chen, Baiyun [1 ]
Wang, Guoyin [1 ]
Zheng, Yong [1 ]
Gao, Xinbo [2 ]
Giem, Elisabeth [3 ]
Chen, Zizhong [3 ]
机构
[1] Chongqing Univ Telecommun & Posts, Coll Chongqing, Key Lab Computat Intelligence, Chongqing 400065, Peoples R China
[2] Chongqing Univ Posts & Telecommun, Chongqing 400065, Peoples R China
[3] Univ Calif Riverside, Dept Comp Sci & Engn, Riverside, CA 92521 USA
基金
中国国家自然科学基金;
关键词
Noise measurement; Training; Random forests; Tagging; Learning systems; Telecommunications; Support vector machines; Complete random forest; label noise; multiclass classification; relative density; VS-ONE STRATEGY; PERFORMANCE; SMOTE;
D O I
10.1109/TNNLS.2020.3047046
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mitigating label noise is a crucial problem in classification. Noise filtering is an effective method of dealing with label noise which does not need to estimate the noise rate or rely on any loss function. However, most filtering methods focus mainly on binary classification, leaving the more difficult counterpart problem of multiclass classification relatively unexplored. To remedy this deficit, we present a definition for label noise in a multiclass setting and propose a general framework for a novel label noise filtering learning method for multiclass classification. Two examples of noise filtering methods for multiclass classification, multiclass complete random forest (mCRF) and multiclass relative density, are derived from their binary counterparts using our proposed framework. In addition, to optimize the NI_threshold hyperparameter in mCRF, we propose two new optimization methods: a new voting cross-validation method and an adaptive method that employs a 2-means clustering algorithm. Furthermore, we incorporate SMOTE into our label noise filtering learning framework to handle the ubiquitous problem of imbalanced data in multiclass classification. We report experiments on both synthetic data sets and UCI benchmarks to demonstrate our proposed methods are highly robust to label noise in comparison with state-of-the-art baselines. All code and data results are available at https://github.com/syxiaa/Multiclass-Label-Noise-Filtering-Learning.
引用
收藏
页码:2916 / 2930
页数:15
相关论文
共 16 条
  • [1] A label noise filtering and label missing supplement framework based on game theory
    Liu, Yuwen
    Yao, Rongju
    Jia, Song
    Wang, Fan
    Wang, Ruili
    Ma, Rui
    Qi, Lianyong
    DIGITAL COMMUNICATIONS AND NETWORKS, 2023, 9 (04) : 887 - 895
  • [2] Label-Weighted Graph-Based Learning for Semi-Supervised Classification Under Label Noise
    Liang, Naiyao
    Yang, Zuyuan
    Chen, Junhang
    Li, Zhenni
    Xie, Shengli
    IEEE TRANSACTIONS ON BIG DATA, 2024, 10 (01) : 55 - 65
  • [3] Effects of Label Noise on Deep Learning-Based Skin Cancer Classification
    Hekler, Achim
    Kather, Jakob N.
    Krieghoff-Henning, Eva
    Utikal, Jochen S.
    Meier, Friedegund
    Gellrich, Frank F.
    Belzen, Julius Upmeier Zu
    French, Lars
    Schlager, Justin G.
    Ghoreschi, Kamran
    Wilhelm, Tabea
    Kutzner, Heinz
    Berking, Carola
    Heppt, Markus, V
    Haferkamp, Sebastian
    Sondermann, Wiebke
    Schadendorf, Dirk
    Schilling, Bastian
    Izar, Benjamin
    Maron, Roman
    Schmitt, Max
    Froehling, Stefan
    Lipka, Daniel B.
    Brinker, Titus J.
    FRONTIERS IN MEDICINE, 2020, 7
  • [4] Impact of Label Noise on the Learning Based Models for a Binary Classification of Physiological Signal
    Ding, Cheng
    Pereira, Tania
    Xiao, Ran
    Lee, Randall J.
    Hu, Xiao
    SENSORS, 2022, 22 (19)
  • [5] Novel breast cancer classification framework based on deep learning
    Salama, Wessam M.
    Elbagoury, Azza M.
    Aly, Moustafa H.
    IET IMAGE PROCESSING, 2020, 14 (13) : 3254 - 3259
  • [6] Joint Sample Position Based Noise Filtering and Mean Shift Clustering for Imbalanced Classification Learning
    Duan, Lilong
    Xue, Wei
    Huang, Jun
    Zheng, Xiao
    TSINGHUA SCIENCE AND TECHNOLOGY, 2024, 29 (01): : 216 - 231
  • [7] Joint negative-positive-learning based sample reweighting for hyperspectral image classification with label noise
    Liao, Qiming
    Zhao, Lin
    Luo, Wenqiang
    Li, Xinping
    Zhang, Guoyun
    PATTERN RECOGNITION LETTERS, 2024, 183 : 98 - 103
  • [8] Meta-learning-based sample discrimination framework for improving dynamic selection of classifiers under label noise
    Xu, Che
    Zhu, Yingming
    Zhu, Peng
    Cui, Longqing
    KNOWLEDGE-BASED SYSTEMS, 2024, 295
  • [9] A robust approach to model-based classification based on trimming and constraintsSemi-supervised learning in presence of outliers and label noise
    Andrea Cappozzo
    Francesca Greselin
    Thomas Brendan Murphy
    Advances in Data Analysis and Classification, 2020, 14 : 327 - 354
  • [10] A robust approach to model-based classification based on trimming and constraints Semi-supervised learning in presence of outliers and label noise
    Cappozzo, Andrea
    Greselin, Francesca
    Murphy, Thomas Brendan
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2020, 14 (02) : 327 - 354