mCRF and mRD: Two Classification Methods Based on a Novel Multiclass Label Noise Filtering Learning Framework

被引:34
|
作者
Xia, Shuyin [1 ]
Chen, Baiyun [1 ]
Wang, Guoyin [1 ]
Zheng, Yong [1 ]
Gao, Xinbo [2 ]
Giem, Elisabeth [3 ]
Chen, Zizhong [3 ]
机构
[1] Chongqing Univ Telecommun & Posts, Coll Chongqing, Key Lab Computat Intelligence, Chongqing 400065, Peoples R China
[2] Chongqing Univ Posts & Telecommun, Chongqing 400065, Peoples R China
[3] Univ Calif Riverside, Dept Comp Sci & Engn, Riverside, CA 92521 USA
基金
中国国家自然科学基金;
关键词
Noise measurement; Training; Random forests; Tagging; Learning systems; Telecommunications; Support vector machines; Complete random forest; label noise; multiclass classification; relative density; VS-ONE STRATEGY; PERFORMANCE; SMOTE;
D O I
10.1109/TNNLS.2020.3047046
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mitigating label noise is a crucial problem in classification. Noise filtering is an effective method of dealing with label noise which does not need to estimate the noise rate or rely on any loss function. However, most filtering methods focus mainly on binary classification, leaving the more difficult counterpart problem of multiclass classification relatively unexplored. To remedy this deficit, we present a definition for label noise in a multiclass setting and propose a general framework for a novel label noise filtering learning method for multiclass classification. Two examples of noise filtering methods for multiclass classification, multiclass complete random forest (mCRF) and multiclass relative density, are derived from their binary counterparts using our proposed framework. In addition, to optimize the NI_threshold hyperparameter in mCRF, we propose two new optimization methods: a new voting cross-validation method and an adaptive method that employs a 2-means clustering algorithm. Furthermore, we incorporate SMOTE into our label noise filtering learning framework to handle the ubiquitous problem of imbalanced data in multiclass classification. We report experiments on both synthetic data sets and UCI benchmarks to demonstrate our proposed methods are highly robust to label noise in comparison with state-of-the-art baselines. All code and data results are available at https://github.com/syxiaa/Multiclass-Label-Noise-Filtering-Learning.
引用
收藏
页码:2916 / 2930
页数:15
相关论文
共 16 条
  • [11] Classification of breast tumors by using a novel approach based on deep learning methods and feature selection
    Kutluer, Nizamettin
    Solmaz, Ozgen Arslan
    Yamacli, Volkan
    Eristi, Belkis
    Eristi, Huseyin
    BREAST CANCER RESEARCH AND TREATMENT, 2023, 200 (02) : 183 - 192
  • [12] Classification of breast tumors by using a novel approach based on deep learning methods and feature selection
    Nizamettin Kutluer
    Ozgen Arslan Solmaz
    Volkan Yamacli
    Belkis Eristi
    Huseyin Eristi
    Breast Cancer Research and Treatment, 2023, 200 : 183 - 192
  • [13] A Unified Label Noise-Tolerant Framework of Deep Learning-Based Fault Diagnosis via a Bounded Neural Network
    He, Sudao
    Ao, Wai Kei
    Ni, Yi-Qing
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 15
  • [14] Two-Phase Multitask Autoencoder-Based Deep Learning Framework for Subject-Independent EEG Motor Imagery Classification
    Jin, Changgyun
    Song, Andrew H.
    Kim, Seong-Eun
    IEEE ACCESS, 2024, 12 : 77356 - 77367
  • [15] TSDLPP: A Novel Two-Stage Deep Learning Framework For Prognosis Prediction Based on Whole Slide Histopathological Images
    Liu, Yu
    Li, Ao
    Liu, Jiangshu
    Meng, Gang
    Wang, Minghui
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (04) : 2523 - 2532
  • [16] Deep Learning Framework for Colorectal Cancer Classification Using ResNet18 Based on Dietary Habits Related to Meat Intake and Cooking Methods
    Prasath, S. Thanga
    Navaneethan, C.
    IEEE ACCESS, 2024, 12 : 99453 - 99468