ReliaMatch: Semi-Supervised Classification with Reliable Match

被引:3
|
作者
Jiang, Tao [1 ]
Chen, Luyao [1 ]
Chen, Wanqing [1 ]
Meng, Wenjuan [2 ]
Qi, Peihan [3 ]
机构
[1] Xidian Univ, Sch Cyber Engn, Xian 710126, Peoples R China
[2] Northwest A&F Univ, Coll Informat Engn, Xianyang 712100, Peoples R China
[3] Xidian Univ, State Key Lab Integrated Serv Networks, Xian 710071, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 15期
基金
中国国家自然科学基金;
关键词
deep learning; semi-supervised learning; pseudo labels; classification; ReliaMatch;
D O I
10.3390/app13158856
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Deep learning has been widely used in various tasks such as computer vision, natural language processing, predictive analysis, and recommendation systems in the past decade. However, practical scenarios often lack labeled data, posing challenges for traditional supervised methods. Semi-supervised classification methods address this by leveraging both labeled and unlabeled data to enhance model performance, but they face challenges in effectively utilizing unlabeled data and distinguishing reliable information from unreliable sources. This paper introduced ReliaMatch, a semi-supervised classification method that addresses these challenges by using a confidence threshold. It incorporates a curriculum learning stage, feature filtering, and pseudo-label filtering to improve classification accuracy and reliability. The feature filtering module eliminates ambiguous semantic features by comparing labeled and unlabeled data in the feature space. The pseudo-label filtering module removes unreliable pseudo-labels with low confidence, enhancing algorithm reliability. ReliaMatch employs a curriculum learning training mode, gradually increasing training dataset difficulty by combining selected samples and pseudo-labels with labeled data. This supervised approach enhances classification performance. Experimental results show that ReliaMatch effectively overcomes challenges associated with the underutilization of unlabeled data and the introduction of error information, outperforming the pseudo-label strategy in semi-supervised classification.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Reliable Semi-supervised Learning
    Shao, Junming
    Huang, Chen
    Yang, Qinli
    Luo, Guangchun
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 1197 - 1202
  • [2] Dual Pseudo Supervision for Semi-Supervised Text Classification with a Reliable Teacher
    Li, Shujie
    Yang, Min
    Li, Chengming
    Xu, Ruifeng
    PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 2513 - 2518
  • [3] Semi-supervised classification trees
    Levatic, Jurica
    Ceci, Michelangelo
    Kocev, Dragi
    Dzeroski, Saso
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2017, 49 (03) : 461 - 486
  • [4] Watersheds for Semi-Supervised Classification
    Challa, Aditya
    Danda, Sravan
    Sagar, B. S. Daya
    Najman, Laurent
    IEEE SIGNAL PROCESSING LETTERS, 2019, 26 (05) : 720 - 724
  • [5] Semi-supervised classification trees
    Jurica Levatić
    Michelangelo Ceci
    Dragi Kocev
    Sašo Džeroski
    Journal of Intelligent Information Systems, 2017, 49 : 461 - 486
  • [6] Mutual match for semi-supervised online evolutive learning
    Di Li
    Xiaoguang Zhu
    Liang Song
    Applied Intelligence, 2023, 53 : 3336 - 3350
  • [7] Mutual match for semi-supervised online evolutive learning
    Li, Di
    Zhu, Xiaoguang
    Song, Liang
    APPLIED INTELLIGENCE, 2023, 53 (03) : 3336 - 3350
  • [8] Semi-Supervised Learning for ECG Classification
    Rodrigues, Rui
    Couto, Paula
    2021 COMPUTING IN CARDIOLOGY (CINC), 2021,
  • [9] Semi-Supervised Network Traffic Classification
    Erman, Jeffrey
    Mahanti, Anirban
    Arlitt, Martin
    Cohen, Ira
    Williamson, Carey
    SIGMETRICS'07: PROCEEDINGS OF THE 2007 INTERNATIONAL CONFERENCE ON MEASUREMENT & MODELING OF COMPUTER SYSTEMS, 2007, 35 (01): : 369 - 370
  • [10] Augmentation Learning for Semi-Supervised Classification
    Frommknecht, Tim
    Zipf, Pedro Alves
    Fan, Quanfu
    Shvetsova, Nina
    Kuehne, Hilde
    PATTERN RECOGNITION, DAGM GCPR 2022, 2022, 13485 : 85 - 98