Dual Pseudo Supervision for Semi-Supervised Text Classification with a Reliable Teacher

被引:4
|
作者
Li, Shujie [1 ,5 ]
Yang, Min [2 ]
Li, Chengming [3 ]
Xu, Ruifeng [4 ]
机构
[1] Univ Sci & Technol China, Hefei, Peoples R China
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen, Peoples R China
[3] Sun Yat Sen Univ, Sch Intelligent Syst Engn, Shenzhen, Peoples R China
[4] Harbin Inst Technol, Peng Cheng Lab, Shenzhen, Peoples R China
[5] Chinese Acad Sci, SIAT, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Semi-supervised text classification; Pseudo labeling; Meta Learning; Consistency regularization;
D O I
10.1145/3477495.3531887
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we study the semi-supervised text classification (SSTC) by exploring both labeled and extra unlabeled data. One of the most popular SSTC techniques is pseudo-labeling which assigns pseudo labels for unlabeled data via a teacher classifier trained on labeled data. These pseudo labeled data is then applied to train a student classifier. However, when the pseudo labels are inaccurate, the student classifier will learn from inaccurate data and get even worse performance than the teacher. To mitigate this issue, we propose a simple yet efficient pseudo-labeling framework called Dual Pseudo Supervision (DPS), which exploits the feedback signal from the student to guide the teacher to generate better pseudo labels. In particular, we alternately update the student based on the pseudo labeled data annotated by the teacher and optimize the teacher based on the student's performance via meta learning. In addition, we also design a consistency regularization term to further improve the stability of the teacher. With the above two strategies, the learned reliable teacher can provide more accurate pseudo-labels to the student and thus improve the overall performance of text classification. We conduct extensive experiments on three benchmark datasets (i.e., AG News, Yelp and Yahoo) to verify the effectiveness of our DPS method. Experimental results show that our approach achieves substantially better performance than the strong competitors. For reproducibility, we will release our code and data of this paper publicly at https://github.com/GRIT621/DPS.
引用
收藏
页码:2513 / 2518
页数:6
相关论文
共 50 条
  • [1] Prototype-Guided Pseudo Labeling for Semi-Supervised Text Classification
    Yang, Weiyi
    Zhang, Richong
    Chen, Junfan
    Wang, Lihong
    Kim, Jaein
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 16369 - 16382
  • [2] Cross teacher pseudo supervision: Enhancing semi-supervised crack segmentation with consistency learning
    Jian, Zheng
    Liu, Jianbo
    ADVANCED ENGINEERING INFORMATICS, 2024, 59
  • [3] An Exploration of Semi-supervised Text Classification
    Lien, Henrik
    Biermann, Daniel
    Palumbo, Fabrizio
    Goodwin, Morten
    ENGINEERING APPLICATIONS OF NEURAL NETWORKS, EAAAI/EANN 2022, 2022, 1600 : 477 - 488
  • [4] Semi-supervised collaborative text classification
    Jin, Rong
    Wu, Ming
    Sukthankar, Rahul
    MACHINE LEARNING: ECML 2007, PROCEEDINGS, 2007, 4701 : 600 - +
  • [5] Semi-Supervised Semantic Segmentation with Cross Pseudo Supervision
    Chen, Xiaokang
    Yuan, Yuhui
    Zeng, Gang
    Wang, Jingdong
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 2613 - 2622
  • [6] Improving Semi-Supervised Text Classification with Dual Meta-Learning
    Li, Shujie
    Yuan, Guanghu
    Yang, Min
    Shen, Ying
    Li, Chengming
    Xu, Ruifeng
    Zhao, Xiaoyan
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2024, 42 (04)
  • [7] ReliaMatch: Semi-Supervised Classification with Reliable Match
    Jiang, Tao
    Chen, Luyao
    Chen, Wanqing
    Meng, Wenjuan
    Qi, Peihan
    APPLIED SCIENCES-BASEL, 2023, 13 (15):
  • [8] A review of semi-supervised learning for text classification
    José Marcio Duarte
    Lilian Berton
    Artificial Intelligence Review, 2023, 56 : 9401 - 9469
  • [9] Robust Semi-supervised Medical Image Classification: Leveraging Reliable Pseudo-labels
    Kumar, Devesh
    Sikka, Geeta
    Singh, Samayveer
    COMPUTER VISION AND IMAGE PROCESSING, CVIP 2023, PT III, 2024, 2011 : 27 - 38
  • [10] Text Classification Using Semi-Supervised Clustering
    Zhang, Wen
    Yoshida, Taketoshi
    Tang, Xijin
    2009 INTERNATIONAL CONFERENCE ON BUSINESS INTELLIGENCE AND FINANCIAL ENGINEERING, PROCEEDINGS, 2009, : 197 - 200