DISCo: Distilled Student Models Co-training for Semi-supervised Text Mining

被引:0
|
作者
Jiang, Weifeng [1 ,2 ]
Mao, Qianren [2 ]
Lin, Chenghua [3 ]
Li, Jianxin [2 ,4 ]
Deng, Ting [4 ]
Yang, Weiyi [4 ]
Wang, Zheng [5 ]
机构
[1] Nanyang Technol Univ, SCSE, Singapore, Singapore
[2] Zhongguancun Lab, Beijing, Peoples R China
[3] Univ Manchester, Dept Comp Sci, Manchester, Lancs, England
[4] Beihang Univ, Sch Comp Sci & Engn, Beijing, Peoples R China
[5] Univ Leeds, Sch Comp, Leeds, W Yorkshire, England
来源
2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2023 | 2023年
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many text mining models are constructed by fine-tuning a large deep pre-trained language model (PLM) in downstream tasks. However, a significant challenge nowadays is maintaining performance when we use a lightweight model with limited labelled samples. We present DisCo, a semi-supervised learning (SSL) framework for fine-tuning a cohort of small student models generated from a large PLM using knowledge distillation. Our key insight is to share complementary knowledge among distilled student cohorts to promote their SSL effectiveness. DisCo employs a novel co-training technique to optimize a cohort of multiple small student models by promoting knowledge sharing among students under diversified views: model views produced by different distillation strategies and data views produced by various input augmentations. We evaluate DisCo on both semi-supervised text classification and extractive summarization tasks. Experimental results show that DisCo can produce student models that are 7.6x smaller and 4.8x faster in inference than the baseline PLMs while maintaining comparable performance. We also show that DisCo-generated student models outperform the similar-sized models elaborately tuned in distinct tasks.
引用
收藏
页码:4015 / 4030
页数:16
相关论文
共 50 条
  • [31] Question classification based on co-training style semi-supervised learning
    Yu, Zhengtao
    Su, Lei
    Li, Lina
    Zhao, Quan
    Mao, Cunli
    Guo, Jianyi
    PATTERN RECOGNITION LETTERS, 2010, 31 (13) : 1975 - 1980
  • [32] Using co-training and self-training in semi-supervised multiple classifier systems
    Didaci, Luca
    Roli, Fabio
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, PROCEEDINGS, 2006, 4109 : 522 - 530
  • [33] Abnormal Voice Detection Algorithm Based on Semi-supervised Co-training Algorithm
    Zhao, YaHui
    Wang, HongLi
    Cui, RongYi
    ADVANCED BUILDING MATERIALS AND STRUCTURAL ENGINEERING, 2012, 461 : 117 - 122
  • [34] Semi-Supervised Root-Cause Analysis with Co-Training for Integrated Systems
    Pan, Renjian
    Li, Xin
    Chakrabarty, Krishnendu
    2022 IEEE 40TH VLSI TEST SYMPOSIUM (VTS), 2022,
  • [35] SEMI-SUPERVISED CO-TRAINING AND ACTIVE LEARNING FRAMEWORK FOR HYPERSPECTRAL IMAGE CLASSIFICATION
    Samiappan, Sathishkumar
    Moorhead, Robert J., II
    2015 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2015, : 401 - 404
  • [36] Development of Co-training Support Vector Machine Model for Semi-supervised Classification
    Chen, Yinghao
    Pan, Tianhong
    Chen, Shan
    PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 11077 - 11080
  • [37] SEMI-SUPERVISED PYRAMID FEATURE CO-TRAINING NETWORK FOR LIDAR DATA CLASSIFICATION
    Wang, Zexin
    Wang, Haoran
    Jiao, Licheng
    Liu, Xu
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 2471 - 2474
  • [38] Semi-supervised instance object detection method based on SVD co-training
    Wang R.
    Fan S.
    Xu J.
    Wen Z.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2023, 31 (13): : 2000 - 2007
  • [39] GCT: Graph Co-Training for Semi-Supervised Few-Shot Learning
    Xu, Rui
    Xing, Lei
    Shao, Shuai
    Zhao, Lifei
    Liu, Baodi
    Liu, Weifeng
    Zhou, Yicong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (12) : 8674 - 8687
  • [40] Addressing Cold Start in Recommender Systems: A Semi-supervised Co-training Algorithm
    Zhang, Mi
    Tang, Jie
    Zhang, Xuchen
    Xue, Xiangyang
    SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014, : 73 - 82