T5-Based Model for Abstractive Summarization: A Semi-Supervised Learning Approach with Consistency Loss Functions

被引:3
|
作者
Wang, Mingye [1 ]
Xie, Pan [1 ]
Du, Yao [1 ]
Hu, Xiaohui [2 ]
机构
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
[2] Chinese Acad Sci, Inst Software, Sci & Technol Integrated Informat Syst Lab, Beijing 100045, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 12期
关键词
natural language processing; automatic text summarization; abstractive summarization; semi-supervised learning; consistency loss function;
D O I
10.3390/app13127111
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Text summarization is a prominent task in natural language processing (NLP) that condenses lengthy texts into concise summaries. Despite the success of existing supervised models, they often rely on datasets of well-constructed text pairs, which can be insufficient for languages with limited annotated data, such as Chinese. To address this issue, we propose a semi-supervised learning method for text summarization. Our method is inspired by the cycle-consistent adversarial network (CycleGAN) and considers text summarization as a style transfer task. The model is trained by using a similar procedure and loss function to those of CycleGAN and learns to transfer the style of a document to its summary and vice versa. Our method can be applied to multiple languages, but this paper focuses on its performance on Chinese documents. We trained a T5-based model and evaluated it on two datasets, CSL and LCSTS, and the results demonstrate the effectiveness of the proposed method.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Consistency-Based Semi-supervised Evidential Active Learning for Diagnostic Radiograph Classification
    Balaram, Shafa
    Nguyen, Cuong M.
    Kassim, Ashraf
    Krishnaswamy, Pavitra
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT I, 2022, 13431 : 675 - 685
  • [22] Semi-supervised learning dehazing algorithm based on the OSV model
    Zhu, Lijun
    Wei, Weibo
    Pan, Zhenkuan
    Ji, Lianshun
    Song, Jintao
    Li, Jinhan
    IET IMAGE PROCESSING, 2023, 17 (03) : 872 - 885
  • [23] Attention-based label consistency for semi-supervised deep learning based image classification
    Chen, Jiaming
    Yang, Meng
    Ling, Jie
    NEUROCOMPUTING, 2021, 453 : 731 - 741
  • [24] Semi-supervised Learning Method for Object Detection based on Adjacent Frame Consistency Measurement
    Miao, Yinxiao
    Cheng, Zhonghao
    Zhang, Xiujian
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 6452 - 6457
  • [25] On the consistency of graph-based Bayesian semi-supervised learning and the scalability of sampling algorithms
    Trillos, Nicolas Garcia
    Kaplan, Zachary
    Samakhoana, Thabo
    Sanz-Alonso, Daniel
    JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21
  • [26] Fuzziness based semi-supervised learning approach for intrusion detection system
    Ashfaq, Rana Aamir Raza
    Wang, Xi-Zhao
    Huang, Joshua Zhexue
    Abbas, Haider
    He, Yu-Lin
    INFORMATION SCIENCES, 2017, 378 : 484 - 497
  • [27] Semi-supervised Learning Using Variational Autoencoder - A Cluster Based Approach
    Vengalil, Sunil Kumar
    Sinha, Neelam
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2021, 2024, 13102 : 529 - 536
  • [28] 3D Model Annotation based on Semi-Supervised Learning
    Zhou, Kai
    Tian, Feng
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2014, 14 (08): : 9 - 13
  • [29] Semi-supervised deep learning model based on u-wordMixup
    Tang H.-L.
    Song S.-M.
    Liu X.-Y.
    Dou Q.-S.
    Lu M.-Y.
    Kongzhi yu Juece/Control and Decision, 2023, 38 (06): : 1646 - 1652
  • [30] RL-SSI Model: Adapting a Supervised Learning Approach to a Semi-Supervised Approach for Human Action Recognition
    dos Santos, Lucas Lisboa
    Winkler, Ingrid
    Sperandio Nascimento, Erick Giovani
    ELECTRONICS, 2022, 11 (09)