T5-Based Model for Abstractive Summarization: A Semi-Supervised Learning Approach with Consistency Loss Functions

被引:3
|
作者
Wang, Mingye [1 ]
Xie, Pan [1 ]
Du, Yao [1 ]
Hu, Xiaohui [2 ]
机构
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
[2] Chinese Acad Sci, Inst Software, Sci & Technol Integrated Informat Syst Lab, Beijing 100045, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 12期
关键词
natural language processing; automatic text summarization; abstractive summarization; semi-supervised learning; consistency loss function;
D O I
10.3390/app13127111
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Text summarization is a prominent task in natural language processing (NLP) that condenses lengthy texts into concise summaries. Despite the success of existing supervised models, they often rely on datasets of well-constructed text pairs, which can be insufficient for languages with limited annotated data, such as Chinese. To address this issue, we propose a semi-supervised learning method for text summarization. Our method is inspired by the cycle-consistent adversarial network (CycleGAN) and considers text summarization as a style transfer task. The model is trained by using a similar procedure and loss function to those of CycleGAN and learns to transfer the style of a document to its summary and vice versa. Our method can be applied to multiple languages, but this paper focuses on its performance on Chinese documents. We trained a T5-based model and evaluated it on two datasets, CSL and LCSTS, and the results demonstrate the effectiveness of the proposed method.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Semi-supervised learning for refining image annotation based on random walk model
    Tian, Dongping
    KNOWLEDGE-BASED SYSTEMS, 2014, 72 : 72 - 80
  • [42] Classification Model on Big Data in Medical Diagnosis Based on Semi-Supervised Learning
    Wang, Lei
    Qian, Qing
    Zhang, Qiang
    Wang, Jishuai
    Cheng, Wenbo
    Yan, Wei
    COMPUTER JOURNAL, 2022, 65 (02) : 177 - 191
  • [43] Semi-Supervised Detection Model Based on Adaptive Ensemble Learning for Medical Images
    Li, Jingchen
    Shi, Haobin
    Chen, Wenbai
    Liu, Naijun
    Hwang, Kao-Shing
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 237 - 248
  • [44] Feature Level Mining of Online Reviews Based on a Semi-Supervised Learning Model
    Wang, Minxi
    Li, Xin
    LISS 2014, 2015, : 709 - 715
  • [45] A Novel Semi-Supervised Learning Model for Smartphone-Based Health Telemonitoring
    Gaw, Nathan
    Li, Jing
    Yoon, Hyunsoo
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (01) : 428 - 441
  • [46] Semi-supervised surface object detection based on multi-view cross-consistency learning
    Feng J.
    Li B.
    Tian L.
    Dong C.
    Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 2023, 55 (04): : 107 - 114
  • [47] A semi-supervised segmentation method for microscopic hyperspectral pathological images based on multi-consistency learning
    Fang, Jinghui
    FRONTIERS IN ONCOLOGY, 2024, 14
  • [48] A Few-Shot Learning Model Based on Semi-Supervised with Pseudo Label
    Yu Y.
    Feng L.
    Wang G.-G.
    Xu Q.-F.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2019, 47 (11): : 2284 - 2291
  • [49] A Generic Semi-Supervised Deep Learning-Based Approach for Automated Surface Inspection
    Zheng, Xiaoqing
    Wang, Hongcheng
    Chen, Jie
    Kong, Yaguang
    Zheng, Song
    IEEE ACCESS, 2020, 8 : 114088 - 114099
  • [50] A novel anomaly detection approach based on ensemble semi-supervised active learning (ADESSA)
    Niu, Zequn
    Guo, Wenjie
    Xue, Jingfeng
    Wang, Yong
    Kong, Zixiao
    Huang, Lu
    COMPUTERS & SECURITY, 2023, 129