A novel abstractive summarization model based on topic-aware and contrastive learning

被引:0
|
作者
Tang, Huanling [1 ,3 ]
Li, Ruiquan [2 ]
Duan, Wenhao [2 ]
Dou, Quansheng [1 ,3 ]
Lu, Mingyu [4 ]
机构
[1] Shandong Technol & Business Univ, Sch Comp Sci & Technol, Yantai 264005, Shandong, Peoples R China
[2] Shandong Technol & Business Univ, Sch Informat & Elect Engn, Yantai 264005, Shandong, Peoples R China
[3] Shandong Coll & Univ Future Intelligent Comp, Coinnovat Ctr, Yantai 264005, Shandong, Peoples R China
[4] Dalian Maritime Univ, Informat Sci & Technol Coll, Dalian 116026, Liaoning, Peoples R China
基金
中国国家自然科学基金;
关键词
Abstractive summarization; Neural topic model; Contrastive learning; Seq2Seq model;
D O I
10.1007/s13042-024-02263-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The majority of abstractive summarization models are designed based on the Sequence-to-Sequence(Seq2Seq) architecture. These models are able to capture syntactic and contextual information between words. However, Seq2Seq-based summarization models tend to overlook global semantic information. Moreover, there exist inconsistency between the objective function and evaluation metrics of this model. To address these limitations, a novel model named ASTCL is proposed in this paper. It integrates the neural topic model into the Seq2Seq framework innovatively, aiming to capture the text's global semantic information and guide the summary generation. Additionally, it incorporates contrastive learning techniques to mitigate the discrepancy between the objective loss and the evaluation metrics through scoring multiple candidate summaries. On CNN/DM XSum and NYT datasets, the experimental results demonstrate that the ASTCL model outperforms the other generic models in summarization task.
引用
收藏
页码:5563 / 5577
页数:15
相关论文
共 50 条
  • [21] ChatGPT based contrastive learning for radiology report summarization
    Luo, Zhenjie
    Jiang, Zuowei
    Wang, Mingyang
    Cai, Xiaoyan
    Gao, Dehong
    Yang, Libin
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 267
  • [22] An Abstractive Summarization Model Based on Joint-Attention Mechanism and a Priori Knowledge
    Li, Yuanyuan
    Huang, Yuan
    Huang, Weijian
    Yu, Junhao
    Huang, Zheng
    APPLIED SCIENCES-BASEL, 2023, 13 (07):
  • [23] TA-BiLSTM: An Interpretable Topic-Aware Model for Misleading Information Detection in Mobile Social Networks
    Shuyu Chang
    Rui Wang
    Haiping Huang
    Jian Luo
    Mobile Networks and Applications, 2021, 26 : 2298 - 2314
  • [24] TA-BiLSTM: An Interpretable Topic-Aware Model for Misleading Information Detection in Mobile Social Networks
    Chang, Shuyu
    Wang, Rui
    Huang, Haiping
    Luo, Jian
    MOBILE NETWORKS & APPLICATIONS, 2021, 26 (06) : 2298 - 2314
  • [25] DCDSum: An interpretable extractive summarization framework based on contrastive learning method
    Zhang, Jiaqi
    Lu, Ling
    Zhang, Liang
    Chen, Yinong
    Liu, Wanping
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [26] T5-Based Model for Abstractive Summarization: A Semi-Supervised Learning Approach with Consistency Loss Functions
    Wang, Mingye
    Xie, Pan
    Du, Yao
    Hu, Xiaohui
    APPLIED SCIENCES-BASEL, 2023, 13 (12):
  • [27] A Novel Contrastive Learning Model for Aerial Images
    Zhen, Taihang
    Chen, Kai
    Gao, Yang
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [28] Structure-aware deep clustering network based on contrastive learning
    Chen, Bowei
    Xu, Sen
    Xu, Heyang
    Bian, Xuesheng
    Guo, Naixuan
    Xu, Xiufang
    Hua, Xiaopeng
    NEURAL NETWORKS, 2023, 167 : 118 - 128
  • [29] A Performance Analysis of Deep-Learning-Based Thai News Abstractive Summarization: Word Positions and Document Length
    Jumpathong, Sawittree
    Theeramunkong, Thanaruk
    Supnithi, Thepchai
    Okumura, Manabu
    2022 7TH INTERNATIONAL CONFERENCE ON BUSINESS AND INDUSTRIAL RESEARCH (ICBIR2022), 2022, : 279 - 284
  • [30] Retinopathy identification in optical coherence tomography images based on a novel class-aware contrastive learning approach
    Li, Yuan
    Huang, Chenxi
    Zheng, Bowen
    Zheng, Zhiyuan
    Tang, Hongying
    Ju, Shenghong
    Xu, Jun
    Luo, Yuemei
    KNOWLEDGE-BASED SYSTEMS, 2025, 310