Sentence salience contrastive learning for abstractive text summarization

Cited by: 6
Authors
Huang, Ying [1, 2]
Li, Zhixin [1, 2]
Chen, Zhenbin [1, 2]
Zhang, Canlong [1, 2]
Ma, Huifang [3]
Affiliations
[1] Guangxi Normal Univ, Key Lab Educ Blockchain & Intelligent Technol, Minist Educ, Guilin 541004, Peoples R China
[2] Guangxi Normal Univ, Guangxi Key Lab Multisource Informat Min & Secur, Guilin 541004, Peoples R China
[3] Northwest Normal Univ, Coll Comp Sci & Engn, Lanzhou 730070, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Contrastive learning; Abstractive text summarization; Semantic similarity; Sentence salience; NETWORKS;
DOI
10.1016/j.neucom.2024.127808
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Text summarization aims to generate a short summary of a document while preserving its salient information. Recently, contrastive learning has been extended from visual representation learning to summarization tasks. Existing contrastive learning methods for summarization focus on modeling the global semantics of source documents, target summaries, and candidate summaries in order to maximize their similarities. However, they ignore the influence of sentence-level semantics within the document. In this paper, we propose a sentence-level salience contrastive learning method that helps the model capture salient information and suppress noise. The model expresses sentence salience through the semantic similarity between the summaries and the sentences of the source document, and integrates the similarity distance into the contrastive loss in the form of soft weights. As a result, our model maximizes the similarity between summaries and salient information while minimizing the similarity between summaries and potential noise. We verify our method on three widely used datasets: CNN/Daily Mail, XSum, and PubMed. The experimental results show that the proposed method significantly improves on the baseline and achieves performance competitive with existing contrastive learning methods.
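The abstract does not give the loss formula, so the following is only an illustrative sketch of how sentence salience could enter a contrastive loss as soft weights. It assumes that salience is a softmax over the cosine similarities between a reference summary embedding and each source sentence embedding, and that the loss is a soft-target cross-entropy over the similarities between a candidate summary and those sentences. The function names, the temperature value, and the toy two-dimensional embeddings are all hypothetical and are not taken from the paper.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def salience_weights(ref_vec, sent_vecs, temperature=0.1):
    """Soft salience weights (assumption): softmax over the similarity
    between the reference summary and each source sentence."""
    sims = [cosine(ref_vec, s) for s in sent_vecs]
    top = max(sims)  # subtract the max for numerical stability
    exps = [math.exp((x - top) / temperature) for x in sims]
    total = sum(exps)
    return [e / total for e in exps]


def salience_contrastive_loss(summary_vec, ref_vec, sent_vecs, temperature=0.1):
    """Soft-target cross-entropy (assumption): sentences with high salience
    act as soft positives, sentences with low salience act as noise."""
    weights = salience_weights(ref_vec, sent_vecs, temperature)
    logits = [cosine(summary_vec, s) / temperature for s in sent_vecs]
    top = max(logits)
    log_z = top + math.log(sum(math.exp(l - top) for l in logits))
    return -sum(w * (l - log_z) for w, l in zip(weights, logits))


# Toy data: sentence 0 is close to the reference summary, sentence 1 is not.
sentences = [[1.0, 0.0], [0.0, 1.0]]
reference = [1.0, 0.1]
loss_salient = salience_contrastive_loss([1.0, 0.0], reference, sentences)
loss_noisy = salience_contrastive_loss([0.0, 1.0], reference, sentences)
print(loss_salient < loss_noisy)
```

In this toy setup, a candidate summary aligned with the salient sentence incurs a lower loss than one aligned with the unrelated sentence, which matches the behavior the abstract describes. The paper's actual formulation may differ.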
Pages: 13
Related papers
50 in total
  • [21] Abstractive Text Summarization for the Urdu Language: Data and Methods
    Awais, Muhammad
    Muhammad Adeel Nawab, Rao
    IEEE ACCESS, 2024, 12 : 61198 - 61210
  • [22] Reinforced Abstractive Text Summarization With Semantic Added Reward
    Jang, Heewon
    Kim, Wooju
    IEEE ACCESS, 2021, 9 : 103804 - 103810
  • [23] Sentence Pair Embeddings Based Evaluation Metric for Abstractive and Extractive Summarization
    Akula, Ramya
    Garibay, Ivan
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 6009 - 6017
  • [24] Question-driven text summarization using an extractive-abstractive framework
    Kia, Mahsa Abazari
    Garifullina, Aygul
    Kern, Mathias
    Chamberlain, Jon
    Jameel, Shoaib
    COMPUTATIONAL INTELLIGENCE, 2024, 40 (03)
  • [25] Joint learning of text alignment and abstractive summarization for long documents via unbalanced optimal transport
    Shen, Xin
    Lam, Wai
    Ma, Shumin
    Wang, Huadong
    NATURAL LANGUAGE ENGINEERING, 2024, 30 (03) : 525 - 553
  • [26] A Novel Deep Learning Attention Based Sequence to Sequence Model for Automatic Abstractive Text Summarization
    Abd Algani Y.M.
    International Journal of Information Technology, 2024, 16 (6) : 3597 - 3603
  • [27] Multi-Encoder Transformer for Korean Abstractive Text Summarization
    Shin, Youhyun
    IEEE ACCESS, 2023, 11 : 48768 - 48782
  • [28] A Survey of Abstractive Text Summarization Utilising Pretrained Language Models
    Syed, Ayesha Ayub
    Gaol, Ford Lumban
    Boediman, Alfred
    Matsuo, Tokuro
    Budiharto, Widodo
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2022, PT I, 2022, 13757 : 532 - 544
  • [29] Attention History-based Attention for Abstractive Text Summarization
    Lee, Hyunsoo
    Choi, YunSeok
    Lee, Jee-Hyong
    PROCEEDINGS OF THE 35TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING (SAC'20), 2020, : 1075 - 1081
  • [30] Indonesian Abstractive Text Summarization Using Bidirectional Gated Recurrent Unit
    Adelia, Rike
    Suyanto, Suyanto
    Wisesty, Untari Novia
    4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMPUTATIONAL INTELLIGENCE (ICCSCI 2019) : ENABLING COLLABORATION TO ESCALATE IMPACT OF RESEARCH RESULTS FOR SOCIETY, 2019, 157 : 581 - 588