Enhancing abstractive summarization of implicit datasets with contrastive attention

Cited by: 1
Authors
Kwon S. [1]
Lee Y. [2]
Affiliations
[1] Department of Data Science, Seoul National University of Science and Technology, 232, Gongneung-ro, Nowon-gu, Seoul
[2] Department of Industrial Engineering, Seoul National University of Science and Technology, 232, Gongneung-ro, Nowon-gu, Seoul
Funding
National Research Foundation, Singapore
Keywords
Abstractive summarization; Contrastive attention; Implicit dataset; Text summarization
DOI
10.1007/s00521-024-09864-y
Abstract
Abstractive summarization models must identify the important parts of the source document and generate a natural summary from them. Recent studies that incorporate these important parts during training have shown good performance. However, these approaches are effective for explicit datasets but not for implicit datasets, which are relatively more abstract. This study addresses the challenge of summarizing implicit datasets, in which the significance of important sentences deviates less than in explicit datasets. It proposes a multi-task learning approach that reflects information about salient and incidental parts of the document during training, achieved by adding a contrastive objective to the fine-tuning of an encoder-decoder language model. The salient and incidental parts are selected based on the ROUGE-L F1 score, and their relationships are learned through a triplet loss. The proposed method was evaluated on five benchmark summarization datasets, two explicit and three implicit. Compared with vanilla fine-tuning, it yielded greater improvements on the implicit datasets, particularly the highly abstractive XSum dataset, for both the BART-base and T5-small models. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.
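The selection and contrastive steps named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the granularity of the "parts", the tokenization, the distance function, or the margin. The sketch assumes sentence-level parts, whitespace tokenization, Euclidean distance, and a margin of 1.0, and every function name is hypothetical.

```python
from math import sqrt


def lcs_length(a, b):
    """Length of the longest common subsequence of two token lists."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l_f1(candidate, reference):
    """ROUGE-L F1 between two strings (assumption: whitespace tokens)."""
    c, r = candidate.lower().split(), reference.lower().split()
    lcs = lcs_length(c, r)
    if lcs == 0:
        return 0.0
    precision, recall = lcs / len(c), lcs / len(r)
    return 2 * precision * recall / (precision + recall)


def pick_salient_and_incidental(sentences, summary):
    """Return the highest- and lowest-scoring source sentences.

    Assumption: "salient" is the sentence with the highest ROUGE-L F1
    against the reference summary, "incidental" the one with the lowest.
    """
    ranked = sorted(sentences, key=lambda s: rouge_l_f1(s, summary))
    return ranked[-1], ranked[0]


def triplet_loss(anchor, positive, negative, margin=1.0):
    """Triplet margin loss: max(0, d(a, p) - d(a, n) + margin)."""
    def dist(u, v):
        return sqrt(sum((x - y) ** 2 for x, y in zip(u, v)))
    return max(0.0, dist(anchor, positive) - dist(anchor, negative) + margin)


# Toy usage with made-up sentences and hand-written 2-d "representations".
sentences = [
    "the cat sat on the mat",
    "stocks fell sharply on monday",
    "a cat was sitting on a mat",
]
summary = "the cat sat on a mat"
salient, incidental = pick_salient_and_incidental(sentences, summary)
loss = triplet_loss([0.0, 0.0], [0.0, 1.0], [3.0, 4.0])
```

In a real fine-tuning run the vectors passed to `triplet_loss` would come from the encoder-decoder model, and the result would be added to the usual summarization loss with some weighting; neither detail is given in the abstract.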
Pages: 15337-15351
Number of pages: 14