Enhancing abstractive summarization of implicit datasets with contrastive attention

Cited by: 1

Authors:

Kwon S. [1]

Lee Y. [2]

Affiliations:

[1] Department of Data Science, Seoul National University of Science and Technology, 232, Gongneung-ro, Nowon-gu, Seoul

[2] Department of Industrial Engineering, Seoul National University of Science and Technology, 232, Gongneung-ro, Nowon-gu, Seoul

Source:

Neural Computing and Applications | 2024 / Vol. 36 / Issue 25

Funding:

National Research Foundation, Singapore;

Keywords:

Abstractive summarization; Contrastive attention; Implicit dataset; Text summarization;

DOI:

10.1007/s00521-024-09864-y

Chinese Library Classification (CLC) number:

Subject classification number:

Abstract:

It is important for abstractive summarization models to understand the important parts of the original document and create a natural summary accordingly. Recently, studies have been conducted to incorporate important parts of the original document during learning and have shown good performance. However, these studies are effective for explicit datasets but not implicit datasets which are relatively more abstract. This study addresses the challenge of summarizing implicit datasets, which have a lower deviation in the significance of important sentences compared to explicit datasets. A multi-task learning approach that reflects information about salient and incidental objects during the learning process was proposed. This was achieved by adding a contrastive objective to the fine-tuning process of the encoder-decoder language model. The salient and incidental parts were selected based on the ROUGE-L F1 score and their relationships were learned through triplet loss. The proposed method was evaluated using five benchmark summarization datasets, including two explicit and three implicit. The experimental results showed a greater improvement in implicit datasets, particularly for the highly abstractive XSum dataset, compared to the vanilla fine-tuning method in both the BART-base and T5-small models. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.
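The selection and contrastive steps named in the abstract can be illustrated with a minimal sketch. It assumes sentence-level candidates, whitespace tokenization, Euclidean distance, and a margin of 1.0; none of these details are stated in the abstract, and the function names (`rouge_l_f1`, `pick_salient_and_incidental`, `triplet_loss`) are hypothetical rather than the authors' code.

```python
import math


def lcs_length(a, b):
    """Length of the longest common subsequence of two token lists."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l_f1(candidate, reference):
    """ROUGE-L F1 between two strings (assumption: lowercase whitespace tokens)."""
    c, r = candidate.lower().split(), reference.lower().split()
    lcs = lcs_length(c, r)
    if lcs == 0:
        return 0.0
    precision, recall = lcs / len(c), lcs / len(r)
    return 2 * precision * recall / (precision + recall)


def pick_salient_and_incidental(sentences, summary):
    """Assumption: the highest-scoring sentence is 'salient', the lowest 'incidental'."""
    ranked = sorted(sentences, key=lambda s: rouge_l_f1(s, summary))
    return ranked[-1], ranked[0]


def triplet_loss(anchor, positive, negative, margin=1.0):
    """Standard triplet margin loss on plain vectors (assumption: Euclidean distance)."""
    d_pos = math.dist(anchor, positive)
    d_neg = math.dist(anchor, negative)
    return max(0.0, d_pos - d_neg + margin)


if __name__ == "__main__":
    doc = [
        "the cat sat on the mat",
        "stocks fell sharply on monday",
        "a dog barked",
    ]
    salient, incidental = pick_salient_and_incidental(doc, "cat sat on mat")
    print(salient, "|", incidental)
    # Toy 2-D vectors stand in for model representations.
    print(triplet_loss([0.0, 0.0], [0.0, 1.0], [3.0, 4.0]))
```

In a fine-tuning setup, the vectors passed to `triplet_loss` would come from the encoder-decoder model and the loss would be added to the usual summarization objective; how the paper weights the two terms is not given in the abstract.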

Pages: 15337-15351

Number of pages: 14
