Improving abstractive summarization based on dynamic residual network with reinforce dependency

被引:11
|
作者
Liao, Weizhi [1 ]
Ma, Yaheng [1 ]
Yin, Yanchao [2 ]
Ye, Guanglei [1 ]
Zuo, Dongzhou [1 ]
机构
[1] Univ Elect Sci & Technol China, Chengdu, Peoples R China
[2] Kunming Univ Sci & Technol, Kunming, Yunnan, Peoples R China
基金
国家重点研发计划;
关键词
Abstractive summarization; Dynamic residual network; Reinforcement learning agent; Long-term dependencies; One-dimensional convolution; Sequence-to-sequence;
D O I
10.1016/j.neucom.2021.02.028
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Seq2Seq abstract summarization model based on long short-term memory (LSTM) is very effective for short text summarization. However, LSTM is limited by long-term dependencies, which can potentially result in salient information loss when long text is processed by the Seq2Seq model based on LSTM. To overcome the long-term dependence limitation, an encoder-decoder model based on the dynamic residual network is proposed in this work. The model can dynamically select an optimal state from the state history to establish a connection with the current state to improve the LSTM long sequence dependencies according to the current decoding environment. Because the dynamic residual connections will result in long-term connection-dependent words, a new method based on reinforcement learning is proposed to simulate the dependence between words, which is then implemented into the training process of the model. This model is verified using the CNN/Daily Mail and New York Times datasets, and the experimental results show that the proposed model achieves significant improvements in capturing longterm dependencies compared with the traditional LSTM-based Seq2Seq abstractive summarization model.& nbsp; (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:228 / 237
页数:10
相关论文
共 48 条
  • [31] An Abstractive Summarization Model Based on Joint-Attention Mechanism and a Priori Knowledge
    Li, Yuanyuan
    Huang, Yuan
    Huang, Weijian
    Yu, Junhao
    Huang, Zheng
    APPLIED SCIENCES-BASEL, 2023, 13 (07):
  • [32] AI-based abstractive text summarization towards AIoT and edge computing
    Ma, Jun
    Li, Tong
    Zhang, Yanling
    INTERNET TECHNOLOGY LETTERS, 2023, 6 (05)
  • [33] Keyword-based Augmentation Method to Enhance Abstractive Summarization for Legal Documents
    Huyen Nguyen
    Ding, Junhua
    PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND LAW, ICAIL 2023, 2023, : 437 - 441
  • [34] A novel abstractive summarization model based on topic-aware and contrastive learning
    Tang, Huanling
    Li, Ruiquan
    Duan, Wenhao
    Dou, Quansheng
    Lu, Mingyu
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (12) : 5563 - 5577
  • [35] Order-Preserving Abstractive Summarization for Spoken Content Based on Connectionist Temporal Classification
    Lu, Bo-Ru
    Shyu, Frank
    Chen, Yun-Nung
    Lee, Hung-Yi
    Lee, Lin-Shan
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2899 - 2903
  • [36] A Two-Stage Transformer-Based Approach for Variable-Length Abstractive Summarization
    Su, Ming-Hsiang
    Wu, Chung-Hsien
    Cheng, Hao-Tse
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 2061 - 2072
  • [37] Enhancing N-Gram Based Metrics with Semantics for Better Evaluation of Abstractive Text Summarization
    Jia-Wei He
    Wen-Jun Jiang
    Guo-Bang Chen
    Yu-Quan Le
    Xiao-Fei Ding
    Journal of Computer Science and Technology, 2022, 37 : 1118 - 1133
  • [38] Enhancing N-Gram Based Metrics with Semantics for Better Evaluation of Abstractive Text Summarization
    He, Jia-Wei
    Jiang, Wen-Jun
    Chen, Guo-Bang
    Le, Yu-Quan
    Ding, Xiao-Fei
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2022, 37 (05) : 1118 - 1133
  • [39] Integrating Topic-Aware Heterogeneous Graph Neural Network With Transformer Model for Medical Scientific Document Abstractive Summarization
    Khaliq, Ayesha
    Khan, Atif
    Awan, Salman Afsar
    Jan, Salman
    Umair, Muhammad
    Zuhairi, Megat F.
    IEEE ACCESS, 2024, 12 : 113855 - 113866
  • [40] An Abstractive Summarizer Based on Improved Pointer-Generator Network
    Nie, Wenbo
    Zhang, Wei
    Li, Xinle
    Yu, Yao
    2019 34RD YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2019, : 515 - 520