Incorporating word attention with convolutional neural networks for abstractive summarization

被引:0
|
作者
Chengzhe Yuan
Zhifeng Bao
Mark Sanderson
Yong Tang
机构
[1] South China Normal University,School of Computer Science
[2] RMIT University,School of Science, Computer Science and Information Technology
来源
World Wide Web | 2020年 / 23卷
关键词
Abstractive summarization; Word attention; Convolutional neural networks; Sequence-to-sequence model;
D O I
暂无
中图分类号
学科分类号
摘要
Neural sequence-to-sequence (seq2seq) models have been widely used in abstractive summarization tasks. One of the challenges of this task is redundant contents in the input document often confuses the models and leads to poor performance. An efficient way to solve this problem is to select salient information from the input document. In this paper, we propose an approach that incorporates word attention with multilayer convolutional neural networks (CNNs) to extend a standard seq2seq model for abstractive summarization. First, by concentrating on a subset of source words during encoding an input sentence, word attention is able to extract informative keywords in the input, which gives us the ability to interpret generated summaries. Second, these keywords are further distilled by multilayer CNNs to capture the coarse-grained contextual features of the input sentence. Thus, the combined word attention and multilayer CNNs modules provide a better-learned representation of the input document, which helps the model generate interpretable, coherent and informative summaries in an abstractive summarization task. We evaluate the effectiveness of our model on the English Gigaword, DUC2004 and Chinese summarization dataset LCSTS. Experimental results show the effectiveness of our approach.
引用
收藏
页码:267 / 287
页数:20
相关论文
共 50 条
  • [41] Arrhythmia Detection Using Convolutional Neural Networks with Temporal Attention Mechanism
    Zubair, Muhammad
    Woo, Sungpil
    Lim, Sunhwan
    Park, Chan-Won
    12TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE (ICTC 2021): BEYOND THE PANDEMIC ERA WITH ICT CONVERGENCE INNOVATION, 2021, : 1101 - 1103
  • [42] A Novel Deep Learning Attention Based Sequence to Sequence Model for Automatic Abstractive Text Summarization
    Abd Algani Y.M.
    International Journal of Information Technology, 2024, 16 (6) : 3597 - 3603
  • [43] DermaKNet: Incorporating the Knowledge of Dermatologists to Convolutional Neural Networks for Skin Lesion Diagnosis
    Gonzalez-Diaz, Ivan
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2019, 23 (02) : 547 - 559
  • [44] Text Classification with Topic-based Word Embedding and Convolutional Neural Networks
    Xu, Haotian
    Dong, Ming
    Zhu, Dongxiao
    Kotov, Alexander
    Carcone, April Idalski
    Naar-King, Sylvie
    PROCEEDINGS OF THE 7TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, 2016, : 88 - 97
  • [45] Word Sense Disambiguation Based on Semi-Supervised Convolutional Neural Networks
    Zhang C.
    Tang L.
    Gao X.
    Xinan Jiaotong Daxue Xuebao/Journal of Southwest Jiaotong University, 2022, 57 (01): : 11 - 17and27
  • [46] Handwritten Word Image Categorization with Convolutional Neural Networks and Spatial Pyramid Pooling
    Ignacio Toledo, J.
    Sudholt, Sebastian
    Fornes, Alicia
    Cucurull, Jordi
    Fink, Gernot A.
    Llados, Josep
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2016, 2016, 10029 : 543 - 552
  • [47] A study on word vector dimensions for sentence classifications using convolutional neural networks
    Takuya S.
    Satoshi Y.
    IEEJ Transactions on Electronics, Information and Systems, 2019, 139 (09) : 1066 - 1079
  • [48] A Self-Attention-Based Deep Convolutional Neural Networks for IIoT Networks Intrusion Detection
    Alshehri, Mohammed S.
    Saidani, Oumaima
    Alrayes, Fatma S.
    Abbasi, Saadullah Farooq
    Ahmad, Jawad
    IEEE ACCESS, 2024, 12 : 45762 - 45772
  • [49] Abstractive Summarization of Text Document in Malayalam Language: Enhancing Attention Model Using POS Tagging Feature
    Nambiar, Sindhya K.
    Peter, David S.
    Idicula, Sumam Mary
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (02)
  • [50] Image manipulation localization algorithm based on channel attention convolutional neural networks
    Zhong H.
    Kang H.
    Lyu Y.-D.
    Li Z.-J.
    Li H.
    Ouyang R.-C.
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2021, 51 (05): : 1838 - 1844