Deep reinforcement learning for extractive document summarization

Cited by: 45
Authors
Yao, Kaichun [1]
Zhang, Libo [2]
Luo, Tiejian [1]
Wu, Yanjun [2]
Affiliations
[1] Univ Chinese Acad Sci, Beijing 101408, Peoples R China
[2] Chinese Acad Sci, Inst Software, Beijing 100190, Peoples R China
Keywords
DQN; Extractive summarization; Hierarchical architecture; ROUGE metric
DOI
10.1016/j.neucom.2018.01.020
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present a novel extractive document summarization approach based on a Deep Q-Network (DQN), which models the salience and redundancy of sentences in the Q-value approximation and learns a policy that maximizes the ROUGE score with respect to gold summaries. We design two hierarchical network architectures that both generate informative features from the document to represent the states of the DQN and create a list of candidate actions from the sentences in the document. At training time, the model is trained directly on human-written reference summaries, eliminating the need for sentence-level extractive labels. For testing, we evaluate the model on the CNN/Daily Mail corpus, the DUC 2002 dataset, and the DUC 2004 dataset using the ROUGE metric. Our experiments show that the approach performs better than or comparably to state-of-the-art models on these corpora without any access to linguistic annotation. This is the first time DQN has been applied to extractive summarization tasks. © 2018 Elsevier B.V. All rights reserved.
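The abstract frames sentence selection as a sequence of actions rewarded by ROUGE against a reference summary. The following is a minimal sketch of that framing, not the authors' implementation: the names `rouge1_recall`, `step_reward`, and `greedy_extract` are hypothetical, ROUGE is reduced to unigram recall, and the learned Q-network is replaced by an oracle that looks at the reference directly.

```python
from collections import Counter


def rouge1_recall(candidate_tokens, reference_tokens):
    """Simplified ROUGE-1 recall: clipped unigram overlap / reference length."""
    if not reference_tokens:
        return 0.0
    cand = Counter(candidate_tokens)
    ref = Counter(reference_tokens)
    overlap = sum(min(cand[tok], count) for tok, count in ref.items())
    return overlap / len(reference_tokens)


def step_reward(summary, sentence, reference_tokens):
    """Assumed reward: gain in ROUGE-1 recall from adding one sentence."""
    before = [tok for sent in summary for tok in sent]
    after = before + list(sentence)
    return rouge1_recall(after, reference_tokens) - rouge1_recall(before, reference_tokens)


def greedy_extract(sentences, q_value, budget):
    """Pick up to `budget` sentence indices, each time taking the highest-scoring action.

    `q_value(summary, sentence)` stands in for a learned Q(state, action).
    """
    chosen = []
    remaining = list(range(len(sentences)))
    while remaining and len(chosen) < budget:
        summary = [sentences[i] for i in chosen]
        best = max(remaining, key=lambda i: q_value(summary, sentences[i]))
        chosen.append(best)
        remaining.remove(best)
    return chosen


# Toy document (tokenized sentences) and reference summary; all data is made up.
sentences = [["the", "cat", "sat"], ["dogs", "bark"], ["the", "cat", "slept"]]
reference = ["the", "cat", "sat", "on", "the", "mat"]


def oracle_q(summary, sentence):
    # Oracle stand-in for the Q-network: uses the reference, which a
    # trained model would not have at test time.
    return step_reward(summary, sentence, reference)


selected = greedy_extract(sentences, oracle_q, budget=2)
```

Because the reward is a marginal gain, a sentence that repeats words already covered by the summary earns less than it would on its own, which is one simple way salience and redundancy can both enter the score.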
Pages: 52-62
Number of pages: 11
Related papers
50 in total
  • [21] MAGNeto: An Efficient Deep Learning Method for the Extractive Tags Summarization Problem
    Hieu Trong Phung
    Anh Tuan Vu
    Tung Dinh Nguyen
    Lam Thanh Do
    Giang Nam Ngo
    Trung Thanh Tran
    Le, Ngoc C.
    PROCEEDINGS OF SEVENTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2022, VOL 1, 2023, 447 : 297 - 309
  • [22] Deep Differential Amplifier for Extractive Summarization
    Jia, Ruipeng
    Cao, Yanan
    Fang, Fang
    Zhou, Yuchen
    Fang, Zheng
    Liu, Yanbing
    Wang, Shi
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 366 - 376
  • [23] Supervised weight learning-based PSO framework for single document extractive summarization
    Singh, Sangita
    Singh, Jyoti Prakash
    Deepak, Akshay
    APPLIED SOFT COMPUTING, 2024, 161
  • [24] Enhanced Genetic Algorithm for Single Document Extractive Summarization
    Bui Thi Mai Anh
    Nguyen Tra My
    Nguyen Thi Thu Trang
    SOICT 2019: PROCEEDINGS OF THE TENTH INTERNATIONAL SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGY, 2019, : 370 - 376
  • [25] Automatic Extractive Single Document Summarization: A Systematic Mapping
    Yip-Herrera, Juan-David
    Mendoza-Becerra, Martha-Eliana
    Rodriguez, Francisco-Javier
    REVISTA FACULTAD DE INGENIERIA, UNIVERSIDAD PEDAGOGICA Y TECNOLOGICA DE COLOMBIA, 2023, 32 (63):
  • [26] A BERT based single document extractive summarization model
    Liu, Wei
    Song, Pei-Ran
    Jiao, Rui-Li
    Journal of Computers (Taiwan), 2020, 31 (02) : 241 - 249
  • [27] Extractive Document Summarization Based on Convolutional Neural Networks
    Zhang, Yong
    Er, Meng Joo
    Pratama, Mahardhika
    PROCEEDINGS OF THE IECON 2016 - 42ND ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2016, : 918 - 922
  • [28] AUTOMATIC EXTRACTIVE AND GENERIC DOCUMENT SUMMARIZATION BASED ON NMF
    Aghdam, Mehdi Hosseinzadeh
    JOURNAL OF ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING RESEARCH, 2023, 13 (01) : 37 - 49
  • [29] Deep Learning in the Domain of Multi-Document Text Summarization
    Roul, Rajendra Kumar
    Sahoo, Jajati Keshari
    Goel, Rohan
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2017, 2017, 10597 : 575 - 581
  • [30] Reinforcement Replaces Supervision: Query focused Summarization using Deep Reinforcement Learning
    Nath, Swaroop
    Bhattacharyya, Pushpak
    Khadilkar, Harshad
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 15770 - 15789