Deep reinforcement learning for extractive document summarization

被引:45
作者
Yao, Kaichun [1 ]
Zhang, Libo [2 ]
Luo, Tiejian [1 ]
Wu, Yanjun [2 ]
机构
[1] Univ Chinese Acad Sci, Beijing 101408, Peoples R China
[2] Chinese Acad Sci, Inst Software, Beijing 100190, Peoples R China
关键词
DQN; Extractive summarization; Hierarchical architecture; Rouge metric;
D O I
10.1016/j.neucom.2018.01.020
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a novel extractive document summarization approach based on a Deep Q-Network (DQN), which can model salience and redundancy of sentences in the Q-value approximation and learn a policy that maximize the Rouge score with respect to gold summaries. We design two hierarchical network architectures to not only generate informative features from the document to represent the states of DQN, but also create a list of potential actions from sentences in the document for the DQN. At training time, our model is directly trained on reference summaries generated by human, eliminating the need for sentence-level extractive labels. For testing, we evaluate this model on the CNN/Daily corpus, the DUC 2002 dataset and the DUC 2004 dataset using Rouge metric. Our experiments show that our approach achieves performance which is better than or comparable to state-of-the-art models on these corpora without any access to linguistic annotation. This is the first time DQN has been applied to extractive summarization tasks. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:52 / 62
页数:11
相关论文
共 50 条
  • [41] The Combination of Similarity Measures for Extractive Summarization
    Hy Nguyen
    Tung Le
    Viet-Thang Luong
    Minh-Quoc Nghiem
    Dien Dinh
    [J]. PROCEEDINGS OF THE SEVENTH SYMPOSIUM ON INFORMATION AND COMMUNICATION TECHNOLOGY (SOICT 2016), 2016, : 66 - 72
  • [42] Extractive Summarization Based on Quadratic Check
    Cheng, Yanfang
    Lu, Yinan
    [J]. PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE2019), 2019,
  • [43] Interactive Document Summarization
    Said, Raoufdine
    Guille, Adrien
    [J]. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT V, 2024, 14612 : 177 - 181
  • [44] Extractive text summarization for biomedical transcripts using deep dense LSTM-CNN framework
    Bedi, Parminder Pal Singh
    Bala, Manju
    Sharma, Kapil
    [J]. EXPERT SYSTEMS, 2024, 41 (07)
  • [45] Auto-regressive extractive summarization with replacement
    Zhu, Tianyu
    Hua, Wen
    Qu, Jianfeng
    Hosseini, Saeid
    Zhou, Xiaofang
    [J]. WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2023, 26 (04): : 2003 - 2026
  • [46] Central Embeddings for Extractive Summarization Based on Similarity
    Gutierrez-Hinojosa, Sandra J.
    Calvo, Hiram
    Moreno-Armendariz, Marco A.
    [J]. COMPUTACION Y SISTEMAS, 2019, 23 (03): : 649 - 663
  • [47] WHICH EXTRACTIVE SUMMARIZATION METHOD FOR MALAY TEXTS?
    Ranaivo-Malancon, Bali
    Iboi, Hazimah
    [J]. PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON COMPUTING AND INFORMATICS: EMBRACING ECO-FRIENDLY COMPUTING, 2017, : 577 - +
  • [48] Extractive Summarization under Strict Length Constraints
    Mehdad, Yashar
    Thadani, Kapil
    Radev, Dragomir
    Stent, Amanda
    Billawala, Youssef
    Buchner, Karolina
    [J]. LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, : 3089 - 3093
  • [49] Auto-regressive extractive summarization with replacement
    Tianyu Zhu
    Wen Hua
    Jianfeng Qu
    Saeid Hosseini
    Xiaofang Zhou
    [J]. World Wide Web, 2023, 26 : 2003 - 2026
  • [50] A novel extractive multi-document text summarization system using quantum-inspired genetic algorithm: MTSQIGA
    Mojrian, Mohammad
    Mirroshandel, Seyed Abolghasem
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 171