Auto-regressive extractive summarization with replacement

被引：3

作者：

Zhu, Tianyu ^{[1
]}

Hua, Wen ^{[1
]}

Qu, Jianfeng ^{[2
]}

Hosseini, Saeid ^{[3
]}

Zhou, Xiaofang ^{[4
]}

机构：

[1] Univ Queensland, Sch ITEE, Brisbane, Australia

[2] Soochow Univ, Suzhou, Peoples R China

[3] Sohar Univ, Fac Comp & IT, Sohar, Oman

[4] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong, Peoples R China

来源：

WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS | 2023年 / 26卷 / 04期

基金：

澳大利亚研究理事会;

关键词：

Extractive summarization; Auto-regressive model; Partial extraction discrepancy; Lead bias;

D O I：

10.1007/s11280-022-01108-0

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Auto-regressive extractive summarization approaches determine sentence extraction probability conditioning on previous decisions by maintaining a partial summary representation. Despite its popularity, the framework has two main drawbacks: 1) the partial summary representation is irresolutely denoted by a weighted summation of all the processed sentences without any filtering, resulting in a noisy representation and degrading the effectiveness of extracting subsequent sentences; 2) earlier sentences are biased towards a higher extraction probability due to the sequential nature of sequence tagging. To address these two problems, we propose the Auto-regressive Extractive Summarization with Replacement (AES-Rep), a novel auto-regressive extractive summarization model. In particular, the AES-Rep model consists of two main modules: the extraction decision module that determines whether a sentence should be extracted, and the replacement locater module that enables extracted deficient sentences to be replaced with latter sentences by comparing their expressiveness with respect to the main idea of the document. These modules update the partial summary with explicit actions using elaborated multidimensional guidance. We conduct extensive experiments on the benchmark CNN and DailyMail datasets. Experimental results show that AES-Rep can achieve better performance compared with various strong baselines in terms of multiple ROUGE metrics.

引用

页码：2003 / 2026

页数：24

共 46 条

[1] Allahyari M, 2017, INT J ADV COMPUT SC, V8, P397, DOI 10.14569/IJACSA.2017.081052
[2] Bae S., 2019, PROC 2 WORKSHOP NEW, P10
[3] Bahdanau D., 2015, ICLP
[4] Cao Z., 2018, AAAI, V32
[5] Celikyilmaz A., 2018, P 2018 C N AM CHAPT, P1662, DOI DOI 10.18653/V1/N18-1150
[6] Chen YC, 2018, PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, P675
[7] Cheng JP, 2016, PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, P484
[8] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[9] Dong Y, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P3739
[10] Gehrmann S, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P4098

← 1 2 3 4 5 →