SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization

Cited by: 0

Authors:

Liu, Yixin [1]

Liu, Pengfei [1]

Affiliations:

[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA

Source:

ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2 | 2021

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In this paper, we present a conceptually simple yet empirically powerful framework for abstractive summarization, SimCLS, which can bridge the gap between the learning objective and evaluation metrics that results from the currently dominant sequence-to-sequence learning framework by formulating text generation as a reference-free evaluation problem (i.e., quality estimation) assisted by contrastive learning. Experimental results show that, with minor modification over existing top-scoring systems, SimCLS can improve the performance of existing top-performing models by a large margin. In particular, it achieves a 2.51 absolute improvement over BART (Lewis et al., 2020) and 2.50 over PEGASUS (Zhang et al., 2020a) w.r.t. ROUGE-1 on the CNN/DailyMail dataset, driving the state-of-the-art performance to a new level. We have open-sourced our code and results: https://github.com/yixinL7/SimCLS. Results of our proposed models have been deployed on the ExplainaBoard (Liu et al., 2021a) platform, which allows researchers to understand our systems in a more fine-grained way.


Pages: 1065-1072

Page count: 8

References (36 in total):

[1]

Bengio S, 2015, ADV NEUR IN, V28

[2]

Chen T, 2020, PR MACH LEARN RES, V119

[3] Learning a similarity metric discriminatively, with application to face verification [J].

Chopra, S ;

Hadsell, R ;

LeCun, Y .

2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :539-546

[4]

Dou Zi-Yi, P 2021 C N AM CHAPT, P4830

[5]

Edunov Sergey, 2018, P 2018 C N AM CHAPT, V1, P355, DOI DOI 10.18653/V1/N18-1033

[6]

Gekhman Z, 2020, FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020

[7]

Greensmith E, 2004, J MACH LEARN RES, V5, P1471

[8]

Hermann KM, 2015, ADV NEUR IN, V28

[9]

Jain Sarthak, 2020, P 58 ANN M ASS COMPU, P7506, DOI [10.18653/v1/2020.acl-main.670, DOI 10.18653/V1/2020.ACL-MAIN.670]

[10]

King DB, 2015, ACS SYM SER, V1214, P1, DOI 10.1021/bk-2015-1214.ch001
