SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization

被引:0
作者
Liu, Yixin [1 ]
Liu, Pengfei [1 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
来源
ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2 | 2021年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a conceptually simple while empirically powerful framework for abstractive summarization, SIMCLS, which can bridge the gap between the learning objective and evaluation metrics resulting from the currently dominated sequence-to-sequence learning framework by formulating text generation as a reference-free evaluation problem (i.e., quality estimation) assisted by contrastive learning. Experimental results show that, with minor modification over existing top-scoring systems, SimCLS can improve the performance of existing top-performing models by a large margin. Particularly, 2.51 absolute improvement against BART (Lewis et al., 2020) and 2.50 over PEGASUS (Zhang et al., 2020a) w.r.t ROUGE-1 on the CNN/DailyMail dataset, driving the state-of-the-art performance to a new level. We have open-sourced our codes and results: https://github.com/yixinL7/SimCLS. Results of our proposed models have been deployed into EXPLAINABOARD (Liu et al., 2021a) platform, which allows researchers to understand our systems in a more fine-grained way.
引用
收藏
页码:1065 / 1072
页数:8
相关论文
共 36 条
[1]  
Bengio S, 2015, ADV NEUR IN, V28
[2]  
Chen T, 2020, PR MACH LEARN RES, V119
[3]   Learning a similarity metric discriminatively, with application to face verification [J].
Chopra, S ;
Hadsell, R ;
LeCun, Y .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :539-546
[4]  
Dou Zi-Yi, P 2021 C N AM CHAPT, P4830
[5]  
Edunov Sergey, 2018, P 2018 C N AM CHAPT, V1, P355, DOI DOI 10.18653/V1/N18-1033
[6]  
Gekhman Z, 2020, FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020
[7]  
Greensmith E, 2004, J MACH LEARN RES, V5, P1471
[8]  
Hermann KM, 2015, ADV NEUR IN, V28
[9]  
Jain Sarthak, 2020, P 58 ANN M ASS COMPU, P7506, DOI [10.18653/v1/2020.acl-main.670, DOI 10.18653/V1/2020.ACL-MAIN.670]
[10]  
King DB, 2015, ACS SYM SER, V1214, P1, DOI 10.1021/bk-2015-1214.ch001