Multilingual Summarization with Factual Consistency Evaluation

Cited by: 0
Authors
Aharoni, Roee [1]
Narayan, Shashi [2]
Maynez, Joshua [2]
Herzig, Jonathan [1]
Clark, Elizabeth [2]
Lapata, Mirella [2]
Affiliations
[1] Google, Mountain View, CA 94043 USA
[2] Google DeepMind, London, England
Source
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023 | 2023
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Abstractive summarization has enjoyed renewed interest in recent years, thanks to pre-trained language models and the availability of large-scale datasets. Despite promising results, current models still suffer from generating factually inconsistent summaries, reducing their utility for real-world application. Several recent efforts attempt to address this by devising models that automatically detect factual inconsistencies in machine-generated summaries. However, they focus exclusively on English, a language with abundant resources. In this work, we leverage factual consistency evaluation models to improve multilingual summarization. We explore two intuitive approaches to mitigate hallucinations based on the signal provided by a multilingual NLI model, namely data filtering and controlled generation. Experimental results in the 45 languages from the XLSum dataset show gains over strong baselines in both automatic and human evaluation. We release models and human judgements of summaries to foster progress towards more factually consistent multilingual summarization.
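The abstract names two approaches, data filtering and controlled generation, without giving their details in this record. The following is only a minimal sketch of how such approaches are commonly set up, not the authors' implementation. The function names, the control-token strings, the 0.5 threshold, and the word-overlap `toy_score` (a stand-in for a real multilingual NLI model) are all assumptions made for illustration.

```python
from typing import Callable, List, Tuple

Example = Tuple[str, str]          # (document, summary)
ScoreFn = Callable[[str, str], float]  # assumed: returns an entailment-like score in [0, 1]


def filter_by_entailment(
    examples: List[Example], score_fn: ScoreFn, threshold: float = 0.5
) -> List[Example]:
    """Data filtering: keep only pairs whose summary scores at or above the threshold."""
    return [(doc, summ) for doc, summ in examples if score_fn(doc, summ) >= threshold]


def add_control_tokens(
    examples: List[Example], score_fn: ScoreFn, threshold: float = 0.5
) -> List[Example]:
    """Controlled generation: keep all pairs, but prefix each document with a
    (hypothetical) token recording whether its summary looks consistent."""
    tagged = []
    for doc, summ in examples:
        token = "<consistent>" if score_fn(doc, summ) >= threshold else "<inconsistent>"
        tagged.append((f"{token} {doc}", summ))
    return tagged


def toy_score(document: str, summary: str) -> float:
    """Hypothetical stand-in for an NLI model: the fraction of summary words
    that also occur in the document. A real system would call a trained model."""
    doc_words = set(document.lower().split())
    words = summary.lower().split()
    if not words:
        return 0.0
    return sum(w in doc_words for w in words) / len(words)


examples = [
    ("the cat sat on the mat", "the cat sat"),
    ("the cat sat on the mat", "a dog barked loudly"),
]
filtered = filter_by_entailment(examples, toy_score)
tagged = add_control_tokens(examples, toy_score)
```

In this sketch, filtering discards the second pair before training, whereas control tokens keep both pairs and let a model be prompted with the "consistent" token at inference time.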
Pages: 3562-3591
Page count: 30