Multilingual Summarization with Factual Consistency Evaluation

Cited by: 0

Authors:

Aharoni, Roee [1]

Narayan, Shashi [2]

Maynez, Joshua [2]

Herzig, Jonathan [1]

Clark, Elizabeth [2]

Lapata, Mirella [2]

Affiliations:

[1] Google, Mountain View, CA 94043 USA

[2] Google DeepMind, London, England

Source:

FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023 | 2023

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Abstractive summarization has enjoyed renewed interest in recent years, thanks to pre-trained language models and the availability of large-scale datasets. Despite promising results, current models still suffer from generating factually inconsistent summaries, reducing their utility for real-world applications. Several recent efforts attempt to address this by devising models that automatically detect factual inconsistencies in machine-generated summaries. However, they focus exclusively on English, a language with abundant resources. In this work, we leverage factual consistency evaluation models to improve multilingual summarization. We explore two intuitive approaches to mitigate hallucinations based on the signal provided by a multilingual NLI model, namely data filtering and controlled generation. Experimental results in the 45 languages from the XLSum dataset show gains over strong baselines in both automatic and human evaluation. We release models and human judgements of summaries to foster progress towards more factually consistent multilingual summarization.

Pages: 3562-3591

Page count: 30
