Hallucinated but Factual! Inspecting the Factuality of Hallucinations in Abstractive Summarization

被引:0
作者
Cao, Meng [1 ]
Dong, Yue
Cheung, Jackie Chi Kit
机构
[1] McGill Univ, Sch Comp Sci, Montreal, PQ, Canada
来源
PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS) | 2022年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
State-of-the-art abstractive summarization systems often generate hallucinations; i.e., content that is not directly inferable from the source text. Despite being assumed incorrect, we find that much hallucinated content is factual, namely consistent with world knowledge. These factual hallucinations can be beneficial in a summary by providing useful background information. In this work, we propose a novel detection approach that separates factual from non-factual hallucinations of entities. Our method utilizes an entity's prior and posterior probabilities according to pre-trained and fine-tuned masked language models, respectively. Empirical results suggest that our approach outperforms five baselines and strongly correlates with human judgments. Furthermore, we show that our detector, when used as a reward signal in an off-line reinforcement learning (RL) algorithm, significantly improves the factuality of summaries while maintaining the level of abstractiveness.(1)
引用
收藏
页码:3340 / 3354
页数:15
相关论文
共 29 条
[1]  
Boski M, 2017, 2017 10TH INTERNATIONAL WORKSHOP ON MULTIDIMENSIONAL (ND) SYSTEMS (NDS)
[2]  
Cao M, 2020, PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), P6251
[3]  
Dong Y, 2020, PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), P9320
[4]  
Durmus Esin, 2020, P 58 ANN M ASS COMP, P5055
[5]  
Fillippova K, 2020, FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, P864
[6]  
Goyal Tanya., 2020, FINDINGS ACL EMNLP, P3592
[7]  
Hermann Karl Moritz, 2015, Advances in Neural Information Processing Systems, V28
[8]  
Honnibal M, 2017, spaCy 2: Natural Language Understanding With Bloom Embeddings, Convolutional Neural Networks and Incremental Parsing., DOI DOI 10.3233/978-1-60750-588-4-1080
[9]  
Kang D, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), P718
[10]  
Kingma D. P., 2015, INT C LEARN REPR