Fine-grained Factual Consistency Assessment for Abstractive Summarization Models

被引:0
作者
Zhang, Sen [1 ]
Niu, Jianwei [1 ]
Wei, Chuyuan [2 ]
机构
[1] Beihang Univ, Sch Comp Sci & Engn, State Key Lab Virtual Real Technol & Syst, Beijing, Peoples R China
[2] Beijing Univ Civil Engn & Architecture, Sch Elect & Informat Engn, Beijing, Peoples R China
来源
2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021) | 2021年
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Factual inconsistencies existed in the output of abstractive summarization models with original documents are frequently presented. Fact consistency assessment requires the reasoning capability to find subtle clues to identify whether a model-generated summary is consistent with the original document. This paper proposes a fine-grained two-stage Fact Consistency assessment framework for Summarization models (SumFC). Given a document and a summary sentence, in the first stage, SumFC selects the top-K most relevant sentences with the summary sentence from the document. In the second stage, the model performs fine-grained consistency reasoning at the sentence level, and then aggregates all sentences' consistency scores to obtain the final assessment result. We get the training data pairs by data synthesis and adopt contrastive loss of data pairs to help the model identify subtle cues. Experiment results show that SumFC has made a significant improvement over the previous state-of-the-art methods. Our experiments also indicate that SumFC distinguishes detailed differences better.
引用
收藏
页码:107 / 116
页数:10
相关论文
共 32 条
  • [11] Gulcehre C., 2016, PROC 20 SIGNLL C COM, P280, DOI [10.18653/v1/k16-1028, 10.18653/v1/K16-1028, DOI 10.18653/V1/K16-1028]
  • [12] Gunel Beliz, 2019, P WORKSH KNOWL REPR, V32
  • [13] Kryscinski W, 2020, PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), P9332
  • [14] Lavie A., 2007, P 2 WORKSHOP STAT MA, P228
  • [15] Lebanoff L, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P2175
  • [16] Li H., 2018, P 27 INT C COMP LING, P1430
  • [17] Lin C.-Y., 2002, Proceedings of the ACL-02 Workshop on Automatic Summarization, V4, P45, DOI [DOI 10.3115/1118162.1118168, 10.3115/1118162.1118168]
  • [18] Liu Y, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P3730
  • [19] Matsumaru K, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), P1335
  • [20] Maynez J, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), P1906