GERE: Generative Evidence Retrieval for Fact Verification

被引:32
作者
Chen, Jiangui [1 ]
Zhang, Ruqing [1 ]
Guo, Jiafeng [1 ]
Fan, Yixing [1 ]
Cheng, Xueqi [1 ]
机构
[1] Univ Chinese Acad Sci, CAS, ICT, CAS Key Lab Network Data Sci & Technol, Beijing, Peoples R China
来源
PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22) | 2022年
基金
中国国家自然科学基金;
关键词
Fact Verification; Evidence Retrieval; Generative Retrieval;
D O I
10.1145/3477495.3531827
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fact verification (FV) is a challenging task which aims to verify a claim using multiple evidential sentences from trustworthy corpora, e.g., Wikipedia. Most existing approaches follow a three-step pipeline framework, including document retrieval, sentence retrieval and claim verification. High-quality evidences provided by the first two steps are the foundation of the effective reasoning in the last step. Despite being important, high-quality evidences are rarely studied by existing works for FV, which often adopt the off-the-shelf models to retrieve relevant documents and sentences in an "index-retrieve-then-rank" fashion. This classical approach has clear drawbacks as follows: i) a large document index as well as a complicated search process is required, leading to considerable memory and computational overhead; ii) independent scoring paradigms fail to capture the interactions among documents and sentences in ranking; iii) a fixed number of sentences are selected to form the final evidence set. In this work, we propose GERE, the first system that retrieves evidences in a generative fashion, i.e., generating the document titles as well as evidence sentence identifiers. This enables us to mitigate the aforementioned technical issues since: i) the memory and computational cost is greatly reduced because the document index is eliminated and the heavy ranking process is replaced by a light generative process; ii) the dependency between documents and that between sentences could be captured via sequential generation process; iii) the generative formulation allows us to dynamically select a precise set of relevant evidences for each claim. The experimental results on the FEVER dataset show that GERE achieves significant improvements over the state-of-the-art baselines, with both time-efficiency and memory-efficiency.
引用
收藏
页码:2184 / 2189
页数:6
相关论文
共 38 条
[1]   Context Attentive Document Ranking and Query Suggestion [J].
Ahmad, Wasi Uddin ;
Chang, Kai-Wei ;
Wang, Hongning .
PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, :385-394
[2]  
Chakrabarty Tuhin., 2018, P 1 WORKSHOP FACT EX, P127
[3]  
Chernyavskiy A., 2019, Proceedings of the Second Workshop on Fact Extraction and VERification (FEVER), P69, DOI 10.18653/v1/D19-6612
[4]  
De Cao Nicola, 2021, 9 INT C LEARN REPR I
[5]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[6]   A probability ranking principle for interactive information retrieval [J].
Fuhr, Norbert .
INFORMATION RETRIEVAL, 2008, 11 (03) :251-265
[7]   A Deep Relevance Matching Model for Ad-hoc Retrieval [J].
Guo, Jiafeng ;
Fan, Yixing ;
Ai, Qingyao ;
Croft, W. Bruce .
CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, :55-64
[8]  
Hanselowski Andreas., 2018, P FEVER, P103, DOI [10.18653/v1/W18, DOI 10.18653/V1/W18-5516]
[9]  
Hidey Christopher., 2018, P 1 WORKSHOP FACT EX, P150
[10]  
Karpukhin V, 2020, PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), P6769