GERE: Generative Evidence Retrieval for Fact Verification

被引：32

作者：

Chen, Jiangui ^{[1
]}

Zhang, Ruqing ^{[1
]}

Guo, Jiafeng ^{[1
]}

Fan, Yixing ^{[1
]}

Cheng, Xueqi ^{[1
]}

机构：

[1] Univ Chinese Acad Sci, CAS, ICT, CAS Key Lab Network Data Sci & Technol, Beijing, Peoples R China

来源：

PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22) | 2022年

基金：

中国国家自然科学基金;

关键词：

Fact Verification; Evidence Retrieval; Generative Retrieval;

D O I：

10.1145/3477495.3531827

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Fact verification (FV) is a challenging task which aims to verify a claim using multiple evidential sentences from trustworthy corpora, e.g., Wikipedia. Most existing approaches follow a three-step pipeline framework, including document retrieval, sentence retrieval and claim verification. High-quality evidences provided by the first two steps are the foundation of the effective reasoning in the last step. Despite being important, high-quality evidences are rarely studied by existing works for FV, which often adopt the off-the-shelf models to retrieve relevant documents and sentences in an "index-retrieve-then-rank" fashion. This classical approach has clear drawbacks as follows: i) a large document index as well as a complicated search process is required, leading to considerable memory and computational overhead; ii) independent scoring paradigms fail to capture the interactions among documents and sentences in ranking; iii) a fixed number of sentences are selected to form the final evidence set. In this work, we propose GERE, the first system that retrieves evidences in a generative fashion, i.e., generating the document titles as well as evidence sentence identifiers. This enables us to mitigate the aforementioned technical issues since: i) the memory and computational cost is greatly reduced because the document index is eliminated and the heavy ranking process is replaced by a light generative process; ii) the dependency between documents and that between sentences could be captured via sequential generation process; iii) the generative formulation allows us to dynamically select a precise set of relevant evidences for each claim. The experimental results on the FEVER dataset show that GERE achieves significant improvements over the state-of-the-art baselines, with both time-efficiency and memory-efficiency.

引用

页码：2184 / 2189

页数：6

共 38 条

[1] Context Attentive Document Ranking and Query Suggestion [J].

Ahmad, Wasi Uddin ;

Chang, Kai-Wei ;

Wang, Hongning .

PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, :385-394

[2]

Chakrabarty Tuhin., 2018, P 1 WORKSHOP FACT EX, P127

[3]

Chernyavskiy A., 2019, Proceedings of the Second Workshop on Fact Extraction and VERification (FEVER), P69, DOI 10.18653/v1/D19-6612

[4]

De Cao Nicola, 2021, 9 INT C LEARN REPR I

[5]

Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171

[6] A probability ranking principle for interactive information retrieval [J].

Fuhr, Norbert .

INFORMATION RETRIEVAL, 2008, 11 (03) :251-265

[7] A Deep Relevance Matching Model for Ad-hoc Retrieval [J].

Guo, Jiafeng ;

Fan, Yixing ;

Ai, Qingyao ;

Croft, W. Bruce .

CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, :55-64

[8]

Hanselowski Andreas., 2018, P FEVER, P103, DOI [10.18653/v1/W18, DOI 10.18653/V1/W18-5516]

[9]

Hidey Christopher., 2018, P 1 WORKSHOP FACT EX, P150

[10]

Karpukhin V, 2020, PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), P6769

← 1 2 3 4 →