On the Role of Relevance in Natural Language Processing Tasks

被引：6

作者：

Sauchuk, Artsiom ^{[1
]}

Thorne, James ^{[2
]}

Halevy, Alon ^{[3
]}

Tonellotto, Nicola ^{[4
]}

Silvestri, Fabrizio ^{[1
]}

机构：

[1] Sapienza Univ Rome, Rome, Italy

[2] Univ Cambridge, Cambridge, England

[3] Meta AI, Menlo Pk, CA USA

[4] Univ Pisa, Pisa, Italy

来源：

PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22) | 2022年

基金：

欧盟地平线“2020”;

关键词：

NLP; IR; relevance; neural databases; effectiveness;

D O I：

10.1145/3477495.3532034

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Many recent Natural Language Processing (NLP) task formulations, such as question answering and fact verification, are implemented as a two-stage cascading architecture. In the first stage an IR system retrieves "relevant" documents containing the knowledge, and in the second stage an NLP system performs reasoning to solve the task. Optimizing the IR system for retrieving relevant documents ensures that the NLP system has sufficient information to operate over. These recent NLP task formulations raise interesting and exciting challenges for IR, where the end-user of an IR system is not a human with an information need, but another system exploiting the documents retrieved by the IR system to perform reasoning and address the user information need. Among these challenges, as we will show, is that noise from the IR system, such as retrieving spurious or irrelevant documents, can negatively impact the accuracy of the downstream reasoning module. Hence, there is the need to balance maximizing relevance while minimizing noise in the IR system. This paper presents experimental results on two NLP tasks implemented as a two-stage cascading architecture. We show how spurious or irrelevant retrieved results from the first stage can induce errors in the second stage. We use these results to ground our discussion of the research challenges that the IR community should address in the context of these knowledge-intensive NLP tasks.

引用

页码：1785 / 1789

页数：5

共 21 条

[1] Andor Daniel, 2019, P 2019 C EMPIRICAL M, P5949, DOI 10.18653/v1/D19-1609
[2] Reading Wikipedia to Answer Open-Domain Questions
Chen, Danqi
Fisch, Adam
Weston, Jason
Bordes, Antoine
[J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 1870 - 1879
[3] Collins-Thompson Kevyn, 2014, P 23 TEXT RETRIEVAL
[4] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[5] Dua D, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P2368
[6] A General Theory of IR Evaluation Measures
Ferrante, Marco
Ferro, Nicola
Pontarollo, Silvia
[J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (03) : 409 - 422
[7] A probability ranking principle for interactive information retrieval
Fuhr, Norbert
[J]. INFORMATION RETRIEVAL, 2008, 11 (03): : 251 - 265
[8] Gienapp Lukas, 2020, P CIKM
[9] Karpukhin V, 2020, PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), P6769
[10] McCloskey M, 1989, PSYCHOL LEARN MOTIV, V24, P109, DOI [10.1016/S0079-7421(08)60536-8, DOI 10.1016/S0079-7421(08)60536-8]

← 1 2 3 →