On the Role of Relevance in Natural Language Processing Tasks

被引:6
作者
Sauchuk, Artsiom [1 ]
Thorne, James [2 ]
Halevy, Alon [3 ]
Tonellotto, Nicola [4 ]
Silvestri, Fabrizio [1 ]
机构
[1] Sapienza Univ Rome, Rome, Italy
[2] Univ Cambridge, Cambridge, England
[3] Meta AI, Menlo Pk, CA USA
[4] Univ Pisa, Pisa, Italy
来源
PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22) | 2022年
基金
欧盟地平线“2020”;
关键词
NLP; IR; relevance; neural databases; effectiveness;
D O I
10.1145/3477495.3532034
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many recent Natural Language Processing (NLP) task formulations, such as question answering and fact verification, are implemented as a two-stage cascading architecture. In the first stage an IR system retrieves "relevant" documents containing the knowledge, and in the second stage an NLP system performs reasoning to solve the task. Optimizing the IR system for retrieving relevant documents ensures that the NLP system has sufficient information to operate over. These recent NLP task formulations raise interesting and exciting challenges for IR, where the end-user of an IR system is not a human with an information need, but another system exploiting the documents retrieved by the IR system to perform reasoning and address the user information need. Among these challenges, as we will show, is that noise from the IR system, such as retrieving spurious or irrelevant documents, can negatively impact the accuracy of the downstream reasoning module. Hence, there is the need to balance maximizing relevance while minimizing noise in the IR system. This paper presents experimental results on two NLP tasks implemented as a two-stage cascading architecture. We show how spurious or irrelevant retrieved results from the first stage can induce errors in the second stage. We use these results to ground our discussion of the research challenges that the IR community should address in the context of these knowledge-intensive NLP tasks.
引用
收藏
页码:1785 / 1789
页数:5
相关论文
共 21 条
  • [1] Andor Daniel, 2019, P 2019 C EMPIRICAL M, P5949, DOI 10.18653/v1/D19-1609
  • [2] Reading Wikipedia to Answer Open-Domain Questions
    Chen, Danqi
    Fisch, Adam
    Weston, Jason
    Bordes, Antoine
    [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 1870 - 1879
  • [3] Collins-Thompson Kevyn, 2014, P 23 TEXT RETRIEVAL
  • [4] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
  • [5] Dua D, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P2368
  • [6] A General Theory of IR Evaluation Measures
    Ferrante, Marco
    Ferro, Nicola
    Pontarollo, Silvia
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (03) : 409 - 422
  • [7] A probability ranking principle for interactive information retrieval
    Fuhr, Norbert
    [J]. INFORMATION RETRIEVAL, 2008, 11 (03): : 251 - 265
  • [8] Gienapp Lukas, 2020, P CIKM
  • [9] Karpukhin V, 2020, PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), P6769
  • [10] McCloskey M, 1989, PSYCHOL LEARN MOTIV, V24, P109, DOI [10.1016/S0079-7421(08)60536-8, DOI 10.1016/S0079-7421(08)60536-8]