Dense Passage Retrieval for Open-Domain Question Answering

被引:0
|
作者
Karpukhin, Vladimir [1 ]
Oguz, Barlas [1 ]
Min, Sewon [2 ]
Lewis, Patrick [1 ]
Wu, Ledell [1 ]
Edunov, Sergey [1 ]
Chen, Danqi [3 ]
Yih, Wen Tau [1 ]
机构
[1] Facebook AI, London, England
[2] Univ Washington, Seattle, WA USA
[3] Princeton Univ, Princeton, NJ USA
来源
PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP) | 2020年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Open-domain question answering relies on efficient passage retrieval to select candidate contexts, where traditional sparse vector space models, such as TF-IDF or BM25, are the de facto method. In this work, we show that retrieval can be practically implemented using dense representations alone, where embeddings are learned from a small number of questions and passages by a simple dual-encoder framework. When evaluated on a wide range of open-domain QA datasets, our dense retriever outperforms a strong Lucene-BM25 system greatly by 9%-19% absolute in terms of top-20 passage retrieval accuracy, and helps our end-to-end QA system establish new state-of-the-art on multiple open-domain QA benchmarks.(1)
引用
收藏
页码:6769 / 6781
页数:13
相关论文
共 50 条
  • [1] Dense Hierarchical Retrieval for Open-Domain Question Answering
    Liu, Ye
    Hashimoto, Kazuma
    Zhou, Yingbo
    Yavuz, Semih
    Xiong, Caiming
    Yu, Philip S.
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 188 - 200
  • [2] RocketQA: An Optimized Training Approach to Dense Passage Retrieval for Open-Domain Question Answering
    Qu, Yingqi
    Ding, Yuchen
    Liu, Jing
    Liu, Kai
    Ren, Ruiyang
    Zhao, Wayne Xin
    Dong, Daxiang
    Wu, Hua
    Wang, Haifeng
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 5835 - 5847
  • [3] Efficient Passage Retrieval with Hashing for Open-domain Question Answering
    Yamada, Ikuya
    Asai, Akari
    Hajishirzi, Hannaneh
    ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 979 - 986
  • [4] Passage filtering for open-domain Question Answering
    Noguera, Elisa
    Llopis, Fernando
    Ferrandez, Antonio
    ADVANCES IN NATURAL LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4139 : 534 - 540
  • [5] Contrastive Refinement for Dense Retrieval Inference in the Open-Domain Question Answering Task
    Zhai, Qiuhong
    Zhu, Wenhao
    Zhang, Xiaoyu
    Liu, Chenyun
    FUTURE INTERNET, 2023, 15 (04):
  • [6] Hybrid Hierarchical Retrieval for Open-Domain Question Answering
    Arivazhagan, Manoj Ghuhan
    Li, Lan
    Qi, Peng
    Chen, Xinchi
    Wang, William Yang
    Huang, Zhiheng
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 10680 - 10689
  • [7] Span prompt dense passage retrieval for Chinese open domain question answering
    Fan, Chunxiao
    Yan, Zhen
    Wu, Yuexin
    Qian, Bing
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (05) : 7285 - 7295
  • [8] Hyperlink-induced Pre-training for Passage Retrieval in Open-domain Question Answering
    Zhou, Jiawei
    Li, Xiaoguang
    Shang, Lifeng
    Luo, Lan
    Zhan, Ke
    Hu, Enrui
    Zhang, Xinyu
    Jiang, Hao
    Cao, Zhao
    Yu, Fan
    Jiang, Xin
    Liu, Qun
    Chen, Lei
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 7135 - 7146
  • [9] Generation-Augmented Retrieval for Open-Domain Question Answering
    Mao, Yuning
    He, Pengcheng
    Liu, Xiaodong
    Shen, Yelong
    Gao, Jianfeng
    Han, Jiawei
    Chen, Weizhu
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 4089 - 4100
  • [10] Task-Aware Specialization for Efficient and Robust Dense Retrieval for Open-Domain Question Answering
    Cheng, Hao
    Fang, Hao
    Liu, Xiaodong
    Gao, Jianfeng
    61ST CONFERENCE OF THE THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 2, 2023, : 1864 - 1875