Open Domain Question Answering over Tables via Dense Retrieval

被引:0
作者
Herzig, Jonathan [1 ]
Muller, Thomas [2 ]
Krichene, Syrine [2 ]
Eisenschlos, Julian Martin [2 ]
机构
[1] Tel Aviv Univ, Sch Comp Sci, Tel Aviv, Israel
[2] Google Res, Mountain View, CA USA
来源
2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021) | 2021年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent advances in open-domain QA have led to strong models based on dense retrieval, but only focused on retrieving textual passages. In this work, we tackle open-domain QA over tables for the first time, and show that retrieval can be improved by a retriever designed to handle tabular context. We present an effective pre-training procedure for our retriever and improve retrieval quality with mined hard negatives. As relevant datasets are missing, we extract a subset of NATURAL QUESTIONS (Kwiatkowski et al., 2019) into a Table QA dataset. We find that our retriever improves retrieval results from 72.0 to 81.1 recall@ 10 and end-to-end QA results from 33.8 to 37.7 exact match, over a BERT based retriever.
引用
收藏
页码:512 / 519
页数:8
相关论文
共 25 条
[1]   Reading Wikipedia to Answer Open-Domain Questions [J].
Chen, Danqi ;
Fisch, Adam ;
Weston, Jason ;
Bordes, Antoine .
PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, :1870-1879
[2]  
Chen W., 2021, INT C LEARNING REPRE
[3]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[4]  
Eisenschlos JM, 2020, FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020
[5]  
Gillick Daniel, 2019, P 23 C COMP NAT LANG, P528, DOI 10.18653/v1/K19-1049
[6]   Google Vizier: A Service for Black-Box Optimization [J].
Golovin, Daniel ;
Solnik, Benjamin ;
Moitra, Subhodeep ;
Kochanski, Greg ;
Karro, John ;
Sculley, D. .
KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, :1487-1496
[7]  
Guo RQ, 2020, PR MACH LEARN RES, V119
[8]  
Guu K, 2020, PR MACH LEARN RES, V119
[9]  
Henderson Matthew L., 2017, arXiv
[10]  
Herzig J, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), P4320