A Corpus for Hybrid Question Answering Systems

被引:3
作者
Grau, Brigitte [1 ]
Ligozat, Anne-Laure [1 ]
机构
[1] Univ Paris Saclay, LIMSI, CNRS, ENSILE, Orsay, France
来源
COMPANION PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2018 (WWW 2018) | 2018年
关键词
Hybrid Question Answering; Corpus;
D O I
10.1145/3184558.3191540
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Question answering has been the focus of a lot of researches and evaluation campaigns, either for text-based systems (TREC and CLEF evaluation campaigns for example), or for knowledge-based systems (QALD, BioASQ). Few systems have effectively combined both types of resources and methods in order to exploit the fruitfulness of merging the two kinds of information repositories. The only evaluation QA track that focuses on hybrid QA is QALD since 2014. As it is a recent task, few annotated data are available (around 150 questions). In this paper, we present a question answering dataset that was constructed to develop and evaluate hybrid question answering systems. In order to create this corpus, we collected several textual corpora and augmented them with entities and relations of a knowledge base by retrieving paths in the knowledge base which allow to answer the questions. The resulting corpus contains 4300 question-answer pairs and 1600 have a true link with DBpedia.
引用
收藏
页码:1081 / 1086
页数:6
相关论文
共 30 条
[1]  
[Anonymous], 2016, P 2016 C N AM CHAPTE, DOI [10.18653/v1/N16-1108, DOI 10.18653/V1/N16-1108]
[2]  
[Anonymous], 2010, P 23 INT C COMPUTATI
[3]  
[Anonymous], 2005, TREC
[4]  
[Anonymous], 2016, ARXIV
[5]  
[Anonymous], 2013, ADV NEURAL INF PROCE
[6]  
Baeza-Yates R, 1999, MODERN INFORM RETRIE, V463
[7]  
Beaumont R., 2015, CLEF
[8]  
Berant J., 2013, P EMNLP 2013, P1533
[9]   Reading Wikipedia to Answer Open-Domain Questions [J].
Chen, Danqi ;
Fisch, Adam ;
Weston, Jason ;
Bordes, Antoine .
PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, :1870-1879
[10]   Finding needles in the haystack: Search and candidate generation [J].
Chu-Carroll, J. ;
Fan, J. ;
Boguraev, B. K. ;
Carmel, D. ;
Sheinwald, D. ;
Welty, C. .
IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2012, 56 (3-4)