A Mighty Dataset for Stress-Testing Question Answering Systems

被引：0

作者：

Haarmann, Bastian ^{[1
]}

Martens, Claudio ^{[1
]}

Petzka, Henning ^{[1
]}

Napolitano, Giulio ^{[1
]}

机构：

[1] Fraunhofer Inst Intelligent Anal & Informat Syst, IAIS, D-53757 St Augustin, Germany

来源：

2018 IEEE 12TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC) | 2018年

基金：

欧盟地平线“2020”;

关键词：

Question Answering; Benchmark; DBpedia; Semantics;

D O I：

10.1109/ICSC.2018.00054

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The general goal of semantic question answering systems is to provide correct answers to natural language queries, given a number of structured datasets. The increasing broad deployment of question answering (QA) systems in everyday life requires a comparable and reliable rating of how well QA systems perform and how scalable they are. In order to achieve this, we developed a massive dataset of more than 2 million natural language questions and their SPARQL queries for the DBpedia dataset. We combined natural language processing and linked open data to automatically generate this large amount of valid question-query pairs. Our aim is to assist the benchmarking or scoring of QA systems in terms of answering questions in a range of languages, retrieving answers from heterogeneous sources or answering massive amounts of questions within a limited time. This dataset represents an ideal choice for stress-testing systems' scalability, speed and correctness. As such it has already been included into the Large-scale QA task of the Question Answering Over Linked Data (QALD) Challenge and the HOBBIT project Question Answering Benchmark.

引用

页码：278 / 281

页数：4

共 50 条

[1] Automatic question answering for multiple stakeholders, the epidemic question answering dataset
Travis R. Goodwin
Dina Demner-Fushman
Kyle Lo
Lucy Lu Wang
Hoa T. Dang
Ian M. Soboroff
Scientific Data, 9
[2] Automatic question answering for multiple stakeholders, the epidemic question answering dataset
Goodwin, Travis R.
Demner-Fushman, Dina
Lo, Kyle
Wang, Lucy Lu
Dang, Hoa T.
Soboroff, Ian M.
SCIENTIFIC DATA, 2022, 9 (01)
[3] QookA: A Cooking Question Answering Dataset
Frummet, Alexander
Elsweiler, David
PROCEEDINGS OF THE 2024 CONFERENCE ON HUMAN INFORMATION INTERACTION AND RETRIEVAL, CHIIR 2024, 2024, : 406 - 410
[4] PQuAD: A Persian question answering dataset
Darvishi, Kasra
Shahbodaghkhan, Newsha
Abbasiantaeb, Zahra
Momtazi, Saeedeh
COMPUTER SPEECH AND LANGUAGE, 2023, 80
[5] FQuAD: French Question Answering Dataset
d'Hoffschmidt, Martin
Belblidia, Wacim
Heinrich, Quentin
Brendle, Tom
Vidal, Maxime
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 1193 - 1208
[6] Slovak Dataset for Multilingual Question Answering
Hladek, Daniel
Stas, Jan
Juhar, Jozef
Koctur, Tomas
IEEE ACCESS, 2023, 11 : 32869 - 32881
[7] VQuAnDa: Verbalization QUestion ANswering DAtaset
Kacupaj, Endri
Zafar, Hamid
Lehmann, Jens
Maleshkova, Maria
SEMANTIC WEB (ESWC 2020), 2020, 12123 : 531 - 547
[8] LLQA - Lifelog Question Answering Dataset
Tran, Ly-Duyen
Thanh Cong Ho
Lan Anh Pham
Binh Nguyen
Gurrin, Cathal
Zhou, Liting
MULTIMEDIA MODELING (MMM 2022), PT I, 2022, 13141 : 217 - 228
[9] A Metamorphic Testing Approach for Assessing Question Answering Systems
Tu, Kaiyi
Jiang, Mingyue
Ding, Zuohua
MATHEMATICS, 2021, 9 (07)
[10] QALD-9-ES: A Spanish Dataset for Question Answering Systems
Soruco, Javier
Collarana, Diego
Both, Andreas
Usbeck, Ricardo
KNOWLEDGE GRAPHS: SEMANTICS, MACHINE LEARNING, AND LANGUAGES, 2023, 56 : 38 - 52

← 1 2 3 4 5 →