SCITAIL: A Textual Entailment Dataset from Science Question Answering

Cited by: 0

Authors:

Khot, Tushar ^{[1]}

Sabharwal, Ashish ^{[1]}

Clark, Peter ^{[1]}

Affiliations:

[1] Allen Inst Artificial Intelligence, Seattle, WA 98103 USA

Source:

THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2018

Keywords:

DOI:

Not available

Chinese Library Classification (CLC):

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We present a new dataset and model for textual entailment, derived from treating multiple-choice question-answering as an entailment problem. SCITAIL is the first entailment set that is created solely from natural sentences that already exist independently "in the wild" rather than sentences authored specifically for the entailment task. Different from existing entailment datasets, we create hypotheses from science questions and the corresponding answer candidates, and premises from relevant web sentences retrieved from a large corpus. These sentences are often linguistically challenging. This, combined with the high lexical similarity of premise and hypothesis for both entailed and non-entailed pairs, makes this new entailment task particularly difficult. The resulting challenge is evidenced by state-of-the-art textual entailment systems achieving mediocre performance on SCITAIL, especially in comparison to a simple majority class baseline. As a step forward, we demonstrate that one can improve accuracy on SCITAIL by 5% using a new neural model that exploits linguistic structure.


Pages: 5189-5197

Number of pages: 9
