SCITAIL: A Textual Entailment Dataset from Science Question Answering

被引：0

作者：

Khot, Tushar ^{[1
]}

Sabharwal, Ashish ^{[1
]}

Clark, Peter ^{[1
]}

机构：

[1] Allen Inst Artificial Intelligence, Seattle, WA 98103 USA

来源：

THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2018年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present a new dataset and model for textual entailment, derived from treating multiple-choice question-answering as an entailment problem. SCITAIL, is the first entailment set that is created solely from natural sentences that already exist independently "in the wild" rather than sentences authored specifically for the entailment task. Different from existing entailment datasets, we create hypotheses from science questions and the corresponding answer candidates, and premises from relevant web sentences retrieved from a large corpus. These sentences are often linguistically challenging. This, combined with the high lexical similarity of premise and hypothesis for both entailed and non-entailed pairs, makes this new entailment task particularly difficult. The resulting challenge is evidenced by state-of-the-art textual entailment systems achieving mediocre performance on So:TAIL, especially in comparison to a simple majority class baseline. As a step forward, we demonstrate that one can improve accuracy on SCITAIL by 5% using a new neural model that exploits linguistic structure.

引用

页码：5189 / 5197

页数：9

共 50 条

[21] FQuAD: French Question Answering Dataset
d'Hoffschmidt, Martin
Belblidia, Wacim
Heinrich, Quentin
Brendle, Tom
Vidal, Maxime
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 1193 - 1208
[22] Slovak Dataset for Multilingual Question Answering
Hladek, Daniel
Stas, Jan
Juhar, Jozef
Koctur, Tomas
IEEE ACCESS, 2023, 11 : 32869 - 32881
[23] VQuAnDa: Verbalization QUestion ANswering DAtaset
Kacupaj, Endri
Zafar, Hamid
Lehmann, Jens
Maleshkova, Maria
SEMANTIC WEB (ESWC 2020), 2020, 12123 : 531 - 547
[24] LLQA - Lifelog Question Answering Dataset
Tran, Ly-Duyen
Thanh Cong Ho
Lan Anh Pham
Binh Nguyen
Gurrin, Cathal
Zhou, Liting
MULTIMEDIA MODELING (MMM 2022), PT I, 2022, 13141 : 217 - 228
[25] Repurposing Entailment for Multi-Hop Question Answering Tasks
Trivedi, Harsh
Kwon, Heeyoung
Khot, Tushar
Sabharwal, Ashish
Balasubramanian, Niranjan
2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 2948 - 2958
[26] Learning Textual Entailment Classification from a Chinese RITE Dataset Specialized for Linguistic Phenomena
Liu, Chi-Ting
Chen, Shao-Heng
Lin, Chuan-Jie
PROCEEDINGS OF 2016 IEEE 17TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IEEE IRI), 2016, : 506 - 512
[27] SNLI Indo: A recognizing textual entailment dataset in Indonesian derived from the Stanford Natural Language Inference dataset
Putra, I. Made Suwija
Siahaan, Daniel
Saikhu, Ahmad
DATA IN BRIEF, 2024, 52
[28] Visual Question Answering with Textual Representations for Images
Hirota, Yusuke
Garcia, Noa
Otani, Mayu
Chu, Chenhui
Nakashima, Yuta
Taniguchi, Ittetsu
Onoye, Takao
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 3147 - 3150
[29] Question and Answer Classification in Czech Question Answering Benchmark Dataset
Kusnirakova, Dasa
Medved, Marek
Horak, Ales
PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE (ICAART), VOL 2, 2019, : 701 - 706
[30] PubMedQA: A Dataset for Biomedical Research Question Answering
Jin, Qiao
Dhingra, Bhuwan
Liu, Zhengping
Cohen, William W.
Lu, Xinghua
2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 2567 - 2577

← 1 2 3 4 5 →