SCITAIL: A Textual Entailment Dataset from Science Question Answering

被引:0
|
作者
Khot, Tushar [1 ]
Sabharwal, Ashish [1 ]
Clark, Peter [1 ]
机构
[1] Allen Inst Artificial Intelligence, Seattle, WA 98103 USA
来源
THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2018年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a new dataset and model for textual entailment, derived from treating multiple-choice question-answering as an entailment problem. SCITAIL, is the first entailment set that is created solely from natural sentences that already exist independently "in the wild" rather than sentences authored specifically for the entailment task. Different from existing entailment datasets, we create hypotheses from science questions and the corresponding answer candidates, and premises from relevant web sentences retrieved from a large corpus. These sentences are often linguistically challenging. This, combined with the high lexical similarity of premise and hypothesis for both entailed and non-entailed pairs, makes this new entailment task particularly difficult. The resulting challenge is evidenced by state-of-the-art textual entailment systems achieving mediocre performance on So:TAIL, especially in comparison to a simple majority class baseline. As a step forward, we demonstrate that one can improve accuracy on SCITAIL by 5% using a new neural model that exploits linguistic structure.
引用
收藏
页码:5189 / 5197
页数:9
相关论文
共 50 条
  • [41] Enhanced question understanding with dynamic memory networks for textual question answering
    Yue, Chunyi
    Cao, Hanqiang
    Xiong, Kun
    Cui, Anqi
    Qin, Haocheng
    Li, Ming
    EXPERT SYSTEMS WITH APPLICATIONS, 2017, 80 : 39 - 45
  • [42] Check It Again: Progressive Visual Question Answering via Visual Entailment
    Si, Qingyi
    Lin, Zheng
    Zheng, Mingyu
    Fu, Peng
    Wang, Weiping
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 4101 - 4110
  • [43] Fully Authentic Visual Question Answering Dataset from Online Communities
    Chen, Chongyan
    Liu, Mengchen
    Codella, Noel
    Lie, Yunsheng
    Yuan, Lu
    Gurari, Danna
    COMPUTER VISION - ECCV 2024, PT XLVIII, 2025, 15106 : 252 - 269
  • [44] SPARTQA: A Textual Question Answering Benchmark for Spatial Reasoning
    Mirzaee, Roshanak
    Faghihi, Hossein Rajaby
    Ning, Qiang
    Kordjamshidi, Parisa
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 4582 - 4598
  • [45] Open-domain textual question answering techniques
    Harabagiu, Sanda M.
    Maiorano, Steven J.
    Paşca, Marius A.
    Natural Language Engineering, 2003, 9 (03) : 231 - 267
  • [46] Dynamic Memory Networks for Visual and Textual Question Answering
    Xiong, Caiming
    Merity, Stephen
    Socher, Richard
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [47] Single-dataset Experts for Multi-dataset Question Answering
    Friedman, Dan
    Dodge, Ben
    Chen, Danqi
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 6128 - 6137
  • [48] IITP at MEDIQA 2019: Systems Report for Natural Language Inference, Question Entailment and Question Answering
    Bandyopadhyay, Dibyanayan
    Gain, Baban
    Saikh, Tanik
    Ekbal, Asif
    SIGBIOMED WORKSHOP ON BIOMEDICAL NATURAL LANGUAGE PROCESSING (BIONLP 2019), 2019, : 517 - 522
  • [49] From Alignment to Entailment: A Unified Textual Entailment Framework for Entity Alignment
    Zhao, Yu
    Wu, Yike
    Cai, Xiangrui
    Zhang, Ying
    Zhang, Haiwei
    Yuan, Xiaojie
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 8795 - 8806
  • [50] Improvisation of Dataset Efficiency in Visual Question Answering Domain
    Mohamed, Sheerin Sitara Noor
    Srinivasan, Kavitha
    STATISTICS AND APPLICATIONS, 2022, 20 (02): : 279 - 289