Testing the reasoning for question answering validation

被引:11
|
作者
Penas, Anselmo [1 ]
Rodrigo, Alvaro [1 ]
Sama, Valentin [1 ]
Verdejo, Felisa [1 ]
机构
[1] Univ Nacl Educ Distancia, Depto Lenguajes & Sistemas Informat, Madrid 28040, Spain
关键词
textual entailment; test collections; question answering; answer validation; evaluation;
D O I
10.1093/logcom/exm072
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Question answering (QA) is a task that deserves more collaboration between natural language processing (NLP) and knowledge representation (KR) communities, not only to introduce reasoning when looking for answers or making use of answer type taxonomies and encyclopaedic knowledge, but also, as discussed here, for answer validation (AV), that is to say, to decide whether the responses of a QA system are correct or not. This was one of the motivations for the first Answer Validation Exercise at CLEF 2006 (AVE 2006). The starting point for the AVE 2006 was the reformulation of the answer validation as a recognizing textual entailment (RTE) problem, under the assumption that a hypothesis can be automatically generated instantiating a hypothesis pattern with a QA system answer. The test collections that we developed in seven different languages at AVE 2006 are specially oriented to the development and evaluation of answer validation systems. We show in this article the methodology followed for developing these collections taking advantage of the human assessments already made in the evaluation of QA systems. We also propose an evaluation framework for AV linked to a QA evaluation track. We quantify and discuss the source of errors introduced by the reformulation of the answer validation problem in terms of textual entailment (around 2, in the range of inter-annotator disagreement). We also show the evaluation results of the first answer validation exercise at CLEF 2006 where 11 groups have participated with 38 runs in seven different languages. The most extensively used techniques were Machine Learning and overlapping measures, but systems with broader knowledge resources and richer representation formalisms obtained the best results.
引用
收藏
页码:459 / 474
页数:16
相关论文
共 50 条
  • [1] Enabling answer validation by logic form reasoning in Chinese question answering
    Zhang, Y
    Zhang, DM
    2003 INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, PROCEEDINGS, 2003, : 275 - 280
  • [2] Chain of Reasoning for Visual Question Answering
    Wu, Chenfei
    Liu, Jinlai
    Wang, Xiaojie
    Dong, Xuan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [3] Knowledge and reasoning for question answering: Research perspectives
    Saint-Dizier, Patrick
    Moens, Marie-Francine
    INFORMATION PROCESSING & MANAGEMENT, 2011, 47 (06) : 899 - 906
  • [4] Variational Reasoning for Question Answering with Knowledge Graph
    Zhang, Yuyu
    Dai, Hanjun
    Kozareva, Zornitsa
    Smola, Alexander J.
    Song, Le
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 6069 - 6076
  • [5] Sequential Visual Reasoning for Visual Question Answering
    Liu, Jinlai
    Wu, Chenfei
    Wang, Xiaojie
    Dong, Xuan
    PROCEEDINGS OF 2018 5TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2018, : 410 - 415
  • [6] Preference reasoning in advanced question answering systems
    Benamara, Farah
    Kaci, Souhila
    AI 2006: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4304 : 1116 - +
  • [7] Video Question Answering With Semantic Disentanglement and Reasoning
    Liu, Jin
    Wang, Guoxiang
    Xie, Jialong
    Zhou, Fengyu
    Xu, Huijuan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (05) : 3663 - 3673
  • [8] Visual question answering by pattern matching and reasoning
    Zhan, Huayi
    Xiong, Peixi
    Wang, Xin
    Yang, Lan
    NEUROCOMPUTING, 2022, 467 : 323 - 336
  • [9] Multimodal Learning and Reasoning for Visual Question Answering
    Ilievski, Ilija
    Feng, Jiashi
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [10] Graph Reasoning for Question Answering with Triplet Retrieval
    Li, Shiyang
    Gao, Yifan
    Jiang, Haoming
    Yin, Qingyu
    Li, Zheng
    Yan, Xifeng
    Zhang, Chao
    Yin, Bing
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 3366 - 3375