A Metamorphic Testing Approach for Assessing Question Answering Systems

被引:6
|
作者
Tu, Kaiyi [1 ]
Jiang, Mingyue [1 ]
Ding, Zuohua [1 ]
机构
[1] Zhejiang Sci Tech Univ, Sch Informat Sci & Technol, Hangzhou 310018, Peoples R China
关键词
textual question answering; visual question answering; metamorphic testing; metamorphic relations; quality assessment;
D O I
10.3390/math9070726
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Question Answering (QA) enables the machine to understand and answer questions posed in natural language, which has emerged as a powerful tool in various domains. However, QA is a challenging task and there is an increasing concern about its quality. In this paper, we propose to apply the technique of metamorphic testing (MT) to evaluate QA systems from the users' perspectives, in order to help the users to better understand the capabilities of these systems and then to select appropriate QA systems for their specific needs. Two typical categories of QA systems, namely, the textual QA (TQA) and visual QA (VQA), are studied, and a total number of 17 metamorphic relations (MRs) are identified for them. These MRs respectively focus on some characteristics of different aspects of QA. We further apply MT to four QA systems (including two APIs from the AllenNLP platform, one API from the Transformers platform, and one API from CloudCV) by using all of the MRs. Our experimental results demonstrate the capabilities of the four subject QA systems from various aspects, revealing their strengths and weaknesses. These results further suggest that MT can be an effective method for assessing QA systems.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] A Lexical Approach for Spanish Question Answering
    Tellez, Alberto
    Juarez, Antonio
    Hernandez, Gustavo
    Denicia, Claudia
    Villatoro, Esau
    Montes, Manuel
    Villasenor, Luis
    ADVANCES IN MULTILINGUAL AND MULTIMODAL INFORMATION RETRIEVAL, 2008, 5152 : 328 - 331
  • [22] A survey on semantic question answering systems
    Antoniou, Christina
    Bassiliades, Nick
    KNOWLEDGE ENGINEERING REVIEW, 2022, 37 (03):
  • [23] Classification of question answering systems: A survey
    Sandhini, S.
    Binu, R.
    EMERGING TRENDS IN ENGINEERING, SCIENCE AND TECHNOLOGY FOR SOCIETY, ENERGY AND ENVIRONMENT, 2018, : 779 - 784
  • [24] Question Answering Systems: Survey and Trends
    Bouziane, Abdelghani
    Bouchiha, Djelloul
    Doumi, Noureddine
    Malki, Mimoun
    INTERNATIONAL CONFERENCE ON ADVANCED WIRELESS INFORMATION AND COMMUNICATION TECHNOLOGIES (AWICT 2015), 2015, 73 : 366 - 375
  • [25] A Corpus for Hybrid Question Answering Systems
    Grau, Brigitte
    Ligozat, Anne-Laure
    COMPANION PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2018 (WWW 2018), 2018, : 1081 - 1086
  • [26] A survey on question answering systems with classification
    Mishra, Amit
    Jain, Sanjay Kumar
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2016, 28 (03) : 345 - 361
  • [27] Assessing the quality of answers autonomously in community question–answering
    Long T. Le
    Chirag Shah
    Erik Choi
    International Journal on Digital Libraries, 2019, 20 : 351 - 367
  • [28] A Multilingual Semantic Similarity-Based Approach for Question-Answering Systems
    Wali, Wafa
    Ghorbel, Fatma
    Gragouri, Bilel
    Hamdi, Faycal
    Metais, Elisabeth
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2019, PT I, 2019, 11775 : 604 - 614
  • [29] Multiple answers to a question: a new approach for visual question answering
    Hosseinabad, Sayedshayan Hashemi
    Safayani, Mehran
    Mirzaei, Abdolreza
    VISUAL COMPUTER, 2021, 37 (01): : 119 - 131
  • [30] Connecting Question Answering and Conversational Agents Contextualizing German Questions for Interactive Question Answering Systems
    Waltinger, Ulli
    Breuing, Alexa
    Wachsmuth, Ipke
    KUNSTLICHE INTELLIGENZ, 2012, 26 (04): : 381 - 390