Multi-Label Question Classification for Factoid and List Type Questions in Biomedical Question Answering

Cited by: 10
Authors
Wasim, Muhammad [1]
Mahmood, Waqar [2]
Asim, Muhammad Nabeel [2]
Ghani, Muhammad Usman [1]
Affiliations
[1] Univ Engn & Technol, Dept Comp Sci & Engn, Lahore 54890, Pakistan
[2] Univ Engn & Technol, Al Khwarizmi Inst Comp Sci, Lahore 54890, Pakistan
Keywords
Question classification; question answering; corpus generation; binary relevance; copy transformation; SEMANTIC WEB; SYSTEM; BIOASQ;
DOI
10.1109/ACCESS.2018.2887165
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Biomedical experts and bio-curators are unable to quickly find short and precise information using typical search engines, as the amount of biomedical literature is increasing exponentially. The research community is focusing on biomedical question answering (QA) systems so that anyone can find precise information nuggets in the massive amount of biomedical literature. Generally, user queries fall under different categories such as factoid, list, yes/no, or summary. The existing state-of-the-art question answering systems deal with most of these question types. However, research to improve the performance on individual question types is also on the rise. Question classification plays a vital role in improving QA system performance for factoid and list type questions, as it allows the answer processing stage to narrow down the candidate answer space and assign a higher rank to the correct answers. A single biomedical answer or entity may be associated with more than one biomedical category or semantic type, e.g., Coenzyme Q10 is classified under two categories in the Unified Medical Language System (UMLS): organic chemical and biologically active substance. This inherent characteristic of biomedical entities makes question classification in the biomedical domain a multi-label classification problem, where one question might expect answers belonging to more than one semantic type. To the best of our knowledge, several QA systems treat question classification as a multi-class classification problem and only one state-of-the-art system, OAQA, treats it as a multi-label classification problem. In this paper, we analyze the pipeline of the OAQA system for factoid and list type questions, emphasizing multi-label question classification. We use an improved question classification dataset with the copy transformation technique to improve the performance on list type questions. Moreover, we introduce a binary transformation in the pipeline of factoid questions to increase its performance. Our modified methodology enhances the performance on both list and factoid type questions by a margin of 2% and 3%, evaluated on the standard F1 and Mean Reciprocal Rank measures, respectively.
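The two problem transformations named in the abstract, and the Mean Reciprocal Rank measure, can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the toy questions, the label names other than the two UMLS categories quoted above, and the function names `copy_transformation`, `binary_relevance`, and `mean_reciprocal_rank` are all assumptions made for illustration. "Binary transformation" is read here as binary relevance, following the keyword list.

```python
from typing import Dict, List, Sequence, Set, Tuple

# Toy multi-label data: each question maps to a set of expected semantic types.
# The questions and most labels are hypothetical, not from the paper's dataset.
DATA: List[Tuple[str, Set[str]]] = [
    ("Which enzyme is inhibited by aspirin?", {"enzyme", "protein"}),
    ("What is Coenzyme Q10?", {"organic chemical", "biologically active substance"}),
    ("Which gene is mutated in cystic fibrosis?", {"gene"}),
]


def copy_transformation(data: List[Tuple[str, Set[str]]]) -> List[Tuple[str, str]]:
    """Copy each question once per label, yielding single-label examples."""
    return [(q, label) for q, labels in data for label in sorted(labels)]


def binary_relevance(data: List[Tuple[str, Set[str]]]) -> Dict[str, List[Tuple[str, int]]]:
    """Build one binary (question, 0/1) dataset per label."""
    all_labels = sorted({label for _, labels in data for label in labels})
    return {
        label: [(q, int(label in labels)) for q, labels in data]
        for label in all_labels
    }


def mean_reciprocal_rank(ranked: Sequence[Sequence[str]], gold: Sequence[Set[str]]) -> float:
    """Average of 1/rank of the first correct answer; a question with no hit scores 0."""
    if not ranked:
        return 0.0
    total = 0.0
    for candidates, correct in zip(ranked, gold):
        for rank, answer in enumerate(candidates, start=1):
            if answer in correct:
                total += 1.0 / rank
                break
    return total / len(ranked)
```

Under copy transformation a standard multi-class learner can be trained on the expanded examples, whereas binary relevance trains one independent yes/no classifier per semantic type; which of the two suits which question type is the empirical question the abstract reports on.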
Pages: 3882-3896
Page count: 15
Related papers
50 in total
  • [1] The Research of Multi-label Question Classification in Community Question Answering
    Shu, Peng
    Su, Lei
    Yuan, Liwei
    PROCEEDINGS OF THE 28TH CHINESE CONTROL AND DECISION CONFERENCE (2016 CCDC), 2016, : 5504 - 5507
  • [2] Multi-label biomedical question classification for lexical answer type prediction
    Wasim, Muhammad
    Asim, Muhammad Nabeel
    Khan, Muhammad Usman Ghani
    Mahmood, Waqar
    JOURNAL OF BIOMEDICAL INFORMATICS, 2019, 93
  • [3] Automatic Question Tagging using Multi-Label Classification in Community Question Answering Sites
    Sahu, Tirath Prasad
    Thummalapudi, Reswanth Sai
    Nagwani, Naresh Kumar
    2019 6TH IEEE INTERNATIONAL CONFERENCE ON CYBER SECURITY AND CLOUD COMPUTING (IEEE CSCLOUD 2019) / 2019 5TH IEEE INTERNATIONAL CONFERENCE ON EDGE COMPUTING AND SCALABLE CLOUD (IEEE EDGECOM 2019), 2019, : 63 - 68
  • [4] Data Augmentation for Biomedical Factoid Question Answering
    Pappas, Dimitris
    Malakasiotis, Prodromos
    Androutsopoulos, Ion
    PROCEEDINGS OF THE 21ST WORKSHOP ON BIOMEDICAL LANGUAGE PROCESSING (BIONLP 2022), 2022, : 63 - 81
  • [5] An Arabic Question-Answering system for factoid questions
    Brini, Wissal
    Ellouze, Mariem
    Mesfar, Slim
    Belguith, Lamia Hadrich
    IEEE NLP-KE 2009: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2009, : 417 - +
  • [6] QUESTION ANSWERING SYSTEM FOR FACTOID BASED QUESTION
    Ranjan, Prakash
    Balabantaray, Rakesh Chandra
    PROCEEDINGS OF THE 2016 2ND INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2016, : 221 - 224
  • [7] Adversarial Knowledge Distillation Based Biomedical Factoid Question Answering
    Bai, Jun
    Yin, Chuantao
    Zhang, Jianfei
    Wang, Yanmeng
    Dong, Yi
    Rong, Wenge
    Xiong, Zhang
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (01) : 106 - 118
  • [8] Using IS-A relation patterns for factoid questions in Question Answering systems
    Shim, Bojun
    Ko, Youngjoong
    Seo, Jungyun
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2006, E89D (12) : 2985 - 2989
  • [9] A systematic review of question answering systems for non-factoid questions
    Eduardo Gabriel Cortes
    Vinicius Woloszyn
    Dante Barone
    Sebastian Möller
    Renata Vieira
    Journal of Intelligent Information Systems, 2022, 58 : 453 - 480
  • [10] Syntactic Open Domain Arabic Question/Answering System for Factoid Questions
    Fareed, Noha S.
    Mousa, Hamdy M.
    Elsisi, Ashraf B.
    2014 9th International Conference on Informatics and Systems (INFOS), 2014,