A Machine Learning-based Method for Question Type Classification in Biomedical Question Answering

Cited by: 23
Authors
Sarrouti, Mourad [1 ]
El Alaoui, Said Ouatik [1 ]
Affiliations
[1] Sidi Mohammed Ben Abdellah Univ, FSDM, Lab Comp Sci & Modeling, Fes, Morocco
Keywords
Biomedical question answering; information retrieval; biomedical question classification; natural language processing; biomedical informatics; CLINICAL QUESTIONS; DOMAIN;
DOI
10.3414/ME16-01-0116
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Background and Objective: Biomedical question type classification is an important component of an automatic biomedical question answering system. The performance of such a system depends directly on the performance of its question type classifier, which assigns a category to each question in order to determine the appropriate answer extraction algorithm. This study aims to automatically classify biomedical questions into one of four categories: (1) yes/no, (2) factoid, (3) list, and (4) summary. Methods: We propose a machine learning-based method for biomedical question type classification that automatically assigns a category to a biomedical question. First, we extract features from biomedical questions using the proposed handcrafted lexico-syntactic patterns. Then, we feed these features to machine learning algorithms. Finally, the class label is predicted using the trained classifiers. Results: Experimental evaluations performed on large standard annotated datasets of biomedical questions, provided by the BioASQ challenge, demonstrated that our method exhibits significantly improved performance compared with four baseline systems. The proposed method achieves a roughly 10-point increase in accuracy over the best baseline. Moreover, the results show that using handcrafted lexico-syntactic patterns as the feature provider for a support vector machine (SVM) leads to the highest accuracy of 89.40%. Conclusion: The proposed method can automatically classify BioASQ questions into one of four categories: yes/no, factoid, list, and summary. Furthermore, the results demonstrated that our method produced the best classification performance compared with four baseline systems.
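The feature extraction step described in the abstract can be illustrated with a minimal sketch. It assumes a small set of regular-expression patterns invented here for illustration; the paper's actual handcrafted lexico-syntactic patterns are not listed in this record, and the names `PATTERNS` and `extract_features` are hypothetical.

```python
import re

# Hypothetical patterns for illustration only. The paper's actual
# lexico-syntactic patterns are not given in this record.
PATTERNS = [
    # Questions opening with an auxiliary verb often expect yes/no answers.
    ("starts_with_auxiliary",
     re.compile(r"^(is|are|does|do|can|has|have|was|were)\b", re.I)),
    # Questions opening with a wh-word.
    ("starts_with_wh_word",
     re.compile(r"^(what|which|who|where|when|how)\b", re.I)),
    # Openings that may signal a request for several items.
    ("starts_with_list_cue",
     re.compile(r"^(list|name)\b|^(what|which)\s+are\b", re.I)),
    # Words that may signal a request for an explanation.
    ("mentions_explanation_cue",
     re.compile(r"\b(mechanism|role|function|describe|explain)\b", re.I)),
]


def extract_features(question):
    """Return one binary feature per pattern (1 if the pattern matches)."""
    text = question.strip()
    return [1 if regex.search(text) else 0 for _, regex in PATTERNS]


if __name__ == "__main__":
    for q in [
        "Is aspirin an anticoagulant?",
        "Which genes are associated with cystic fibrosis?",
        "List the symptoms of Marfan syndrome.",
        "What is the role of BRCA1 in DNA repair?",
    ]:
        print(q, extract_features(q))
```

Vectors of this kind could then be passed, together with the annotated question types, to a classifier such as an SVM; the classifier configuration used in the paper is not stated in this record.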
Pages: 209 - 216
Number of pages: 8
Related papers
50 in total
  • [31] Linguistic treatment of questions in Spanish for question classification in question answering systems.
    Olvera-Lobo, Maria-Dolores
    Robinson-Garcia, Nicolas
    PROFESIONAL DE LA INFORMACION, 2009, 18 (02): : 180 - 187
  • [32] DAQAS: Deep Arabic Question Answering System based on duplicate question detection and machine reading comprehension
    Alami, Hamza
    Mahdaouy, Abdelkader El
    Benlahbib, Abdessamad
    En-Nahnahi, Noureddine
    Berrada, Ismail
    Ouatik, Said El Alaoui
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (08)
  • [33] Pre-trained Language Model for Biomedical Question Answering
    Yoon, Wonjin
    Lee, Jinhyuk
    Kim, Donghyeon
    Jeong, Minbyul
    Kang, Jaewoo
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT II, 2020, 1168 : 727 - 740
  • [34] Document Retrieval for Biomedical Question Answering with Neural Sentence Matching
    Noh, Jiho
    Kavuluru, Ramakanth
    2018 17TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA), 2018, : 194 - 201
  • [35] Improving Biomedical Question Answering by Data Augmentation and Model Weighting
    Du, Yongping
    Yan, Jingya
    Lu, Yuxuan
    Zhao, Yiliang
    Jin, Xingnan
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (02) : 1114 - 1124
  • [36] A Survey on Representation Learning in Visual Question Answering
    Sahani, Manish
    Singh, Priyadarshan
    Jangpangi, Sachin
    Kumar, Shailender
    MACHINE LEARNING AND BIG DATA ANALYTICS (PROCEEDINGS OF INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND BIG DATA ANALYTICS (ICMLBDA) 2021), 2022, 256 : 326 - 336
  • [37] Collaborative Learning for Answer Selection in Question Answering
    Shao, Taihua
    Kui, Xiaoyan
    Zhang, Pengfei
    Chen, Honghui
    IEEE ACCESS, 2019, 7 : 7337 - 7347
  • [38] Improved class-specific vector for biomedical question type classification
    Gupta, Tanu
    Kumar, Ela
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2023, 26 (02) : 182 - 191
  • [39] A question answering system approach for collaborative learning
    Wang, Chun-Chia
    Hung, Jason C.
    Yang, Che-Yu
    Chang, Hsuan-Pu
    2006 10TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, PROCEEDINGS, VOLS 1 AND 2, 2006, : 1403 - 1407
  • [40] A Legal Question Answering Ontology-Based System
    Kourtin, Ismahane
    Mbarki, Samir
    Mouloudi, Abdelaaziz
    FORMALISING NATURAL LANGUAGES: APPLICATIONS TO NATURAL LANGUAGE PROCESSING AND DIGITAL HUMANITIES, NOOJ 2020, 2021, 1389 : 218 - 229