A Machine Learning-based Method for Question Type Classification in Biomedical Question Answering

Cited by: 23

Authors:

Sarrouti, Mourad [1]

El Alaoui, Said Ouatik [1]

Affiliations:

[1] Sidi Mohammed Ben Abdellah Univ, FSDM, Lab Comp Sci & Modeling, Fes, Morocco

Source:

METHODS OF INFORMATION IN MEDICINE | 2017 / Vol. 56 / No. 03

Keywords:

Biomedical question answering; information retrieval; biomedical question classification; natural language processing; biomedical informatics; CLINICAL QUESTIONS; DOMAIN

DOI:

10.3414/ME16-01-0116

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Background and Objective: Biomedical question type classification is an important component of an automatic biomedical question answering system. The performance of such a system depends directly on the performance of its question type classifier, which assigns a category to each question in order to determine the appropriate answer extraction algorithm. This study aims to automatically classify biomedical questions into one of four categories: (1) yes/no, (2) factoid, (3) list, and (4) summary. Methods: In this paper, we propose a biomedical question type classification method based on machine learning approaches to automatically assign a category to a biomedical question. First, we extract features from biomedical questions using the proposed handcrafted lexico-syntactic patterns. Then, we feed these features to machine learning algorithms. Finally, the class label is predicted using the trained classifiers. Results: Experimental evaluations performed on large standard annotated datasets of biomedical questions, provided by the BioASQ challenge, demonstrated that our method exhibits significantly improved performance compared with four baseline systems. The proposed method achieves a roughly 10-point increase in accuracy over the best baseline. Moreover, the results show that using handcrafted lexico-syntactic patterns as the feature provider for a support vector machine (SVM) leads to the highest accuracy, 89.40%. Conclusion: The proposed method can automatically classify BioASQ questions into one of four categories: yes/no, factoid, list, and summary. Furthermore, the results demonstrated that our method produced the best classification performance among the systems compared, outperforming the four baselines.
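
The three steps described under Methods (pattern-based feature extraction, training, prediction) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the four regular expressions in `PATTERNS` are hypothetical stand-ins, not the paper's lexico-syntactic patterns; the four toy questions are invented, not BioASQ data; and a simple multiclass perceptron is swapped in for the SVM so that the example needs only the Python standard library.

```python
import re
from collections import defaultdict

LABELS = ["yesno", "factoid", "list", "summary"]

# Hypothetical surface patterns, for illustration only (not the paper's set).
PATTERNS = {
    "aux_initial": re.compile(r"^(is|are|does|do|can|has|have|was|were)\b"),
    "wh_is": re.compile(r"^(what|which|who|where|when)\b.*\bis\b"),
    "wh_are_or_list": re.compile(r"^(list\b|(what|which)\b.*\bare\b)"),
    "how_why": re.compile(r"^(how|why|describe|explain)\b"),
}

# Invented toy questions; a real experiment would use annotated BioASQ data.
TOY_EXAMPLES = [
    ("Is imatinib effective against chronic myeloid leukemia?", "yesno"),
    ("Which gene is mutated in cystic fibrosis?", "factoid"),
    ("Which genes are associated with Rett syndrome?", "list"),
    ("How does aspirin reduce inflammation?", "summary"),
]


def extract_features(question):
    """Step 1: map a question to binary features, one per matching pattern."""
    text = question.strip().lower()
    feats = {"bias": 1.0}
    for name, pattern in PATTERNS.items():
        if pattern.search(text):
            feats[name] = 1.0
    return feats


def score(weights, label, feats):
    """Linear score of one label for a feature dictionary."""
    return sum(weights[label].get(f, 0.0) * v for f, v in feats.items())


def predict(weights, feats):
    """Step 3: return the highest-scoring label (ties go to the first label)."""
    return max(LABELS, key=lambda label: score(weights, label, feats))


def train(examples, epochs=10):
    """Step 2: multiclass perceptron, used here as a stand-in for the SVM."""
    weights = {label: defaultdict(float) for label in LABELS}
    for _ in range(epochs):
        mistakes = 0
        for question, gold in examples:
            feats = extract_features(question)
            guess = predict(weights, feats)
            if guess != gold:
                mistakes += 1
                for f, v in feats.items():
                    weights[gold][f] += v   # pull the correct label up
                    weights[guess][f] -= v  # push the wrong label down
        if mistakes == 0:  # stop once the toy set is classified correctly
            break
    return weights


weights = train(TOY_EXAMPLES)
print(predict(weights, extract_features("Does metformin lower blood glucose?")))
```

With real data, the feature dictionaries produced by `extract_features` could be passed to any linear classifier; the perceptron here only keeps the sketch dependency-free and says nothing about the accuracy figures reported in the abstract.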

Pages: 209-216

Page count: 8
