A Machine Learning-based Method for Question Type Classification in Biomedical Question Answering

被引:23
|
作者
Sarrouti, Mourad [1 ]
El Alaoui, Said Ouatik [1 ]
机构
[1] Sidi Mohammed Ben Abdellah Univ, FSDM, Lab Comp Sci & Modeling, Fes, Morocco
关键词
Biomedical question answering; information retrieval; biomedical question classification; natural language processing; biomedical informatics; CLINICAL QUESTIONS; DOMAIN;
D O I
10.3414/ME16-01-0116
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Background and Objective: Biomedical question type classification is one of the important components of an automatic biomedical question answering system. The performance of the latter depends directly on the performance of its biomedical question type classification system, which consists of assigning a category to each question in order to determine the appropriate answer extraction algorithm. This study aims to automatically classify biomedical questions into one of the four categories: (1) yes/no, (2) factoid, (3) list, and (4) summary. Methods: In this paper, we propose a biomedical question type classification method based on machine learning approaches to automatically assign a category to a biomedical question. First, we extract features from biomedical questions using the proposed handcrafted lexico-syntactic patterns. Then, we feed these features for machine learning algorithms. Finally, the class label is predicted using the trained classifiers. Results: Experimental evaluations performed on large standard annotated datasets of biomedical questions, provided by the BioASQ challenge, demonstrated that our method exhibits significant improved performance when compared to four baseline systems. The proposed method achieves a roughly 10-point increase over the best baseline in terms of accuracy. Moreover, the obtained results show that using handcrafted lexico-syntactic patterns as features' provider of support vector machine (SVM) lead to the highest accuracy of 89.40%. Conclusion: The proposed method can automatically classify BioASQ questions into one of the four categories: yes/no, factoid, list, and summary. Furthermore, the results demonstrated that our method produced the best classification performance compared to four baseline systems.
引用
收藏
页码:209 / 216
页数:8
相关论文
共 50 条
  • [21] Hierarchical Question-Aware Context Learning with Augmented Data for Biomedical Question Answering
    Du, Yongping
    Guo, Wenyang
    Zhao, Yiliang
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 370 - 375
  • [22] Question Classification in a Question Answering System on Cooking
    Manna, Riyanka
    Das, Dipankar
    Gelbukh, Alexander
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, MICAI 2020, PT II, 2020, 12469 : 103 - 108
  • [23] Insincere Question Classification on Question Answering Forum
    Priyambowo, Hendri
    Adriani, Mirna
    PROCEEDING OF 2019 INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS (ICEEI), 2019, : 390 - 394
  • [24] Research on Question Classification for Automatic Question Answering
    Xu, Shihua
    Cheng, Gang
    Kong, Fang
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2016, : 218 - 221
  • [25] Question Classification for Arabic Question Answering Systems
    Al Chalabi, Hani Maluf
    Ray, Santosh Kumar
    Shaalan, Khaled
    2015 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY RESEARCH (ICTRC), 2015, : 310 - 313
  • [26] MED-GPVS: A Deep Learning-Based Joint Biomedical Image Classification and Visual Question Answering System for Precision e-Health
    Haridas, Harishma T.
    Fouda, Mostafa M.
    Fadlullah, Zubair Md
    Mahmoud, Mohamed
    ElHalawany, Basem M.
    Guizani, Mohsen
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, : 3838 - 3843
  • [27] A machine learning approach to introspection in a question answering system
    Czuba, K
    Prager, J
    Chu-Carroll, J
    PROCEEDINGS OF THE 2002 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, 2002, : 265 - 272
  • [28] Question Processing and Clustering in INDOC: A Biomedical Question Answering System
    Sondhi, Parikshit
    Raj, Purushottam
    Kumar, V. Vinod
    Mittal, Ankush
    EURASIP JOURNAL ON BIOINFORMATICS AND SYSTEMS BIOLOGY, 2007, (01)
  • [29] Using machine learning and text mining in question answering
    Juarez-Gonzalez, Antonio
    Tellez-Valero, Alberto
    Denicia-Carral, Claudia
    Montes-y-Gomez, Manuel
    Villasenor-Pineda, Luis
    Evaluation of Multilingual and Multi-modal Information Retrieval, 2007, 4730 : 415 - 423
  • [30] Machine learning for question answering from tabular data
    Khalid, Mahboob Alam
    Jijkoun, Valentin
    de Rijke, Maarten
    DEXA 2007: 18TH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2007, : 392 - +