MLEC-QA: A Chinese Multi-Choice Biomedical Question Answering Dataset

被引:0
|
作者
Li, Jing [1 ]
Zhong, Shangping [1 ]
Chen, Kaizhi [1 ]
机构
[1] Fuzhou Univ, Coll Comp & Data Sci, Fuzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
SYSTEM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Question Answering (QA) has been successfully applied in scenarios of human-computer interaction such as chatbots and search engines. However, for the specific biomedical domain, QA systems are still immature due to expert-annotated datasets being limited by category and scale. In this paper, we present MLEC-QA, the largest-scale Chinese multi-choice biomedical QA dataset, collected from the National Medical Licensing Examination in China. The dataset is composed of five subsets with 136,236 biomedical multi-choice questions with extra materials (images or tables) annotated by human experts, and first covers the following biomedical sub-fields: Clinic, Stomatology, Public Health, Traditional Chinese Medicine, and Traditional Chinese Medicine Combined with Western Medicine. We implement eight representative control methods and open-domain QA methods as baselines. Experimental results demonstrate that even the current best model can only achieve accuracies between 40% to 55% on five subsets, especially performing poorly on questions that require sophisticated reasoning ability. We hope the release of the MLEC-QA dataset can serve as a valuable resource for research and evaluation in open-domain QA, and also make advances for biomedical QA systems.(1)
引用
收藏
页码:8862 / 8874
页数:13
相关论文
共 50 条
  • [41] Hierarchical Multi-layer Transfer Learning Model for Biomedical Question Answering
    Du, Yongping
    Pei, Bingbing
    Zhao, Xiaozheng
    Ji, Junzhong
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 362 - 367
  • [42] Bi-directional Capsule Network Model for Chinese Biomedical Community Question Answering
    Zhang, Tongxuan
    Ren, Yuqi
    Tadessem, Michael Mesfin
    Xu, Bo
    Liu, Xikai
    Yang, Liang
    Yang, Zhihao
    Wang, Jian
    Lin, Hongfei
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING (NLPCC 2019), PT I, 2019, 11838 : 105 - 116
  • [43] HybridQA: A Dataset of Multi-Hop Question Answering over Tabular and Textual Data
    Chen, Wenhu
    Zha, Hanwen
    Chen, Zhiyu
    Xiong, Wenhan
    Wang, Hong
    Wang, William
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 1026 - 1036
  • [44] VIMQA: A Vietnamese Dataset for Advanced Reasoning and Explainable Multi-hop Question Answering
    Le, Nguyen-Khang
    Nguyen, Dieu-Hien
    Le, Tung
    Nguyen, Minh Le
    LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2022, : 6521 - 6529
  • [45] Development of an Extractive Clinical Question Answering Dataset with Multi-Answer and Multi-Focus Questions
    Moon, Sungrim
    He, Huan
    Liu, Hongfang
    Fan, Jungwei W.
    arXiv, 2022,
  • [46] EQUALS: A Real-world Dataset for Legal Question Answering via Reading Chinese Laws
    Chen, Andong
    Yao, Feng
    Zhao, Xinyan
    Zhang, Yating
    Sun, Changlong
    Liu, Yun
    Shen, Weixing
    PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND LAW, ICAIL 2023, 2023, : 71 - 80
  • [47] NuScenes-QA: A Multi-Modal Visual Question Answering Benchmark for Autonomous Driving Scenario
    Qian, Tianwen
    Chen, Jingjing
    Zhuo, Linhai
    Jiao, Yang
    Jiang, Yu-Gang
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 5, 2024, : 4542 - 4550
  • [48] MusTQ: A Temporal Knowledge Graph Question Answering Dataset for Multi-Step Temporal Reasoning
    Zhang, Tingyi
    Wang, Jiaan
    Li, Zhixu
    Qu, Jianfeng
    Liu, An
    Chen, Zhigang
    Zhi, Hongping
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 11688 - 11699
  • [49] COKG-QA: Multi-hop Question Answering over COVID-19 Knowledge Graphs
    Du, Huifang
    Le, Zhongwen
    Wang, Haofen
    Chen, Yunwen
    Yu, Jing
    DATA INTELLIGENCE, 2022, 4 (03) : 471 - 492
  • [50] COKG-QA: Multi-hop Question Answering over COVID-19 Knowledge Graphs
    Huifang Du
    Zhongwen Le
    Haofen Wang
    Yunwen Chen
    Jing Yu
    Data Intelligence, 2022, (03) : 471 - 492