Applying deep matching networks to Chinese medical question answering: a study and a dataset

被引:19
|
作者
He, Junqing [1 ,2 ]
Fu, Mingming [1 ,2 ]
Tu, Manshu [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Acoust, Key Lab Speech Acoust & Content Understanding, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Medical question answering; Chinese word segmentation; Semantic matching; Convolutional neural networks; Deep learning;
D O I
10.1186/s12911-019-0761-8
中图分类号
R-058 [];
学科分类号
摘要
BackgroundMedical and clinical question answering (QA) is highly concerned by researchers recently. Though there are remarkable advances in this field, the development in Chinese medical domain is relatively backward. It can be attributed to the difficulty of Chinese text processing and the lack of large-scale datasets. To bridge the gap, this paper introduces a Chinese medical QA dataset and proposes effective methods for the task.MethodsWe first construct a large scale Chinese medical QA dataset. Then we leverage deep matching neural networks to capture semantic interaction between words in questions and answers. Considering that Chinese Word Segmentation (CWS) tools may fail to identify clinical terms, we design a module to merge the word segments and produce a new representation. It learns the common compositions of words or segments by using convolutional kernels and selects the strongest signals by windowed pooling.ResultsThe best performer among popular CWS tools on our dataset is found. In our experiments, deep matching models substantially outperform existing methods. Results also show that our proposed semantic clustered representation module improves the performance of models by up to 5.5% Precision at 1 and 4.9% Mean Average Precision.ConclusionsIn this paper, we introduce a large scale Chinese medical QA dataset and cast the task into a semantic matching problem. We also compare different CWS tools and input units. Among the two state-of-the-art deep matching neural networks, MatchPyramid performs better. Results also show the effectiveness of the proposed semantic clustered representation module.
引用
收藏
页数:10
相关论文
共 24 条
  • [21] Study design of deep learning based automatic detection of cerebrovascular diseases on medical imaging: a position paper from Chinese Association of Radiologists
    Zhang, Longjiang
    Shi, Zhao
    Chen, Min
    Chen, Yingmin
    Cheng, Jingliang
    Fan, Li
    Hong, Nan
    Jia, Wenxiao
    Jiang, Guihua
    Ju, Shenghong
    Li, Xiaogang
    Li, Xiuli
    Liang, Changhong
    Liao, Weihua
    Liu, Shiyuan
    Lu, Zaiming
    Ma, Lin
    Ren, Ke
    Rong, Pengfei
    Song, Bin
    Sun, Gang
    Wang, Rongpin
    Wen, Zhibo
    Xu, Haibo
    Xu, Kai
    Yan, Fuhua
    Yu, Yizhou
    Zha, Yunfei
    Zhang, Fandong
    Zheng, Minwen
    Zhou, Zhen
    Zhu, Wenzhen
    Lu, Guangming
    Jin, Zhengyu
    INTELLIGENT MEDICINE, 2022, 2 (04): : 221 - 229
  • [22] A Comparative Study of Deep Learning Classification Methods on a Small Environmental Microorganism Image Dataset (EMDS-6): From Convolutional Neural Networks to Visual Transformers
    Zhao, Peng
    Li, Chen
    Rahaman, Md Mamunur
    Xu, Hao
    Yang, Hechen
    Sun, Hongzan
    Jiang, Tao
    Grzegorzek, Marcin
    FRONTIERS IN MICROBIOLOGY, 2022, 13
  • [23] Multiscale Local Enhancement Deep Convolutional Networks for the Automated 3D Segmentation of Gross Tumor Volumes in Nasopharyngeal Carcinoma: A Multi-Institutional Dataset Study
    Yang, Geng
    Dai, Zhenhui
    Zhang, Yiwen
    Zhu, Lin
    Tan, Junwen
    Chen, Zefeiyun
    Zhang, Bailin
    Cai, Chunya
    He, Qiang
    Li, Fei
    Wang, Xuetao
    Yang, Wei
    FRONTIERS IN ONCOLOGY, 2022, 12
  • [24] Deep Learning Assisted End-to-End Synthesis of mm-Wave Passive Networks with 3D EM Structures: A Study on A Transformer-Based Matching Network
    Er, Siawpeng
    Liu, Edward
    Chen, Minshuo
    Li, Yan
    Liu, Yuqi
    Zhao, Tuo
    Wang, Hua
    2021 IEEE MTT-S INTERNATIONAL MICROWAVE SYMPOSIUM (IMS), 2021, : 66 - 69