Analysis of Neural Network Modules for Named Entity Recognition of Chinese Medical Texts

Cited by: 0
Authors
Yufeng D. [1]
Guoxiu H. [1]
Affiliation
[1] Faculty of Economics and Management, East China Normal University, Shanghai
Keywords
Chinese Medical Text; Module Decomposition; Named Entity Recognition; Neural Network
DOI
10.11925/infotech.2096-3467.2022.0908
Abstract
[Objective] This paper decomposes neural network-based named entity recognition models for Chinese medical texts and investigates how individual neural network modules, and the collaboration of multiple modules, affect entity recognition performance. [Methods] First, we chose benchmark datasets from CCKS2017, CCKS2019, and IMCS-NER for the named entity recognition task. Second, we conducted extensive experiments to compare the performance of different single modules within each layer of the model. Third, we built and compared entity recognition models based on ensemble, parallel, and serial neural models. [Results] Using hfl/chinese-macbert-base, hfl/chinese-roberta-wwm-ext, or hfl/chinese-bert-wwm-ext in the symbolic representation layer significantly improved the performance of the entity recognition models, with average F1-scores of 0.8816, 0.8816, and 0.8812, respectively. Stacking neural models at the context encoding layer improved the performance of the neural network. Moreover, ensembled neural networks achieved the best performance, with F1-scores of 0.9330, 0.8211, and 0.9181, respectively. [Limitations] More research is needed to test our findings on datasets in other languages. [Conclusions] The characteristics of single neural modules and the way they collaborate can significantly affect the performance of named entity recognition for Chinese medical texts. © 2023, Chinese Academy of Sciences. All rights reserved.
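The abstract does not say how the ensembled models combine their outputs. As a rough illustration only, one common option is per-character majority voting over the tag sequences produced by several taggers. The sketch below assumes that scheme; the function name `majority_vote` and the `B-DIS`/`I-DIS`/`O` labels are hypothetical and are not taken from the paper.

```python
from collections import Counter
from typing import List, Sequence


def majority_vote(predictions: Sequence[Sequence[str]]) -> List[str]:
    """Combine per-character tag sequences from several models by majority vote.

    predictions[m][i] is the tag that model m assigns to character i.
    Ties are broken in favour of the earliest model in `predictions`.
    This is an assumed ensembling scheme, not the paper's documented method.
    """
    if not predictions:
        raise ValueError("need at least one model's predictions")
    length = len(predictions[0])
    if any(len(p) != length for p in predictions):
        raise ValueError("all models must tag the same number of characters")

    voted = []
    for tags in zip(*predictions):
        counts = Counter(tags)
        best = max(counts.values())
        # Walk the tags in model order so that ties go to the earliest model.
        voted.append(next(t for t in tags if counts[t] == best))
    return voted


# Hypothetical outputs of three taggers for a four-character sentence.
model_a = ["B-DIS", "I-DIS", "O", "O"]
model_b = ["B-DIS", "I-DIS", "I-DIS", "O"]
model_c = ["B-DIS", "O", "O", "O"]
ensemble_tags = majority_vote([model_a, model_b, model_c])
```

Voting on each character independently can yield tag sequences that violate the tagging scheme (for example, an `I-` tag with no preceding `B-` tag), so a practical system would likely add a repair or constrained decoding step.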
Pages: 26-37
Page count: 11