Analysis of Neural Network Modules for Named Entity Recognition of Chinese Medical Texts

被引:0
|
作者
Yufeng D. [1 ]
Guoxiu H. [1 ]
机构
[1] Faculty of Economics and Management, East China Normal University, Shanghai
关键词
Chinese Medical Text; Module Decomposition; Named Entity Recognition; Neural Network;
D O I
10.11925/infotech.2096-3467.2022.0908
中图分类号
学科分类号
摘要
[Objective] This paper decomposes the named entity recognition models based on neural network for Chinese medical texts. We investigate the impacts of single neural network module and the collaboration of multiple modules on the entity recognition performance. [Methods] First, we chosed the benchmark datasets from CCKS2017, CCKS2019, and IMCS-NER for named entity recognition tasks. Then, we conducted extensive experiments to compare the performance of different single modules of the aforementioned layers. Third, we built and compared entity recognition models based on ensemble, parallel, and serial neural models. [Results] Using hfl/chinese-macbert-base, hfl/chinese-roberta-wwm-ext, hfl/chinese-bert-wwm-ext in the symbolic representation layer significantly improved the performance of entity recognition models, the average F1-scores reached 0.8816, 0.8816 and 0.8812 respectively. Stacking neural models at the context encoding layer improved the performance of the neural network. Moreover, ensembled neural networks could achieve the best performance, the F1-scores reached 0.9330, 0.8211 and 0.9181 respectively. [Limitations] More research is needed to examine our findings with datasets in other languages. [Conclusions] The characteristics of single neural modules and their collaboration could significantly affect the performance of the named entity recognition of Chinese medical texts. © 2023, Chinese Academy of Sciences. All rights reserved.
引用
收藏
页码:26 / 37
页数:11
相关论文
共 42 条
  • [31] Ji B, Liu R, Li S S, Et al., A Hybrid Approach for Named Entity Recognition in Chinese Electronic Medical Record, BMC Medical Informatics and Decision Making, 19, (2019)
  • [32] Xiong Y, Peng H, Xiang Y, Et al., Leveraging Multi-Source Knowledge for Chinese Clinical Named Entity Recognition via Relational Graph Convolutional Network, Journal of Biomedical Informatics, 128, (2022)
  • [33] He Q Z, Wu L, Yin Y D, Et al., Knowledge-Graph Augmented Word Representations for Named Entity Recognition [C], Proceedings of the 2020 AAAI Conference on Artificial Intelligence, 34, 5, pp. 7919-7926, (2020)
  • [34] Zhou B H, Cai X R, Zhang Y, Et al., An End-to-End Progressive Multi-Task Learning Framework for Medical Named Entity Recognition and Normalization, Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, pp. 6214-6224, (2021)
  • [35] Luo L, Li N, Li S C, Et al., DUTIR at the CCKS-2018 Task1: A Neural Network Ensemble Approach for Chinese Clinical Named Entity Recognition, Proceedings of the 3rd Evaluation Tasks at the China Conference on Knowledge Graph and Semantic Computing, pp. 7-12, (2018)
  • [36] Hu J, Shi X, Liu Z, Et al., HITSZ_CNER: A Hybrid System for Entity Recognition from Chinese Clinical Text
  • [37] Nie B L, Ding R X, Xie P J, Et al., Knowledge-Aware Named Entity Recognition with Alleviating Heterogeneity, Proceedings of the 2021 AAAI Conference on Artificial Intelligence, 35, 15, pp. 13595-13603, (2021)
  • [38] Nakayama H., Seqeval: A Python Framework for Sequence Labeling Evaluation
  • [39] CCKS2017-Medical Named Entity Recognition Task [DS/OL]
  • [40] Yin Z, Shen Y Y., On the Dimensionality of Word Embedding [OL]