Robust Chinese Clinical Named Entity Recognition with information bottleneck and adversarial training

Cited by: 0
Authors
He, Yunfei [1]
Zhang, Zhiqiang [2]
Shen, Jinlong [1]
Li, Yuling [1]
Zhang, Yiwen [3]
Ding, Weiping [4, 5]
Yang, Fei [1]
Affiliations
[1] Anhui Med Univ, Sch Biomed Engn, Hefei 230601, Anhui, Peoples R China
[2] Bengbu First Peoples Hosp, Med Equipment Engn Dept, Bengbu 233000, Anhui, Peoples R China
[3] Anhui Univ, Sch Comp Sci & Technol, Hefei 230601, Anhui, Peoples R China
[4] Nantong Univ, Sch Artificial Intelligence & Comp Sci, Nantong 226019, Jiangsu, Peoples R China
[5] City Univ Macau, Fac Data Sci, Macau 999078, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Chinese Clinical Named Entity Recognition; Multifaceted text representation; Information bottleneck; Hilbert-Schmidt independence criterion; Adversarial training; NETWORKS;
DOI
10.1016/j.asoc.2024.112409
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Chinese Clinical Named Entity Recognition (CCNER) aims to extract entities with specific medical significance from Chinese clinical texts and is an important part of medical data mining. Some existing CCNER models assume perfect text data and rely on complex architectures to improve accuracy. However, because Chinese clinical entity semantics are complex and annotation requires professional expertise, Chinese clinical texts are prone to irregular misrepresentations and sparse entity labeling. As a result, the text features extracted by CCNER models can be noisy or incomplete, which seriously threatens the robustness of recognition in real-world scenarios. To address these problems, we propose the Robust Chinese Clinical Named Entity Recognition model (RCCNER). RCCNER comprises three essential components: multifaceted text representation, robust feature extraction, and robust model training. For multifaceted text representation, the model integrates word embedding, radical embedding, and dictionary embedding to enhance consistency and collaboration between feature representations, which helps it withstand textual noise. Then, guided by the information bottleneck and the Hilbert-Schmidt independence criterion, robust feature extraction compresses the dependency between the text representation and the extracted features while enhancing the dependency between the extracted features and the labels, which provides reliable text features for robust recognition. Robust model training leverages adversarial training to reduce RCCNER's sensitivity to noise disturbances and sparse entity labeling, thereby reinforcing its robustness in entity recognition. Together, text representation, feature extraction, and model training collaboratively enhance the model's noise immunity. Experiments on two popular public datasets validate the effectiveness and robustness of RCCNER.
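The feature-extraction idea in the abstract (reduce the dependency between the input representation and the extracted features, increase the dependency between the features and the labels) can be illustrated with the standard biased empirical estimator of the Hilbert-Schmidt independence criterion. The sketch below is not the authors' implementation: the Gaussian kernel, the bandwidth `sigma`, the trade-off weight `beta`, and the function names are all assumptions chosen for illustration, and the abstract does not state the exact objective RCCNER optimizes.

```python
import numpy as np

def gaussian_kernel(x, sigma=1.0):
    """Gaussian (RBF) kernel matrix over the rows of x (kernel choice is an assumption)."""
    sq = np.sum(x ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * (x @ x.T)
    d2 = np.maximum(d2, 0.0)  # guard against small negative values from rounding
    return np.exp(-d2 / (2.0 * sigma ** 2))

def hsic(x, y, sigma=1.0):
    """Biased empirical HSIC: tr(K H L H) / (n - 1)^2, with H the centering matrix."""
    n = x.shape[0]
    h = np.eye(n) - np.ones((n, n)) / n
    k = gaussian_kernel(x, sigma)
    l = gaussian_kernel(y, sigma)
    return np.trace(k @ h @ l @ h) / (n - 1) ** 2

def hsic_bottleneck_loss(x, z, y, beta=1.0, sigma=1.0):
    """Hypothetical objective: penalize dependence of features z on input x,
    reward dependence of z on labels y. beta is an assumed trade-off weight."""
    return hsic(x, z, sigma) - beta * hsic(z, y, sigma)
```

Here `x`, `z`, and `y` are arrays with one row per sample (for labels, for example a one-hot encoding). In a trainable model the same quantity would be computed on mini-batches with a differentiable array library so that it can serve as a loss term.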
Pages: 15