The interactive fusion of characters and lexical information for Chinese named entity recognition

被引:3
作者
Wang, Ye [1 ]
Wang, Zheng [1 ]
Yu, Hong [1 ]
Wang, Guoyin [1 ,2 ]
Lei, Dajiang [1 ]
机构
[1] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Computat Intelligence, Chongqing 400065, Peoples R China
[2] Chongqing Normal Univ, Chongqing, Peoples R China
基金
中国国家自然科学基金;
关键词
Chinese Named Entity Recognition; Characters and lexical information; Graph attention network; Interactive fusion;
D O I
10.1007/s10462-024-10891-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many studies have demonstrated that incorporating lexical information into characters can effectively improve the performance of Chinese Named Entity Recognition (CNER). However, we argue that previous studies have not extensively explored the interactive relationship between characters and lexical information, and have only used the lexical information to enhance character-level representation. To address this limitation, we propose an interactive fusion approach that integrates characters and lexical information for CNER. Specifically, we first design graph attention networks to initially fuse character and lexical information within an interactive graph structure. Additionally, by introducing methods such as feedforward neural networks, residual connections, and layer normalization, the fusion effect of the graph attention network is further enhanced. Finally, concatenating and reducing dimensionality of character feature vectors and lexical feature vectors to achieve secondary fusion, thereby obtaining a more comprehensive feature representation. Experimental results on multiple datasets demonstrate that our proposed model outperforms other models that fuse lexical information. Particularly, on the CCKS2020 and Ontonotes datasets, our model achieves higher F1 scores than previous state-of-the-art models. The code is available via the link: https://github.com/wangye0523/The-interactive-fusion-of-characters-and-lexical-information-for-Chinese-named-entity-recognition.
引用
收藏
页数:21
相关论文
共 39 条
[1]   Can BERT Dig It? Named Entity Recognition for Information Retrieval in the Archaeology Domain [J].
Brandsen, Alex ;
Verberne, Suzan ;
Lambers, Karsten ;
Wansleeben, Milco .
ACM JOURNAL ON COMPUTING AND CULTURAL HERITAGE, 2022, 15 (03)
[2]  
Chen C, 2021, ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, P20
[3]  
Cheng JR, 2021, KSII T INTERNET INF, V15, P2012
[4]  
Dai Z., 2019, P 12 INT C IM SIGN P, P1, DOI [DOI 10.1109/CISP-BMEI48845.2019.8965823, 10.1109/CISP-BMEI48845.2019.8965823]
[5]  
[邓依依 Deng Yiyi], 2021, [中文信息学报, Journal of Chinese Information Processing], V35, P30
[6]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[7]   A Novel Machine-Learning-Based Hybrid CNN Model for Tumor Identification in Medical Image Processing [J].
Dhiman, Gaurav ;
Juneja, Sapna ;
Viriyasitavat, Wattana ;
Mohafez, Hamidreza ;
Hadizadeh, Maryam ;
Islam, Mohammad Aminul ;
El Bayoumy, Ibrahim ;
Gulati, Kamal .
SUSTAINABILITY, 2022, 14 (03)
[8]   An attentive joint model with transformer-based weighted graph convolutional network for extracting adverse drug event relation [J].
El-allaly, Ed-drissiya ;
Sarrouti, Mourad ;
En-Nahnahi, Noureddine ;
El Alaoui, Said Ouatik .
JOURNAL OF BIOMEDICAL INFORMATICS, 2022, 125
[9]  
Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
[10]  
Gui T, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P1040