Improving Chinese Named Entity Recognition by Interactive Fusion of Contextual Representation and Glyph Representation

被引:6
作者
Gu, Ruiming [1 ]
Wang, Tao [2 ]
Deng, Jianfeng [2 ]
Cheng, Lianglun [1 ]
机构
[1] Guangdong Univ Technol, Sch Comp Sci & Technol, Guangzhou 510006, Peoples R China
[2] Guangdong Univ Technol, Sch Automat, Guangzhou 510006, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 07期
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Chinese named entity recognition; Chinese character glyph; interactive fusion; crossmodal attention; glyph representation;
D O I
10.3390/app13074299
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Named entity recognition (NER) is a fundamental task in natural language processing. In Chinese NER, additional resources such as lexicons, syntactic features and knowledge graphs are usually introduced to improve the recognition performance of the model. However, Chinese characters evolved from pictographs, and their glyphs contain rich semantic information, which is often ignored. Therefore, in order to make full use of the semantic information contained in Chinese character glyphs, we propose a Chinese NER model that combines character contextual representation and glyph representation, named CGR-NER (Character-Glyph Representation for NER). First, CGR-NER uses the large-scale pre-trained language model to dynamically generate contextual semantic representations of characters. Secondly, a hybrid neural network combining a three-dimensional convolutional neural network (3DCNN) and bi-directional long short-term memory network (BiLSTM) is designed to extract the semantic information contained in a Chinese character glyph, the potential word formation knowledge between adjacent glyphs and the contextual semantic and global dependency features of the glyph sequence. Thirdly, an interactive fusion method with a crossmodal attention and gate mechanism is proposed to fuse the contextual representation and glyph representation from different models dynamically. The experimental results show that our proposed model achieves 82.97% and 70.70% F1 scores on the OntoNotes 4 and Weibo datasets. Multiple ablation studies also verify the advantages and effectiveness of our proposed model.
引用
收藏
页数:20
相关论文
共 37 条
[1]   Context-Aware Bidirectional Neural Model for Sindhi Named Entity Recognition [J].
Ali, Wazir ;
Kumar, Jay ;
Xu, Zenglin ;
Kumar, Rajesh ;
Ren, Yazhou .
APPLIED SCIENCES-BASEL, 2021, 11 (19)
[2]  
Burger J.D., 2002, Proceedings of CoNLL-2002, P163
[3]  
Chen W., 2006, PROC 5 SIGHAN WORKS, P118
[4]  
Chiu Jason., 2016, T ASSOC COMPUT LING, DOI DOI 10.1162/TACL_A_00104
[5]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[6]   Multimodal sentiment analysis: A systematic review of history, datasets, multimodal fusion methods, applications, challenges and future directions [J].
Gandhi, Ankita ;
Adhvaryu, Kinjal ;
Poria, Soujanya ;
Cambria, Erik ;
Hussain, Amir .
INFORMATION FUSION, 2023, 91 :424-444
[7]   KGNER: Improving Chinese Named Entity Recognition by BERT Infused with the Knowledge Graph [J].
Hu, Weiwei ;
He, Liang ;
Ma, Hanhan ;
Wang, Kai ;
Xiao, Jingfeng .
APPLIED SCIENCES-BASEL, 2022, 12 (15)
[8]  
Huang ZH, 2015, Arxiv, DOI [arXiv:1508.01991, DOI 10.48550/ARXIV.1508.01991]
[9]   Multi-stage hybrid embedding fusion network for visual question answering [J].
Lao, Mingrui ;
Guo, Yanming ;
Pu, Nan ;
Chen, Wei ;
Liu, Yu ;
Lew, Michael S. .
NEUROCOMPUTING, 2021, 423 :541-550
[10]  
Li JT, 2024, Arxiv, DOI arXiv:2109.07877