Survey of Chinese Named Entity Recognition Research

被引:0
作者
Zhao, Jigui [1 ,2 ,3 ]
Qian, Yurong [1 ,2 ,3 ]
Wang, Kui [4 ]
Hou, Shuxiang [2 ,3 ,5 ]
Chen, Jiaying [1 ,2 ,3 ]
机构
[1] School of Software, Xinjiang University, Urumqi
[2] Key Laboratory of Signal Detection and Processing in Xinjiang Uygur Autonomous Region, Urumqi
[3] Key Laboratory of Software Engineering, Xinjiang University, Urumqi
[4] School of Economics and Management, University of the Chinese Academy of Sciences, Beijing
[5] School of Information Science and Engineering, Xinjiang University, Urumqi
关键词
Chinese named entity recognition (CNER); deep learning; machine learning; natural language processing; pre-training models;
D O I
10.3778/j.issn.1002-8331.2304-0398
中图分类号
学科分类号
摘要
Named entity recognition (NER) is one of the most fundamental tasks in natural language processing, and its main content is to identify the entity types and boundaries with specific meanings in natural language text. However, the data samples of Chinese named entity recognition (CNER) have problems such as blurred word boundaries, semantic diversity, blurred morphological features and small Chinese corpus content, which make it difficult to improve the performance of Chinese NER. In this paper, firstly, the dataset, annotation scheme and evaluation index of CNER are introduced. Secondly, according to the research process of CNER, CNER methods are classified into three categories: rule-based methods, statistical-based methods and deep learning-based methods, and the main models of CNER based on deep learning in the past five years are summarized. Finally, the research trends of CNER are discussed to provide some reference for the proposal of new methods and future research directions. © 2024 Journal of Computer Engineering and Applications Beijing Co., Ltd.; Science Press. All rights reserved.
引用
收藏
页码:15 / 27
页数:12
相关论文
共 104 条
[1]  
GRISHMAN R, SUNDHEIM B., Message Understanding Conference- 6: a brief history, COLING 1996 Volume 1: The 16th International Conference on Computational Linguistics, pp. 466-471, (1996)
[2]  
KHALID M A, JIJKOUN V, DE RIJKE M., The impact of named entity normalization on information retrieval for question answering, Advances in Information Retrieval: 30th European Conference on IR Research (ECIR 2008), pp. 705-710, (2008)
[3]  
RIEDEL S, YAO L, MCCALLUM A, Et al., Relation extraction with matrix factorization and universal schemas, Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 74-84, (2013)
[4]  
DIEFENBACH D, LOPEZ V, SINGH K, Et al., Core techniques of question answering systems over knowledge bases: a survey, Knowledge and Information Systems, 55, pp. 529-569, (2018)
[5]  
WANG Z, CUI X, GAO L, Et al., A hybrid model of sentimental entity recognition on mobile social media, EURASIP Journal on Wireless Communications and Networking, 2016, 1, pp. 1-12, (2016)
[6]  
SETTLES B., Biomedical named entity recognition using conditional random fields and rich feature sets, Proceedings of the International Joint Workshop on Natural Language Processing in Biomedicine and Its Applications, pp. 107-110, (2004)
[7]  
LUO Y Y, YANG C M, LI B, Et al., Chinese medical named entity recognition method incorporating machine reading comprehension, Computer Science, 50, 9, pp. 287-294, (2023)
[8]  
KARATAY D, KARAGOZ P., User interest modeling in twitter with named entity recognition, 5th Workshop on Making Sense of Microposts, pp. 35-38, (2015)
[9]  
LIU K, YU Q S, ZHONG S Y., Chinese named entity recognition based on bi-directional quasi-recurrent neural networks improved with BERT: new method to solve Chinese NER, 2021 the 5th International Conference on Innovation in Artificial Intelligence, pp. 15-19, (2021)
[10]  
WANG W, CHANG L, BIN C, Et al., ESN-NER: entity storage network using attention mechanism for Chinese NER, Proceedings of the International Conference on Artificial Intelligence, Information Processing and Cloud Computing, pp. 1-8, (2019)