Research Progress on Named Entity Recognition in Chinese Deep Learning

被引:2
作者
Li, Li [1 ,2 ]
Xi, Xuefeng [1 ,2 ,3 ]
Sheng, Shengli [4 ]
Cui, Zhiming [1 ,2 ,3 ]
Xu, Jiabao [1 ,2 ]
机构
[1] School of Electronic & Information Engineering, Suzhou University of Science and Technology, Jiangsu, Suzhou
[2] Suzhou Key Laboratory of Virtual Reality Intelligent Interaction and Application Technology, Jiangsu, Suzhou
[3] Suzhou Smart City Research Institute, Suzhou University of Science and Technology, Jiangsu, Suzhou
[4] Texas Institute of Technology, Lubbock, 79401, TX
关键词
Chinese named entity recognition; Chinese nested named entity recognition; deep learning; entity boundary; low resource Chinese named entity recognition;
D O I
10.3778/j.issn.1002-8331.2302-0361
中图分类号
学科分类号
摘要
Chinese named entity recognition(CNER)is the process of identifying and categorizing entities with specific meanings in Chinese text. It is a crucial component in many downstream tasks within natural language processing. In the past few years, deep learning technology has increasingly relied on end-to-end methods to automatically learn more complex and abstract data features, thereby reducing the need for manual annotation and addressing the issue of data sparsity in high-dimensional feature spaces. As a result, deep learning has emerged as the dominant approach for Chinese named entity recognition. This article initially provides an overview of the historical development of named entity recognition and outlines the specific challenges and intricacies associated with Chinese named entity recognition(CNER). It then delves into the distinct processing characteristics of CNER and categorizes deep learning-based methods for CNER into three key areas:flat entity boundary problem, Chinese nested named entity recognition, and CNER small sample problem. The paper offers a detailed description of the models, subdivisions, and recent research progress in each of these areas, and presents experimental results of several noteworthy deep learning methods on relevant datasets. Finally, the article identifies the challenges and future research directions for CNER, and concludes with a summary of commonly used datasets and evaluation methods for Chinese named entity recognition. © 2016 Chinese Medical Journals Publishing House Co.Ltd. All rights reserved.
引用
收藏
页码:46 / 69
页数:23
相关论文
共 120 条
[1]  
HARMAN D., Information retrieval:the early years[J], in Information Retrieval, 13, 5, pp. 425-577, (2019)
[2]  
KUMAR Y, KAUR K, KAUR S., Study of automatic text summarization approaches in different languages[J], Artificial Intelligence Review, 54, 8, pp. 5897-5929, (2021)
[3]  
BI M, ZHANG Q, Et al., Bi-directional long short-term memory model with semantic positional attention for the question answering system[J], Transactions on Asian and Low-Resource Language Information Processing, 20, 5, pp. 1-13, (2021)
[4]  
WANG H, WU H,, HE Z, Et al., Progress in machine translation[J], Engineering, 18, 11, (2021)
[5]  
SONG Q, WU Y, LIN P,, Et al., Mining summaries for knowledge graph search[J], IEEE Transactions on Knowledge and Data Engineering, 30, 10, pp. 1887-1900, (2018)
[6]  
KRUPKA G., SRA:description of the SRA system as used for MUC-6, Sixth Message Understanding Conference (MUC-6):Proceedings of a Conference Held in Columbia, (1995)
[7]  
GAYEN V, SARKAR K., An HMM based named entity recognition system for Indian languages:the JU system at ICON 2013[J], (2014)
[8]  
WALLACH H M., Conditional random fields:an introduction[R], (2004)
[9]  
NADEAU D, SEKINE S., A survey of named entity recognition and classification[J], Lingvisticae Investigationes, 30, 1, pp. 3-26, (2007)
[10]  
ZHAO S, LUO R, CAI Z P,, Et al., Survey of Chinese named entity recognition[J], Journal of Frontiers of Computer Science and Technology, 16, 2, pp. 296-304, (2022)