A review of Chinese named entity recognition

被引:24
作者
Cheng, Jieren [1 ,2 ]
Liu, Jingxin [1 ]
Xu, Xinbin [1 ]
Xia, Dongwan [1 ]
Liu, Le [1 ]
Sheng, Victor S. [3 ]
机构
[1] Hainan Univ, Sch Compute Sci & Cyberspace Secur, Haikou 570228, Hainan, Peoples R China
[2] Hainan Univ, Hainan Blockchain Technol Engn Res Ctr, Haikou 570228, Hainan, Peoples R China
[3] Texas Tech Univ, Dept Comp Sci, Lubbock, TX 79409 USA
来源
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS | 2021年 / 15卷 / 06期
基金
中国国家自然科学基金; 海南省自然科学基金;
关键词
Chinese word segmentation; Deep learning; Machine learning; Model framework; Named entity recognition; NEURAL-NETWORK; EXTRACTION; ATTENTION; CRF;
D O I
10.3837/tiis.2021.06.004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Named Entity Recognition (NER) is used to identify entity nouns in the corpus such as Location, Person and Organization, etc. NER is also an important basic of research in various natural language fields. The processing of Chinese NER has some unique difficulties, for example, there is no obvious segmentation boundary between each Chinese character in a Chinese sentence. The Chinese NER task is often combined with Chinese word segmentation, and so on. In response to these problems, we summarize the recognition methods of Chinese NER. In this review, we first introduce the sequence labeling system and evaluation metrics of NER. Then, we divide Chinese NER methods into rule-based methods, statistics-based machine learning methods and deep learning-based methods. Subsequently, we analyze in detail the model framework based on deep learning and the typical Chinese NER methods. Finally, we put forward the current challenges and future research directions of Chinese NER technology.
引用
收藏
页码:2012 / 2030
页数:19
相关论文
共 85 条
[1]  
Akbik A., 2018, P 27 INT C COMP LING, P1638
[2]  
[Anonymous], 2016, P 2016 C EMP METH NA
[3]  
[Anonymous], 2018, P 2018 C N AM CHAPT, DOI DOI 10.18653/V1/N18-1131
[4]  
[Anonymous], 2015, COMPUTER SCI
[5]  
Cao P., 2018, P 2018 C EMP METH NA, P182, DOI [DOI 10.18653/V1/D18-1017, 10.18653/v1/d18-1017]
[6]  
Chen H, 2019, AAAI CONF ARTIF INTE, P6236
[7]   Inside Importance Factors of Graph-Based Keyword Extraction on Chinese Short Text [J].
Chen, Junjie ;
Hou, Hongxu ;
Gao, Jing .
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2020, 19 (05)
[8]  
Chen S., 2020, Radio Communications Technology, V46, P251
[9]   Association between lipid profiles and osteoporosis in postmenopausal women: a meta-analysis [J].
Chen, Y. -Y. ;
Wang, W. -W. ;
Yang, L. ;
Chen, W. -W. ;
Zhang, H. -X. .
EUROPEAN REVIEW FOR MEDICAL AND PHARMACOLOGICAL SCIENCES, 2018, 22 (01) :1-9
[10]   Named entity recognition from Chinese adverse drug event reports with lexical feature based BiLSTM-CRF and tri-training [J].
Chen, Yao ;
Zhou, Changjiang ;
Li, Tianxin ;
Wu, Hong ;
Zhao, Xia ;
Ye, Kai ;
Liao, Jun .
JOURNAL OF BIOMEDICAL INFORMATICS, 2019, 96