A review of Chinese named entity recognition

被引:25
作者
Cheng, Jieren [1 ,2 ]
Liu, Jingxin [1 ]
Xu, Xinbin [1 ]
Xia, Dongwan [1 ]
Liu, Le [1 ]
Sheng, Victor S. [3 ]
机构
[1] Hainan Univ, Sch Compute Sci & Cyberspace Secur, Haikou 570228, Hainan, Peoples R China
[2] Hainan Univ, Hainan Blockchain Technol Engn Res Ctr, Haikou 570228, Hainan, Peoples R China
[3] Texas Tech Univ, Dept Comp Sci, Lubbock, TX 79409 USA
来源
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS | 2021年 / 15卷 / 06期
基金
海南省自然科学基金; 中国国家自然科学基金;
关键词
Chinese word segmentation; Deep learning; Machine learning; Model framework; Named entity recognition; NEURAL-NETWORK; EXTRACTION; ATTENTION; CRF;
D O I
10.3837/tiis.2021.06.004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Named Entity Recognition (NER) is used to identify entity nouns in the corpus such as Location, Person and Organization, etc. NER is also an important basic of research in various natural language fields. The processing of Chinese NER has some unique difficulties, for example, there is no obvious segmentation boundary between each Chinese character in a Chinese sentence. The Chinese NER task is often combined with Chinese word segmentation, and so on. In response to these problems, we summarize the recognition methods of Chinese NER. In this review, we first introduce the sequence labeling system and evaluation metrics of NER. Then, we divide Chinese NER methods into rule-based methods, statistics-based machine learning methods and deep learning-based methods. Subsequently, we analyze in detail the model framework based on deep learning and the typical Chinese NER methods. Finally, we put forward the current challenges and future research directions of Chinese NER technology.
引用
收藏
页码:2012 / 2030
页数:19
相关论文
共 85 条
  • [1] Akbik A, 2018, P 27 INT C COMP LING, P1638
  • [2] Cao P., 2018, P 2018 C EMP METH NA, P182, DOI DOI 10.18653/V1/D18-1017
  • [3] Chen H, 2019, AAAI CONF ARTIF INTE, P6236
  • [4] Inside Importance Factors of Graph-Based Keyword Extraction on Chinese Short Text
    Chen, Junjie
    Hou, Hongxu
    Gao, Jing
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2020, 19 (05)
  • [5] Chen S. D, 2020, Radio Commun. Technol, V46, P251
  • [6] Association between lipid profiles and osteoporosis in postmenopausal women: a meta-analysis
    Chen, Y. -Y.
    Wang, W. -W.
    Yang, L.
    Chen, W. -W.
    Zhang, H. -X.
    [J]. EUROPEAN REVIEW FOR MEDICAL AND PHARMACOLOGICAL SCIENCES, 2018, 22 (01) : 1 - 9
  • [7] Named entity recognition from Chinese adverse drug event reports with lexical feature based BiLSTM-CRF and tri-training
    Chen, Yao
    Zhou, Changjiang
    Li, Tianxin
    Wu, Hong
    Zhao, Xia
    Ye, Kai
    Liao, Jun
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2019, 96
  • [8] Generative Adversarial Networks: A Literature Review
    Cheng, Jieren
    Yang, Yue
    Tang, Xiangyan
    Xiong, Naixue
    Zhang, Yuan
    Lei, Feifei
    [J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2020, 14 (12): : 4625 - 4647
  • [9] Chinchor N., 1997, P 7 C MESSAGE UNDERS, V29, P1
  • [10] Dai Z., 2019, 2019 12 INT C IM SIG, P1, DOI [10.1109/CISP-BMEI48845.2019.8965823, DOI 10.1109/CISP-BMEI48845.2019.8965823, 10.1109/CISP-BMEI48845]