Joint learning of Chinese word segmentation and named entity recognition

被引:0
作者
Huang X. [1 ,2 ]
Qiao L. [1 ]
Yu W. [2 ]
Li J. [1 ]
Xue H. [2 ]
机构
[1] College of Computer Science and Technology, University of Science and Technology of China, Hefei
[2] Luoyang Campus of the Information Engineering University of the Strategic Support Force, Luoyang
来源
Guofang Keji Daxue Xuebao/Journal of National University of Defense Technology | 2021年 / 43卷 / 01期
关键词
Convolutional recurrent neural network; Local spatial features; Time-dependent features; Word segmentation and entity recognition;
D O I
10.11887/j.cn.202101012
中图分类号
学科分类号
摘要
The convolutional structure was introduced into the recurrent neural network to construct a convolutional recurrent neural network. Based on this network, a sequence annotation model for joint learning of Chinese word segmentation and entity recognition was constructed. The model relies on the convolutional recurrent neural network to construct feature-encoding layer, which realizes the joint extraction of local spatial features and long-distance time-dependent features of Chinese character sequences; the improved recurrent neural network was relies on the constructing of tag-decoding layer, which realizes the effective modeling of timing-dependent features in the tag sequences; the unified word segmentation and entity recognition annotation mode relies on the achieving of joint learning of word segmentation information and entity information, which avoids the error propagation problem of traditional pipeline methods. Experimental results on the People's Daily corpus and Microsoft's annotated corpus show that the framework has significant performance improvement over traditional statistical models and neural network models, especially when identifying entities with multiple characters, and its effect is significantly better than other methods. © 2021, NUDT Press. All right reserved.
引用
收藏
页码:86 / 94
页数:8
相关论文
共 15 条
  • [1] LIU Liu, WANG Dongbo, A review on named entity recognition, Journal of the China Society for Scientific and Technical Information, 37, 3, pp. 329-340, (2018)
  • [2] LAFFERTY J D, MCCALLUM A, PEREIRA F C N., Conditional random fields: probabilistic models for segmenting and labeling sequence data, Proceedings of 18th International Conference on Machine Learning, pp. 282-289, (2001)
  • [3] YU Hongkui, ZHANG Huaping, LIU Qun, Et al., Chinese named entity identification using cascaded hidden Markov model, Journal on Communications, 27, 2, pp. 87-94, (2006)
  • [4] SUN Xiao, SUN Chongyuan, REN Fuji, Biomedical named entity recognition based on deep conditional random fields, Pattern Recognition and Artificial Intelligence, 29, 11, pp. 997-1008, (2016)
  • [5] LAMPLE G, BALLESTEROS M, SUBRAMANIAN S, Et al., Neural architectures for named entity recognition, Proceedings of 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 260-270, (2016)
  • [6] ZHANG Hainan, WU Dayong, LIU Yue, Et al., Chinese named entity recognition based on deep neural network, Journal of Chinese Information Processing, 31, 4, pp. 28-35, (2017)
  • [7] FENG Yanhong, YU Hong, SUN Geng, Et al., Named entity recognition method based on BLSTM, Computer Science, 45, 2, pp. 261-268, (2018)
  • [8] ZHANG Y, YANG J., Chinese NER using lattice LSTM, (2018)
  • [9] HOCHREITER S, SCHMIDHUBER J., Long short-term memory, Neural Computation, 9, 8, pp. 1735-1780, (1997)
  • [10] HUANG Z H, XU W, YU K., Bidirectional LSTM-CRF models for sequence tagging, (2015)