Dynamically Transfer Entity Span Information for Cross-domain Chinese Named Entity Recognition

被引:0
作者
Wu B.-C. [1 ,3 ]
Deng C.-L. [1 ,3 ]
Guan B. [1 ]
Chen X.-L. [1 ,3 ]
Zan D.-G. [1 ,3 ]
Chang Z.-J. [4 ]
Xiao Z.-Y. [5 ]
Qu D.-C. [5 ]
Wang Y.-J. [1 ,2 ,3 ]
机构
[1] Collaborative Innovation Center, Institute of Software, Chinese Academy of Sciences, Beijing
[2] State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences, Beijing
[3] University of Chinese Academy of Sciences, Beijing
[4] National Science Library, Chinese Academy of Sciences, Beijing
[5] School of Computer Science and Technology, Beijing Institute of Technology, Beijing
来源
Ruan Jian Xue Bao/Journal of Software | 2022年 / 33卷 / 10期
关键词
bidirectional long short-term memory (BiLSTM) neural network; cross-domain; dynamic fusion; named entity recognition (NER); transfer learning;
D O I
10.13328/j.cnki.jos.006305
中图分类号
学科分类号
摘要
Boundaries identification of Chinese named entities is a difficult problem because of no separator between Chinese texts. Furthermore, the lack of well-marked NER data makes Chinese named entity recognition (NER) tasks more challenging in vertical domains, such as clinical domain and financial domain. To address aforementioned issues, this study proposes a novel cross-domain Chinese NER model by dynamically transferring entity span information (TES-NER). The cross-domain shared entity span information is transferred from the general domain (source domain) with sufficient corpus to the Chinese NER model on the vertical domain (target domain) through a dynamic fusion layer based on the gate mechanism, where the entity span information is used to represent the scope of the Chinese named entities. Specifically, TES-NER first introduces a cross-domain shared entity span recognition module based on a bidirectional long short-term memory (BiLSTM) layer and a fully connected neural network (FCN) which are used to identify the cross-domain shared entity span information to determine the boundaries of the Chinese named entities. Then, a Chinese NER module is constructed to identify the domain-specific Chinese named entities by applying independent BiLSTM with conditional random field models (BiLSTM-CRF). Finally, a dynamic fusion layer is designed to dynamically determine the amount of the cross-domain shared entity span information extracted from the entity span recognition module, which is used to transfer the knowledge to the domain-specific NER model through the gate mechanism. This study sets the general domain (source domain) dataset as the news domain dataset (MSRA) with sufficient labeled corpus, while the vertical domain (target domain) datasets are composed of three datasets: Mixed domain (OntoNotes 5.0), financial domain (Resume), and medical domain (CCKS 2017). Among them, the mixed domain dataset (OntoNotes 5.0) is a corpus integrating six different vertical domains. The F1 values of the model proposed in this study are 2.18%, 1.68%, and 0.99% higher than BiLSTM-CRF, respectively. © 2022 Chinese Academy of Sciences. All rights reserved.
引用
收藏
页码:3776 / 3792
页数:16
相关论文
共 34 条
[1]  
Behrang M., Named Entity Recognition, (2014)
[2]  
Yang JF, Yu QB, Guan Y, Jiang ZP., An overview of research on electronic medical record oriented named entity recognition and entity relation extraction, Acta Automatica Sinica, 40, 8, pp. 1537-1562, (2014)
[3]  
Huang Z, Xu W, Yu K., Bidirectional LSTM-CRF models for sequence tagging, (2015)
[4]  
Ma X, Hovy E., End-to-end sequence labeling via bi-directional LSTM-CNNs-CRF, Proc. of the 54th Annual Meeting of the Association for Computational Linguistics (Vol.1: Long Papers), pp. 1064-1074, (2016)
[5]  
Wu F, Liu J, Wu C, Et al., Neural Chinese named entity recognition via CNN-LSTM-CRF and joint training with word segmentation, Proc. of the World Wide Web Conf, pp. 3342-3348, (2019)
[6]  
Jia Y, Xu X., Chinese named entity recognition based on CNN-BiLSTM-CRF, Proc. of the 9th IEEE Int’l Conf. on Software Engineering and Service Science (ICSESS), pp. 1-4, (2018)
[7]  
Zhong Q, Tang Y., An attention-based BiLSTM-CRF for Chinese named entity recognition, Proc. of the 5th IEEE Int’l Conf. on Cloud Computing and Big Data Analytics (ICCCBDA), pp. 550-555, (2020)
[8]  
Zhang HN, Wu DY, Liu Y, Et al., Chinese named entity recognition based on deep neural network, Journal of Chinese Information Processing, 31, 4, pp. 28-35, (2017)
[9]  
Pan SJ, Yang Q., A survey on transfer learning, IEEE Trans. on Knowledge and Data Engineering, 22, 10, pp. 1345-1359, (2009)
[10]  
Bender O, Och FJ, Ney H., Maximum entropy models for named entity recognition, Proc. of the 7th Conf. on Natural Language Learning at HLT-NAACL 2003, pp. 148-151, (2003)