TL-NER: A Transfer Learning Model for Chinese Named Entity Recognition

被引:23
作者
Peng, DunLu [1 ]
Wang, YinRui [1 ]
Liu, Cong [1 ]
Chen, Zhang [1 ]
机构
[1] Univ Shanghai Sci & Technol, Sch Opt Elect & Comp Engn, Shanghai 200093, Peoples R China
基金
中国国家自然科学基金;
关键词
Transfer learning; Chinese named entity recognition; Natural language processing; Deep learning; IDENTIFICATION;
D O I
10.1007/s10796-019-09932-y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most of the current research on Named Entity Recognition (NER) in the Chinese domain is based on the assumption that annotated data are adequate. However, in many scenarios, the sufficient amount of annotated data required for Chinese NER task is difficult to obtain, resulting in poor performance of machine learning methods. In view of this situation, this paper tries to excavate the information contained in the massive unlabeled raw text data and utilize it to enhance the performance of Chinese NER task. A deep learning model combined with Transfer Learning technique is proposed in this paper. This method can be leveraged in some domains where there is a large amount of unlabeled text data and a small amount of annotated data. The experiment results show that the proposed method performs well on different sized datasets, and this method also avoids errors that occur during the word segmentation process. We also evaluate the effect of transfer learning from different aspects through a series of experiments.
引用
收藏
页码:1291 / 1304
页数:14
相关论文
共 55 条
[1]  
[Anonymous], 2015, IEEE INT C COMPUTER
[2]  
[Anonymous], 2018, P 32 AAAI C ART INT
[3]   Information transfers and learning in financial markets: Evidence from short selling around insider sales [J].
Chakrabarty, Bidisha ;
Shkilko, Andriy .
JOURNAL OF BANKING & FINANCE, 2013, 37 (05) :1560-1572
[4]  
Che W., 2013, The ACL, P52
[5]  
Chiu J. P. C., 2015, Trans. Assoc. Comput. Linguist, DOI [10.1162/tacl_a_00104, DOI 10.1162/TACLA00104]
[6]  
Cho K., 2014, P EMPIRICAL METHODS, P1724, DOI DOI 10.3115/V1/D14-1179
[7]  
Collobert R, 2011, J MACH LEARN RES, V12, P2493
[8]   Analysis of named entity recognition and linking for tweets [J].
Derczynski, Leon ;
Maynard, Diana ;
Rizzo, Giuseppe ;
van Erp, Marieke ;
Gorrell, Genevieve ;
Troncy, Raphael ;
Petrak, Johann ;
Bontcheva, Kalina .
INFORMATION PROCESSING & MANAGEMENT, 2015, 51 (02) :32-49
[9]   De-identification of patient notes with recurrent neural networks [J].
Dernoncourt, Franck ;
Lee, Ji Young ;
Uzuner, Ozlem ;
Szolovits, Peter .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2017, 24 (03) :596-606
[10]   Character-Based LSTM-CRF with Radical-Level Features for Chinese Named Entity Recognition [J].
Dong, Chuanhai ;
Zhang, Jiajun ;
Zong, Chengqing ;
Hattori, Masanori ;
Di, Hui .
NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016), 2016, 10102 :239-250