Character-Based LSTM-CRF with Radical-Level Features for Chinese Named Entity Recognition

被引:193
作者
Dong, Chuanhai [1 ]
Zhang, Jiajun [1 ]
Zong, Chengqing [1 ]
Hattori, Masanori [2 ]
Di, Hui [2 ]
机构
[1] Chinese Acad Sci, Natl Lab Pattern Recognit, Inst Automat, Beijing, Peoples R China
[2] Toshiba China R&D Ctr, Beijing, Peoples R China
来源
NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016) | 2016年 / 10102卷
关键词
BLSTM-CRF; Radical features; Named Entity Recognition;
D O I
10.1007/978-3-319-50496-4_20
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
State-of-the-art systems of Chinese Named Entity Recognition (CNER) require large amounts of hand-crafted features and domain-specific knowledge to achieve high performance. In this paper, we apply a bidirectional LSTM-CRF neural network that utilizes both character-level and radical-level representations. We are the first to use character-based BLSTM-CRF neural architecture for CNER. By contrasting the results of different variants of LSTM blocks, we find the most suitable LSTM block for CNER. We are also the first to investigate Chinese radical-level representations in BLSTM-CRF architecture and get better performance without carefully designed features. We evaluate our system on the third SIGHAN Bakeoff MSRA data set for simplfied CNER task and achieve state-of-the-art performance 90.95% F1.
引用
收藏
页码:239 / 250
页数:12
相关论文
共 28 条
[1]  
[Anonymous], 2006, P 5 SIGHAN WORKSHOP
[2]  
[Anonymous], 2016, ARXIV160306270
[3]   LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT [J].
BENGIO, Y ;
SIMARD, P ;
FRASCONI, P .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02) :157-166
[4]  
Chen A., 2006, P 5 SIGHAN WORKSH CH, P173
[5]  
Chieu HaiLeong., 2002, Proceedings of the 19th international conference on Computational linguistics-, V1, P1
[6]  
Chiu J P C, 2015, T ACL, V4, P1
[7]  
Cho K., 2014, ARXIV, P103, DOI 10.3115/v1/w14-4012
[8]  
Collobert R, 2011, J MACH LEARN RES, V12, P2493
[9]  
Duan Huanzhong, 2011, Int J Adv Intell, V3, P287
[10]  
Fu G., 2005, ACM SIGKDD Explorations Newsletter, V7, P19, DOI DOI 10.1145/1089815.1089819