LSTM-CRF Neural Network With Gated Self Attention for Chinese NER

被引:33
作者
Jin, Yanliang [1 ]
Xie, Jinfei [1 ]
Guo, Weisi [2 ,3 ]
Luo, Can [1 ]
Wu, Dijia [1 ]
Wang, Rui [1 ]
机构
[1] Shanghai Univ, Shanghai Inst Adv Commun & Data Sci, Joint Int Res Lab Specialty Fiber Opt & Adv Commu, Key Lab Specialty Fiber Opt & Opt Access Networks, Shanghai 200444, Peoples R China
[2] Univ Warwick, Sch Engn, Coventry CV4 7AL, W Midlands, England
[3] Alan Turing Inst, London NW1 2DB, England
基金
欧盟地平线“2020”;
关键词
Task analysis; Hidden Markov models; Logic gates; Labeling; Feature extraction; Neural networks; Road transportation; Chinese NER; gating mechanism; highway neural network; self-attention;
D O I
10.1109/ACCESS.2019.2942433
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Named entity recognition (NER) is an essential part of natural language processing tasks. Chinese NER task is different from the many European languages due to the lack of natural delimiters. Therefore, Chinese Word Segmentation (CWS) is usually regarded as the first step of processing Chinese NER. However, the word-based NER models relying on CWS are more vulnerable to incorrectly segmented entity boundaries and the presence of out-of-vocabulary (OOV) words. In this paper, we propose a novel character-based Gated Convolutional Recurrent neural network with Attention called GCRA for Chinese NER task. In particular, we introduce a hybrid convolutional neural network with gating filter mechanism to capture local context information and a highway neural network after LSTM to select characters of interest. The additional gated self-attention mechanism is used to capture the global dependencies from different multiple subspaces and arbitrary adjacent characters. We evaluate the performance of our proposed model on three datasets, including SIGHAN bakeoff 2006 MSRA, Chinese Resume, and Literature NER dataset. The experiment results show that our model outperforms other state-of-the-art models without relying on any external resources like lexicons and multi-task joint training.
引用
收藏
页码:136694 / 136703
页数:10
相关论文
共 45 条
[1]  
Cao P., 2018, P 2018 C EMP METH NA, P182, DOI DOI 10.18653/V1/D18-1017
[2]  
Chen A., 2006, P 5 SIGHAN WORKSH CH, P173
[3]  
Chen YB, 2015, PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1, P167
[4]  
Chiu J.P.C., 2016, Trans. Assoc. Comput. Linguist., V4, P357
[5]  
Collobert R, 2011, J MACH LEARN RES, V12, P2493
[6]  
Dauphin YN, 2017, PR MACH LEARN RES, V70
[7]   Character-Based LSTM-CRF with Radical-Level Features for Chinese Named Entity Recognition [J].
Dong, Chuanhai ;
Zhang, Jiajun ;
Zong, Chengqing ;
Hattori, Masanori ;
Di, Hui .
NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016), 2016, 10102 :239-250
[8]   VITERBI ALGORITHM [J].
FORNEY, GD .
PROCEEDINGS OF THE IEEE, 1973, 61 (03) :268-278
[9]  
Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
[10]  
He Jingzhou, 2008, P 6 SIGHAN WORKSH CH