A Chinese named entity recognition method combined with relative position information

被引:3
作者
Gan, Ling [1 ]
Huang, Chengming [1 ]
机构
[1] Chongqing Univ Posts & Telecommun, Sch Comp Sci & Technol, Chongqing, Peoples R China
来源
2021 ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS TECHNOLOGY AND COMPUTER SCIENCE (ACCTCS 2021) | 2021年
关键词
Chinese named entity recognition; multiple attention; relative position coding; global information; semantic information;
D O I
10.1109/ACCTCS52002.2021.00056
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Named entity recognition is one of the important tasks of natural language processing, which can help people to select entity information from massive text data. Researchers try to use different methods and improve the recognition effect from different perspectives, including machine learning and deep learning methods, and have made great progress in English datasets. However, in Chinese named entity recognition, it is difficult to recognize entity class because of the complexity of semantic environment and the variety of word formation grammar. Therefore, in order to solve this problem, this paper proposes to use the multi-head attention mechanism of relative position, using the difference of relative position encoding between characters of different positions, to extract the feature of full sentence information, so as to make up for the lack of attention of Lattice-LSTM model to the feature information of full sentence, resulting in the weak ability to recognize complex sentences. Experiments on Chinese Weibo dataset, resume dataset, OntoNotes 4.0 dataset and MSRA dataset verify the model in terms of statement complexity and data volume respectively, and the recognition effect is improved. Finally, we find out a better combination of super parameters, which are further improved on the four datasets.
引用
收藏
页码:250 / 254
页数:5
相关论文
共 15 条
[1]  
[Anonymous], 2003, Early results for named entity recognition with conditional random fields, feature induction and web-enhanced lexicons
[2]  
BIKEL D, 1997, 5 C APPL NAT LANG PR, P194
[3]  
Borthwick A., 1998, MUC MUC
[4]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[5]  
Gui T, 2019, PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P4982
[6]  
He H., 2017, ASS COMPUTATIONAL LI, V52, P713
[7]   Log-Linear Model Based Behavior Selection Method for Artificial Fish Swarm Algorithm [J].
Huang, Zhehuang ;
Chen, Yidong .
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2015, 2015
[8]  
Ma XZ, 2016, PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, P1064
[9]  
Meng Yu, 2019, NEURIPS
[10]  
Shaw Peter, 2018, P 2018 C N AM CHAPT, V2, P464, DOI [DOI 10.18653/V1/N18-2074, 10.18653/V1/N18-2074]