A chinese named entity recognition method for small-scale dataset based on lexicon and unlabeled data

被引:0
作者
Shaobin Huang
Yongpeng Sha
Rongsheng Li
机构
[1] Harbin Engineering University,College of Computer Science and Technology
来源
Multimedia Tools and Applications | 2023年 / 82卷
关键词
Named entity recognition; Lexicon information; Unlabeled data; Pre-trained language model;
D O I
暂无
中图分类号
学科分类号
摘要
Recently, using lexicon information to improve the performance of Chinese named entity recognition has been proven to be effective. Moreover, the lexicon-based method represented by Lattice-LSTM has also become the mainstream. Although Lattice-LSTM can introduce lexicon information into characters to augment named entity recognition performance, it cannot make good use of unlabeled data, which contains abundant semantic information to assist the network to improve effect. And because Lattice-LSTM introduces much lexicon information, there is currently no suitable way to assign weights to each word. In this work, we propose a method that can effectively introduce lexicon information, which is also simple to implement and can be applied to various networks. Based on the lexicon method, this method uses external unlabeled data to count the word frequency and improved mutual information to represent the weight of the word to introduce lexicon information. And attention mechanism is used to dynamically assign weights to each part of lexicon information. In this method, the fusion of character and lexicon information is processed before the input layer, so that the method has a faster training speed and better versatility. Compared with other methods that are based on lexicon information, this method introduces additional prior knowledge, namely unlabeled data, and achieves better results when the scale of dataset is small. And when combined with the pre-trained language model, the performance is better (the F1 scores on Weibo dataset and Resume dataset are 96.73% and 71.53% respectively). Experimental research shows that our method surpasses many other excellent baseline methods in training speed and performance on two small-scale public Chinese named entity recognition datasets.
引用
收藏
页码:2185 / 2206
页数:21
相关论文
共 15 条
[1]  
Ali A(2021)A data aggregation based approach to exploit dynamic spatio-temporal correlations for citywide crowd flows prediction in fog computing Multimed Tools Appl 80 31401-31433
[2]  
Zhu Y(2021)Exploiting dynamic spatio-temporal correlations for citywide traffic flow prediction using attention based neural networks Inf Sci 577 852-870
[3]  
Zakarya M(2022)Exploiting dynamic spatio-temporal graph convolutional neural networks for citywide traffic flows prediction Neural Netw 145 233-247
[4]  
Ali A(2011)Natural language processing (almost) from scratch J Mach Learn Res 12 2493-2537
[5]  
Zhu Y(undefined)undefined undefined undefined undefined-undefined
[6]  
Zakarya M(undefined)undefined undefined undefined undefined-undefined
[7]  
Ali A(undefined)undefined undefined undefined undefined-undefined
[8]  
Zhu Y(undefined)undefined undefined undefined undefined-undefined
[9]  
Zakarya M(undefined)undefined undefined undefined undefined-undefined
[10]  
Collobert R(undefined)undefined undefined undefined undefined-undefined