Domain Named Entity Recognition Method Based on Skip-gram Model

被引:0
作者
Feng Yan-hong [1 ]
Yu Hong [1 ]
Sun Geng [1 ]
Yu Xun-ran [2 ]
机构
[1] Dalian Ocean Univ, Coll Informat Engn, Dalian, Peoples R China
[2] Univ Int Business & Econ, Sch Int Trade & Econ, Beijing, Peoples R China
来源
PROCEEDINGS FIRST INTERNATIONAL CONFERENCE ON ELECTRONICS INSTRUMENTATION & INFORMATION SYSTEMS (EIIS 2017) | 2017年
关键词
domain named entity recognition; Skip-gram; semantic meaning; word embedding; domain characteristics;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Traditional domain named entity recognition (NER) methods mainly depended on manual features and were implemented by machine learning methods. These features have no capability to express semantic meaning and these methods are very sensitive for artificial features. To resolve these problems, a method based on Skip-gram model is proposed in this paper. In this method, using word embedding with semantic meaning as features, named entity recognition problem is straightly modeled as Skip-gram model, so it achieves end-to-end solution. Domain characteristics are integrated into this model for further improvement in result. The experiment is carried on Sogou and domain corpus. It shows that the proposed method can improve Recall, Precision and F measure of domain named entity recognition.
引用
收藏
页码:510 / 514
页数:5
相关论文
共 13 条
[1]  
[Anonymous], 2013, PROC 1 INT C LEARN R
[2]   A neural probabilistic language model [J].
Bengio, Y ;
Ducharme, R ;
Vincent, P ;
Jauvin, C .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (06) :1137-1155
[3]  
Collobert R, 2011, J MACH LEARN RES, V12, P2493
[4]  
Li Li-shuang, 2016, Journal of Chinese Computer Systems, V37, P302
[5]  
[李丽双 Li Lishuang], 2015, [中文信息学报, Journal of Chinese Information Processing], V29, P82
[6]  
[栗伟 Li Wei], 2015, [计算机应用研究, Application Research of Computers], V32, P1082
[7]  
Liu Tao, 2007, Acta Electronica Sinica, V35, P328
[8]  
Mikolov T., 2013, ADV NEURAL INFORM PR, P3111
[9]  
OAKES MP, 2001, RECENT ADV COMPUTATI, P353
[10]  
Pan Y J, 2007, DICT FISHERIES, P1