Towards Improving Neural Named Entity Recognition with Gazetteers

被引:0
作者
Liu, Tianyu [1 ]
Yao, Jin-Ge [2 ]
Lin, Chin-Yew [2 ]
机构
[1] Peking Univ, Beijing, Peoples R China
[2] Microsoft Res Asia, Beijing, Peoples R China
来源
57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019) | 2019年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most of the recently proposed neural models for named entity recognition have been purely data-driven, with a strong emphasis on getting rid of the efforts for collecting external resources or designing hand-crafted features. This could increase the chance of overfitting since the models cannot access any supervision signal beyond the small amount of annotated data, limiting their power to generalize beyond the annotated entities. In this work, we show that properly utilizing external gazetteers could benefit segmental neural NER models. We add a simple module on the recently proposed hybrid semi-Markov CRF architecture and observe some promising results.
引用
收藏
页码:5301 / 5307
页数:7
相关论文
共 25 条
[1]  
AKBIK A, 2018, P 27 INT C COMP LING, P1638
[2]  
[Anonymous], 2016, P 25 INT JOINT C ART
[3]  
Chiu JP., 2016, Trans. Assoc. Comput. Linguist, V4, P357, DOI [10.1162/tacl_a_00104, DOI 10.1162/TACLA00104, DOI 10.1162/TACL_A_00104]
[4]  
Clark K, 2018, 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), P1914
[5]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[6]  
Erik F., 2003, P 7 C NAT LANG LEARN
[7]  
Ghaddar A., 2018, P 27 INT C COMP LING, P1896
[8]  
Khashabi D, 2018, PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), P541
[9]  
Kong Lingpeng, 2016, INT C LEARN REPR
[10]  
Lafferty J.D., 2001, Proceedings of the Eighteenth International Conference on Machine Learning, P282, DOI [10.1038/nprot.2006.61, DOI 10.1038/NPROT.2006.61]