Low Resource Named Entity Recognition Using Contextual Word Representation and Neural Cross-Lingual Knowledge Transfer

被引:0
作者
Han, Soyeon Caren [1 ]
Lin, Yingru [2 ]
Long, Siqu [1 ]
Poon, Josiah [1 ]
机构
[1] Univ Sydney, Sch Informat Technol, 1 Cleveland St,Bldg J12, Sydney, NSW 2006, Australia
[2] Ping An Technol Shen Zhen Co Ltd, 4-f Pingans Mans, Shenzhen 518028, Peoples R China
来源
NEURAL INFORMATION PROCESSING (ICONIP 2019), PT I | 2019年 / 11953卷
关键词
Low resource NER; Cross-lingual knowledge transfer;
D O I
10.1007/978-3-030-36708-4_25
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Low resource Named Entity Recognition can be solved by transferring knowledge from a high to a low-resource language with shared multilingual embedding spaces. In this paper, we focus on the extreme low-resource NER scenario of unsupervised cross-lingual knowledge transfer, where no labelled training data or parallel corpus is available. We apply word-alignment with the contextualised word embedding and propose an efficient cross-lingual centroid-based space translation mechanism for contextual embedding. We found that the proposed alignment mechanism works well between different languages, compared to current state-of-the-art models. Moreover, word order differences is another problem to be resolved in cross-lingual NER. We alleviate this issue by incorporating a transformer, which relies entirely on an attention mechanism to draw global dependency between input and output. Our method was evaluated against state-of-the-art results, and it indicate that our approach was better in terms of the performance and the amount of resources.
引用
收藏
页码:299 / 311
页数:13
相关论文
共 32 条
[11]  
Lafferty J., 2001, PROC ICML
[12]  
Lample Guillaume., 2018, Word translation without parallel data
[13]  
Li Qi, 2012, P 21 ACM INT C INF K, P1727
[14]  
Liu LY, 2018, AAAI CONF ARTIF INTE, P5253
[15]  
Liu Y., 2019, P 57 ANN M ASS COMP
[16]  
Luong T., 2015, PROCEEDINGS, P1412, DOI DOI 10.18653/V1/D15-1166
[17]  
Ma XZ, 2016, PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, P1064
[18]  
Mayhew S., 2017, Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, EMNLP 2017, P2536
[19]  
Mikolov T., 2013, 13094168 CORR
[20]  
Mikolov Tomas, 2013, P 1 INT C LEARN REPR