Low Resource Named Entity Recognition Using Contextual Word Representation and Neural Cross-Lingual Knowledge Transfer

Cited by: 0
Authors
Han, Soyeon Caren [1]
Lin, Yingru [2]
Long, Siqu [1]
Poon, Josiah [1]
Affiliations
[1] Univ Sydney, Sch Informat Technol, 1 Cleveland St,Bldg J12, Sydney, NSW 2006, Australia
[2] Ping An Technol Shen Zhen Co Ltd, 4-f Pingans Mans, Shenzhen 518028, Peoples R China
Source
NEURAL INFORMATION PROCESSING (ICONIP 2019), PT I | 2019 / Vol. 11953
Keywords
Low resource NER; Cross-lingual knowledge transfer
DOI
10.1007/978-3-030-36708-4_25
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Low-resource Named Entity Recognition (NER) can be addressed by transferring knowledge from a high-resource language to a low-resource language through shared multilingual embedding spaces. In this paper, we focus on the extreme low-resource NER scenario of unsupervised cross-lingual knowledge transfer, where no labelled training data or parallel corpus is available. We apply word alignment to contextualised word embeddings and propose an efficient cross-lingual centroid-based space translation mechanism for contextual embeddings. We found that the proposed alignment mechanism works well across different languages compared to current state-of-the-art models. Moreover, word order differences are another problem to be resolved in cross-lingual NER. We alleviate this issue by incorporating a transformer, which relies entirely on an attention mechanism to draw global dependencies between input and output. Our method was evaluated against state-of-the-art results, and the evaluation indicates that our approach is better in terms of both performance and the amount of resources required.
Pages: 299-311
Number of pages: 13