Neural Cross-Lingual Named Entity Recognition with Minimal Resources

被引:0
作者
Xie, Jiateng [1 ]
Yang, Zhilin [1 ]
Neubig, Graham [1 ]
Smith, Noah A. [2 ,3 ]
Carbonell, Jaime [1 ]
机构
[1] Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
[2] Univ Washington, Paul G Allen Sch Comp Sci & Engn, Seattle, WA 98195 USA
[3] Allen Inst Artificial Intelligence, Seattle, WA USA
来源
2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018) | 2018年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For languages with no annotated resources, unsupervised transfer of natural language processing models such as named-entity recognition (NER) from resource-rich languages would be an appealing capability. However, differences in words and word order across languages make it a challenging problem. To improve mapping of lexical items across languages, we propose a method that finds translations based on bilingual word embeddings. To improve robustness to word order differences, we propose to use self-attention, which allows for a degree of flexibility with respect to word order. We demonstrate that these methods achieve state-of-the-art or competitive NER performance on commonly tested languages under a cross-lingual setting, with much lower resource requirements than past approaches. We also evaluate the challenges of applying these methods to Uyghur, a lowresource language.(1)
引用
收藏
页码:369 / 379
页数:11
相关论文
共 54 条
  • [1] Ammar W., 2016, TACL, V4, P431
  • [2] Ammar W, 2016, Arxiv, DOI arXiv:1602.01925
  • [3] [Anonymous], 2013, Transactions of the Association for Computational Linguistics
  • [4] [Anonymous], 2018, P 2018 C N AM CHAPT, DOI [10.18653/v1/N18-1202, DOI 10.18653/V1/N18-1202]
  • [5] [Anonymous], 2013, Bilingual word embeddings for phrasebased machine translation
  • [6] [Anonymous], 2011, P 49 ANN M ASS COMPU
  • [7] Learning bilingual word embeddings with (almost) no bilingual data
    Artetxe, Mikel
    Labaka, Gorka
    Agirre, Eneko
    [J]. PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 451 - 462
  • [8] Artetxe Mikel, 2016, P 2016 C EMPIRICAL M, P2289, DOI [DOI 10.18653/V1/D16-1250, 10.18653/v1/D16-1250]
  • [9] Bharadwaj Akash., 2016, P EMNLP, P1462, DOI DOI 10.18653/V1/D16-1153
  • [10] Bojanowski P, 2017, Transactions of the Association for Computational Linguistics, V5, P135, DOI [10.1162/tacla00051, DOI 10.1162/TACLA00051, 10.1162/tacl_a_00051]