Learn to Cross-lingual Transfer with Meta Graph Learning Across Heterogeneous Languages

被引:0
作者
Li, Zheng [1 ]
Kumar, Mukul [2 ]
Headden, William [2 ]
Yin, Bing [2 ]
Wei, Ying [1 ]
Zhang, Yu [3 ]
Yang, Qiang [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[2] Amazon Inc, Bellevue, WA USA
[3] Southern Univ Sci & Technol, Dept Comp Sci & Engn, Shenzhen, Peoples R China
来源
PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP) | 2020年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recent emergence of multilingual pretraining language model (mPLM) has enabled breakthroughs on various downstream crosslingual transfer (CLT) tasks. However, mPLM-based methods usually involve two problems: (1) simply fine-tuning may not adapt general-purpose multilingual representations to be task-aware on low-resource languages; (2) ignore how cross-lingual adaptation happens for downstream tasks. To address the issues, we propose a meta graph learning (MGL) method. Unlike prior works that transfer from scratch, MGL can learn to cross-lingual transfer by extracting meta-knowledge from historical CLT experiences (tasks), making mPLM insensitive to low-resource languages. Besides, for each CLT task, MGL formulates its transfer process as information propagation over a dynamic graph, where the geometric structure can automatically capture intrinsic language relationships to guide cross-lingual transfer explicitly. Empirically, extensive experiments on both public and real-world datasets demonstrate the effectiveness of the MGL method.
引用
收藏
页码:2290 / 2301
页数:12
相关论文
共 61 条
[1]   Language-Agnostic Representation Learning for Product Search on E-Commerce Platforms [J].
Ahuja, Aman ;
Rao, Nikhil ;
Katariya, Sumeet ;
Subbian, Karthik ;
Reddy, Chandan K. .
PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM '20), 2020, :7-15
[2]  
Andrychowicz M, 2016, ADV NEUR IN, V29
[3]  
[Anonymous], 2001, Proceedings of HLT2001, First International Conference on Human Language Technology Research
[4]  
[Anonymous], 2007, P 45 ANN M ASS COMP
[5]  
[Anonymous], 2009, ACL, P235
[6]  
[Anonymous], 2018, Transactions of the Association for Computational Linguistics, DOI DOI 10.1162/TACL_A_00039
[7]  
[Anonymous], 2018, ARXIV180808933
[8]  
[Anonymous], 2017, ARXIV170703141
[9]  
Artetxe M, 2018, PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, P789
[10]  
Bao Yujia, 2020, ICLR