Learn to Cross-lingual Transfer with Meta Graph Learning Across Heterogeneous Languages

被引:0
作者
Li, Zheng [1 ]
Kumar, Mukul [2 ]
Headden, William [2 ]
Yin, Bing [2 ]
Wei, Ying [1 ]
Zhang, Yu [3 ]
Yang, Qiang [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
[2] Amazon Inc, Bellevue, WA USA
[3] Southern Univ Sci & Technol, Dept Comp Sci & Engn, Shenzhen, Peoples R China
来源
PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP) | 2020年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recent emergence of multilingual pretraining language model (mPLM) has enabled breakthroughs on various downstream crosslingual transfer (CLT) tasks. However, mPLM-based methods usually involve two problems: (1) simply fine-tuning may not adapt general-purpose multilingual representations to be task-aware on low-resource languages; (2) ignore how cross-lingual adaptation happens for downstream tasks. To address the issues, we propose a meta graph learning (MGL) method. Unlike prior works that transfer from scratch, MGL can learn to cross-lingual transfer by extracting meta-knowledge from historical CLT experiences (tasks), making mPLM insensitive to low-resource languages. Besides, for each CLT task, MGL formulates its transfer process as information propagation over a dynamic graph, where the geometric structure can automatically capture intrinsic language relationships to guide cross-lingual transfer explicitly. Empirically, extensive experiments on both public and real-world datasets demonstrate the effectiveness of the MGL method.
引用
收藏
页码:2290 / 2301
页数:12
相关论文
共 61 条
[41]  
Shen T, 2018, AAAI CONF ARTIF INTE, P5446
[42]  
Snell J, 2017, ADV NEUR IN, V30
[43]  
Sun SL, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P476
[44]   Learning to Compare: Relation Network for Few-Shot Learning [J].
Sung, Flood ;
Yang, Yongxin ;
Zhang, Li ;
Xiang, Tao ;
Torr, Philip H. S. ;
Hospedales, Timothy M. .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :1199-1208
[45]  
Upadhyay S, 2018, 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), P6034, DOI 10.1109/ICASSP.2018.8461905
[46]  
Velickovic Petar, 2017, Stat, V1050
[47]  
Vinyals Oriol, 2017, Matching Networks for One Shot Learning
[48]  
Wang YX, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P5721
[49]  
Wei Y, 2018, PR MACH LEARN RES, V80
[50]  
Wu Qianhui, 2019, ARXIV191106161