Joint Representation Learning of Cross-lingual Words and Entities via Attentive Distant Supervision

被引:0
作者
Cao, Yixin [1 ,2 ]
Hou, Lei [2 ]
Li, Juanzi [2 ]
Liu, Zhiyuan [2 ]
Li, Chengjiang [2 ]
Chen, Xu [2 ]
Dong, Tiansi [3 ]
机构
[1] Natl Univ Singapore, Sch Comp, Singapore, Singapore
[2] Tsinghua Univ, Dept CST, Beijing, Peoples R China
[3] Univ Bonn, B IT, Bonn, Germany
来源
2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018) | 2018年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Joint representation learning of words and entities benefits many NLP tasks, but has not been well explored in cross-lingual settings. In this paper, we propose a novel method for joint representation learning of cross-lingual words and entities. It captures mutually complementary knowledge, and enables cross-lingual inferences among knowledge bases and texts. Our method does not require parallel corpora, and automatically generates comparable data via distant supervision using multi-lingual knowledge bases. We utilize two types of regularizers to align cross-lingual words and entities, and design knowledge attention and crosslingual attention to further reduce noises. We conducted a series of experiments on three tasks: word translation, entity relatedness, and cross-lingual entity linking. The results, both qualitatively and quantitatively, demonstrate the significance of our method.
引用
收藏
页码:227 / 237
页数:11
相关论文
共 50 条
[21]   MetaXL: Meta Representation Transformation for Low-resource Cross-lingual Learning [J].
Xia, Mengzhou ;
Zheng, Guoqing ;
Mukherjee, Subhabrata ;
Shokouhi, Milad ;
Neubig, Graham ;
Awadallah, Ahmed Hassan .
2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, :499-511
[22]   A Learning to rank framework based on cross-lingual loss function for cross-lingual information retrieval [J].
Elham Ghanbari ;
Azadeh Shakery .
Applied Intelligence, 2022, 52 :3156-3174
[23]   Cross-lingual Supervision Improves Unsupervised Neural Machine Translation [J].
Wang, Mingxuan ;
Bai, Hongxiao ;
Zhao, Hai ;
Li, Lei .
2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, NAACL-HLT 2021, 2021, :89-96
[24]   Cross-Lingual Entity Alignment via Joint Attribute-Preserving Embedding [J].
Sun, Zequn ;
Hu, Wei ;
Li, Chengkai .
SEMANTIC WEB - ISWC 2017, PT I, 2017, 10587 :628-644
[25]   FuAlign: Cross-lingual entity alignment via multi-view representation learning of fused knowledge graphs [J].
Wang, Chenxu ;
Huang, Zhenhao ;
Wan, Yue ;
Wei, Junyu ;
Zhao, Junzhou ;
Wang, Pinghui .
INFORMATION FUSION, 2023, 89 :41-52
[26]   Distant Supervision via Prototype-Based Global Representation Learning [J].
Han, Xianpei ;
Sun, Le .
THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, :3443-3449
[27]   Self-Attention with Cross-Lingual Position Representation [J].
Ding, Liang ;
Wang, Longyue ;
Tao, Dacheng .
58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, :1679-1685
[28]   Detecting Fine-Grained Cross-Lingual Semantic Divergences without Supervision by Learning to Rank [J].
Briakou, Eleftheria ;
Carpuat, Marine .
PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, :1563-1580
[29]   Cross-lingual Linking of Multi-word Entities and their corresponding Acronyms [J].
Jacquet, Guillaume ;
Ehrmann, Maud ;
Steinberger, Ralf ;
Vayrynen, Jaakko .
LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2016, :528-535
[30]   Enhancing Cross-lingual Natural Language Inference by Prompt-learning from Cross-lingual Templates [J].
Qi, Kunxun ;
Wan, Hai ;
Du, Jianfeng ;
Chen, Haolan .
PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, :1910-1923