Joint Representation Learning of Cross-lingual Words and Entities via Attentive Distant Supervision

被引:0
|
作者
Cao, Yixin [1 ,2 ]
Hou, Lei [2 ]
Li, Juanzi [2 ]
Liu, Zhiyuan [2 ]
Li, Chengjiang [2 ]
Chen, Xu [2 ]
Dong, Tiansi [3 ]
机构
[1] Natl Univ Singapore, Sch Comp, Singapore, Singapore
[2] Tsinghua Univ, Dept CST, Beijing, Peoples R China
[3] Univ Bonn, B IT, Bonn, Germany
来源
2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018) | 2018年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Joint representation learning of words and entities benefits many NLP tasks, but has not been well explored in cross-lingual settings. In this paper, we propose a novel method for joint representation learning of cross-lingual words and entities. It captures mutually complementary knowledge, and enables cross-lingual inferences among knowledge bases and texts. Our method does not require parallel corpora, and automatically generates comparable data via distant supervision using multi-lingual knowledge bases. We utilize two types of regularizers to align cross-lingual words and entities, and design knowledge attention and crosslingual attention to further reduce noises. We conducted a series of experiments on three tasks: word translation, entity relatedness, and cross-lingual entity linking. The results, both qualitatively and quantitatively, demonstrate the significance of our method.
引用
收藏
页码:227 / 237
页数:11
相关论文
共 50 条
  • [1] Joint Multilingual Supervision for Cross-lingual Entity Linking
    Upadhyay, Shyam
    Gupta, Nitish
    Roth, Dan
    2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 2486 - 2495
  • [2] Unsupervised Cross-Lingual Sentence Representation Learning via Linguistic Isomorphism
    Wang, Shuai
    Hou, Lei
    Tong, Meihan
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2019, PT II, 2019, 11776 : 215 - 226
  • [3] Lightweight Cross-Lingual Sentence Representation Learning
    Mao, Zhuoyuan
    Gupta, Prakhar
    Chu, Chenhui
    Jaggi, Martin
    Kurohashi, Sadao
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 2902 - 2913
  • [4] SimCSum: Joint Learning of Simplification and Cross-lingual Summarization for Cross-lingual Science Journalism
    Fatima, Mehwish
    Kolber, Tim
    Markert, Katja
    Strube, Michael
    NewSumm 2023 - Proceedings of the 4th New Frontiers in Summarization Workshop, Proceedings of EMNLP Workshop, 2023, : 24 - 40
  • [5] Transductive Representation Learning for Cross-Lingual Text Classification
    Guo, Yuhong
    Xiao, Min
    12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2012), 2012, : 888 - 893
  • [6] Unsupervised Cross-lingual Representation Learning for Speech Recognition
    Conneau, Alexis
    Baevski, Alexei
    Collobert, Ronan
    Mohamed, Abdelrahman
    Auli, Michael
    INTERSPEECH 2021, 2021, : 2426 - 2430
  • [7] Learning a Cross-Lingual Semantic Representation of Relations Expressed in Text
    Rettinger, Achim
    Schumilin, Artem
    Thoma, Steffen
    Ell, Basil
    SEMANTIC WEB: LATEST ADVANCES AND NEW DOMAINS, ESWC 2015, 2015, 9088 : 337 - 352
  • [8] Cross-Lingual Sentiment Classification with Bilingual Document Representation Learning
    Zhou, Xinjie
    Wan, Xianjun
    Xiao, Jianguo
    PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2016, : 1403 - 1412
  • [9] Cross-lingual Entity Alignment with Incidental Supervision
    Chen, Muhao
    Shi, Weijia
    Zhou, Ben
    Roth, Dan
    16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 645 - 658
  • [10] Combining Cross-lingual and Cross-task Supervision for Zero-Shot Learning
    Pikuliak, Matus
    Simko, Marian
    TEXT, SPEECH, AND DIALOGUE (TSD 2020), 2020, 12284 : 162 - 170