Joint Representation Learning of Cross-lingual Words and Entities via Attentive Distant Supervision

被引:0
作者
Cao, Yixin [1 ,2 ]
Hou, Lei [2 ]
Li, Juanzi [2 ]
Liu, Zhiyuan [2 ]
Li, Chengjiang [2 ]
Chen, Xu [2 ]
Dong, Tiansi [3 ]
机构
[1] Natl Univ Singapore, Sch Comp, Singapore, Singapore
[2] Tsinghua Univ, Dept CST, Beijing, Peoples R China
[3] Univ Bonn, B IT, Bonn, Germany
来源
2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018) | 2018年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Joint representation learning of words and entities benefits many NLP tasks, but has not been well explored in cross-lingual settings. In this paper, we propose a novel method for joint representation learning of cross-lingual words and entities. It captures mutually complementary knowledge, and enables cross-lingual inferences among knowledge bases and texts. Our method does not require parallel corpora, and automatically generates comparable data via distant supervision using multi-lingual knowledge bases. We utilize two types of regularizers to align cross-lingual words and entities, and design knowledge attention and crosslingual attention to further reduce noises. We conducted a series of experiments on three tasks: word translation, entity relatedness, and cross-lingual entity linking. The results, both qualitatively and quantitatively, demonstrate the significance of our method.
引用
收藏
页码:227 / 237
页数:11
相关论文
共 50 条
[31]   MultiAligNet: Cross-lingual Knowledge Bridges Between Words and Senses [J].
Grasso, Francesca ;
Rulfi, Vladimiro Lovera ;
Di Caro, Luigi .
KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT, EKAW 2022, 2022, 13514 :36-50
[32]   CROSS-LINGUAL CYBERSECURITY ANALYTICS IN THE INTERNATIONAL DARK WEB WITH ADVERSARIAL DEEP REPRESENTATION LEARNING [J].
Ebrahimi, Mohammadreza ;
Chai, Yidong ;
Samtani, Sagar ;
Chen, Hsinchun .
MIS QUARTERLY, 2022, 46 (02) :1209-1226
[33]   Translation Artifacts in Cross-lingual Transfer Learning [J].
Artetxe, Mikel ;
Labaka, Gorka ;
Agirre, Eneko .
PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, :7674-7684
[34]   Choosing Transfer Languages for Cross-Lingual Learning [J].
Lin, Yu-Hsiang ;
Chen, Chian-Yu ;
Lee, Jean ;
Li, Zirui ;
Zhang, Yuyan ;
Xia, Mengzhou ;
Rijhwani, Shruti ;
He, Junxian ;
Zhang, Zhisong ;
Ma, Xuezhe ;
Anastasopoulos, Antonios ;
Littell, Patrick ;
Neubig, Graham ;
Anastasopoulos, Antonios ;
Littell, Patrick ;
Neubig, Graham .
57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, :3125-3135
[35]   Learning to Enrich Query Representation with Pseudo-Relevance Feedback for Cross-lingual Retrieval [J].
Chandradevan, Ramraj ;
Yang, Eugene ;
Yarmohammadi, Mahsa ;
Agichtein, Eugene .
PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, :1790-1795
[36]   Active Learning for Cross-Lingual Sentiment Classification [J].
Li, Shoushan ;
Wang, Rong ;
Liu, Huanhuan ;
Huang, Chu-Ren .
NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2013, 2013, 400 :236-246
[37]   Cross-lingual learning for text processing: A survey [J].
Pikuliak, Matus ;
Simko, Marian ;
Bielikova, Maria .
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 165
[38]   Learning Cross-lingual Word Embeddings via Matrix Co-factorization [J].
Shi, Tianze ;
Liu, Zhiyuan ;
Liu, Yang ;
Sun, Maosong .
PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL) AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (IJCNLP), VOL 2, 2015, :567-572
[39]   Boosting Cross-Lingual Transfer via Self-Learning with Uncertainty Estimation [J].
Xu, Liyan ;
Zhang, Xuchao ;
Zhao, Xujiang ;
Chen, Haifeng ;
Chen, Feng ;
Choi, Jinho D. .
2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, :6716-6723
[40]   Rumour Detection via Zero-Shot Cross-Lingual Transfer Learning [J].
Tian, Lin ;
Zhang, Xiuzhen ;
Lau, Jey Han .
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, 2021, 12975 :603-618