Investigating Word Meta-Embeddings by Disentangling Common and Individual Information

被引:3
作者
Chen, Wenfan [1 ]
Sheng, Mengmeng [1 ,2 ]
Mao, Jiafa [1 ]
Sheng, Weiguo [3 ]
机构
[1] Zhejiang Univ Technol, Sch Comp Sci & Technol, Hangzhou 310023, Peoples R China
[2] Zhejiang Police Coll, Hangzhou 310053, Peoples R China
[3] Hangzhou Normal Univ, Dept Comp Sci, Hangzhou 311121, Peoples R China
基金
中国国家自然科学基金;
关键词
Natural language processing; word meta-embedding; word representation;
D O I
10.1109/ACCESS.2020.2965719
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the field of natural language processing, combining multiple pre-trained word embeddings has become a viable approach to improve word representations. However, there is still a lack of understanding of why such improvements can be achieved. In this paper, we investigate this issue by firstly proposing a novel word meta-embedding method. The proposed method tends to disentangle common and individual information from different word embeddings and learns representations for both. Based on the proposed method, we then carry out a systematic evaluation to provide a perspective on how common and individual information contributes to different tasks. Our intrinsic evaluation results suggest that common information is critical for word-level representations in terms of word similarity and relatedness. While, based on natural language inference, our extrinsic evaluation results show that common and individual information plays different roles and can complement each other. Further, both intrinsic and extrinsic evaluations reveal that explicitly separating common and individual information is beneficial for learning word meta-embeddings.
引用
收藏
页码:11692 / 11699
页数:8
相关论文
共 33 条
[1]  
[Anonymous], 2015, P 19 C COMP NAT LANG
[2]  
[Anonymous], 2016, PROC 2016 C EMPIRICA
[3]  
[Anonymous], 2018, P C EMP METH NAT LAN
[4]  
[Anonymous], 2008, P 25 INT C MACH LEAR, DOI DOI 10.1145/1390156.1390177
[5]  
[Anonymous], 2009, P HLT 2009 ANN C NAA
[6]  
[Anonymous], P INT C LEARN REPR
[7]  
Bollegala D, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P3970
[8]  
Bollegala Danushka, 2018, P 27 INT C COMP LING, P1650
[9]  
Bowman Samuel R., 2015, EMNLP, P632
[10]   Multimodal Distributional Semantics [J].
Bruni, Elia ;
Nam Khanh Tran ;
Baroni, Marco .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2014, 49 :1-47