Mutual Information-Based Word Embedding for Unsupervised Cross-Domain Sentiment Classification

被引:0
作者
Liang, Junge [1 ]
Ma, Lei [1 ]
Xiong, Xin [1 ]
Shao, Dangguo [1 ]
Xiang, Yan [1 ]
Wang, Xiongbing [2 ]
机构
[1] Kunming Univ Sci & Technol, Sch Informat Engn & Automat, Kunming, Yunnan, Peoples R China
[2] Kunming Med Univ, Informat & Stat Ctr, Affiliated Hosp 2, Kunming, Yunnan, Peoples R China
来源
2019 IEEE 4TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA) | 2019年
关键词
word embedding; cross-domain sentiment classification; mutual information; pivots; word frequency;
D O I
10.1109/icccbda.2019.8725662
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Unsupervised cross-domain sentiment classification is a challenging task, which trains a sentiment classifier using labeled source domain data and could be applied to unlabeled target domain. This problem can be solved by training cross-domain word embedding. This paper proposes a method of mutual information-based word embedding. Firstly, source domain word embedding is trained based on source domain data. Then the pivots with strong sentimental polarities are selected based on mutual information (MI). According to the MI value of pivots, the transfer coefficient from the source domain to the target domain is calculated. Finally, the transfer coefficient is used as a constraint of the objective of the skip-gram model to train the word embedding of the target domain. Experiments show that the proposed method is superior to the cross-domain word embedding based on word frequency.
引用
收藏
页码:625 / 628
页数:4
相关论文
共 15 条
[1]  
[Anonymous], 2007, P 45 ANN M ASS COMP
[2]   USING MUTUAL INFORMATION FOR SELECTING FEATURES IN SUPERVISED NEURAL-NET LEARNING [J].
BATTITI, R .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (04) :537-550
[3]  
Bollegala D, 2016, IEEE T KNOWLEDGE DAT, V28
[4]  
Bollegala Danushka, 2015, COMPUTER SCI, V65, P60
[5]  
CHURCH KW, 1990, 27TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, P76
[6]  
Kim Yoon, 2014, C EMP METH NAT LANG, P1746, DOI [DOI 10.3115/V1/D14-1181, 10.3115/v1/D14-1181]
[7]  
Lu Wei, 2016, P 2016 C EMP METH NA, P950
[8]  
Mikolov Tomas, 2013, P 1 INT C LEARN REPR
[9]  
Mnih A., 2007, P 24 INT C MACHINE L, P641, DOI DOI 10.1145/1273496.1273577
[10]  
Pan S. J., 2010, Proceedings of the Intl Conf. On Worldwide Web, P751, DOI DOI 10.1145/1772690.1772767