A Deep Learning Model Enhanced with Emotion Semantics for Microblog Sentiment Analysis

被引:0
|
作者
He Y.-X. [1 ,2 ]
Sun S.-T. [1 ]
Niu F.-F. [1 ]
Li F. [1 ]
机构
[1] Computer School, Wuhan University, Wuhan
[2] State Key Laboratory of Software Engineering, Wuhan University, Wuhan
来源
Jisuanji Xuebao/Chinese Journal of Computers | 2017年 / 40卷 / 04期
基金
中国国家自然科学基金; 高等学校博士学科点专项科研基金;
关键词
Convolution neural network; Deep learning; Microblog; Natural language processing; Sentiment analysis; Social networks;
D O I
10.11897/SP.J.1016.2017.00773
中图分类号
学科分类号
摘要
Word embedding based on neural language model can automatically learn effective word representation from massive unlabeled text dataset, and has made essential progress in many natural language processing tasks. Emoticons in microblog are important emotion signals for microblog sentiment analysis. There have been a lot of research works exploiting emoticons to improve sentiment classification performance for microblog effectively. Commonly used emoticons are adopted to construct an emotion space as feature representation matrix RE from their word embedding. On the basis of vector based semantic composition, the projection to emotion space is performed as matrix-vector multiplication between RE and other embedding. Then, the results are forward to MCNN to learn a sentiment classifier for microblog. This new model is named as EMCNN, short for Emotion-semantic enhanced MCNN, which seamlessly integrates emotion space projection based on emoticon into deep learning model MCNN to enhance its ability of capturing emotion semantic. On the datasets of NLPCC microblog sentiment analysis task, EMCNN achieves the best performance in several sentiment classification experiments and surpass the state-of-the-art results on all the performance metrics. Comparing to MCNN, EMCNN not only improve the classification performance, but also reduce the training time, i.e. 36.15% for subject classification and 33.82% for 7-class sentiment classification. © 2017, Science Press. All right reserved.
引用
收藏
页码:773 / 790
页数:17
相关论文
共 40 条
  • [1] Tumasjan A., Sprenger T.O., Sandner P.G., Et al., Predicting elections with Twitter: What 140 characters reveal about political sentiment, Proceedings of the 4th International AAAI Conference on Web and Social Media (ICWSM-10), pp. 178-185, (2010)
  • [2] Jansen B.J., Zhang M., Sobel K., Et al., Micro-blogging as online word of mouth branding, Proceedings of the 27th International Conference on Human Factors in Computing Systems (CHI 2009): Extended Abstracts Volume, pp. 3859-3864, (2009)
  • [3] O'Connor B., Balasubramanyan R., Routledge B.R., Et al., From tweets to polls: Linking text sentiment to public opinion time series, Proceedings of the 4th International AAAI Conference on Web and Social Media (ICWSM-10), pp. 122-129, (2010)
  • [4] Liu B., Sentiment analysis and opinion mining, Synthesis Lectures on Human Language Technologies, 5, 1, pp. 1-167, (2012)
  • [5] Davidov D., Tsur O., Rappoport A., Enhanced sentiment learning using Twitter hashtags and smileys, Proceedings of the 23rd International Conference on Computational Linguistics (COLING 2010): Posters, pp. 241-249, (2010)
  • [6] Kouloumpis E., Wilson T., Moore J.D., Twitter sentiment analysis: The good the bad and the OMG!, Proceedings of the 5th International AAAI Conference on Web and Social Media (ICWSM-11), pp. 538-541, (2011)
  • [7] Schlichtkrull M.S., Learning affective projections for emoticons on Twitter, Proceedings of the 6th IEEE International Conference on Cognitive Infocommunications (CogInfoCom 2015), pp. 539-543, (2015)
  • [8] Derks D., Bos A.E., Von Grumbkow J., Emoticons and social interaction on the Internet: The importance of social context, Computers in Human Behavior, 23, 1, pp. 842-849, (2007)
  • [9] Mikolov T., Le Q.V., Sutskever I., Exploiting similarities among languages for machine translation, (2013)
  • [10] Mikolov T., Sutskever I., Chen K., Et al., Distributed representations of words and phrases and their compositionality, Proceedings of the 27th Advances in Neural Information Processing Systems (NIPS 2013), pp. 3111-3119, (2013)