Refining Word Embeddings Using Intensity Scores for Sentiment Analysis

被引:110
|
作者
Yu, Liang-Chih [1 ]
Wang, Jin [2 ,3 ]
Lai, K. Robert [2 ]
Zhang, Xuejie [3 ]
机构
[1] Yuan Ze Univ, Dept Informat Management, Taoyuan 32003, Taiwan
[2] Yuan Ze Univ, Dept Comp Sci & Engn, Taoyuan 32003, Taiwan
[3] Yunnan Univ, Sch Informat Sci & Engn, Kunming 650000, Yunnan, Peoples R China
关键词
Sentiment analysis; word embeddings; sentiment embeddings; word vector refinement; AFFECTIVE NORMS; VALENCE; AROUSAL; ADAPTATION; DOMINANCE; MODELS; SPACE; ANEW;
D O I
10.1109/TASLP.2017.2788182
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Word embeddings that provide continuous lowdimensional vector representations of words have been extensively used for various natural language processing tasks. However, existing context-based word embeddings such as Word2vec and GloVe typically fail to capture sufficient sentiment information, which may result in words with similar vector representations having an opposite sentiment polarity (e.g., good and bad), thus degrading sentiment analysis performance. To tackle this problem, recent studies have suggested learning sentiment embeddings to incorporate the sentiment polarity (positive and negative) information from labeled corpora. This study adopts another strategy to learn sentiment embeddings. Instead of creating a new word embedding from labeled corpora, we propose a word vector refinement model to refine existing pretrained word vectors using real-valued sentiment intensity scores provided by sentiment lexicons. The idea of the refinement model is to improve each word vector such that it can be closer in the lexicon to both semantically and sentimentally similar words (i.e., those with similar intensity scores) and further away from sentimentally dissimilar words (i.e., those with dissimilar intensity scores). An obvious advantage of the proposed method is that it can be applied to any pretrained word embeddings. In addition, the intensity scores can provide more finegrained (real-valued) sentiment information than binary polarity labels to guide the refinement process. Experimental results show that the proposed refinement model can improve both conventional word embeddings and previously proposed sentiment embeddings for binary, ternary, and fine- grained sentiment classification on the SemEval and Stanford Sentiment Treebank datasets.
引用
收藏
页码:671 / 681
页数:11
相关论文
共 50 条
  • [1] Refining Word Embeddings with Sentiment Information for Sentiment Analysis
    Kasri M.
    Birjali M.
    Nabil M.
    Beni-Hssane A.
    El-Ansari A.
    El Fissaoui M.
    Journal of ICT Standardization, 2022, 10 (03): : 353 - 382
  • [2] Improving Sentiment Analysis in Twitter Using Sentiment Specific Word Embeddings
    Othman, Rania
    Abdelsadek, Youcef
    Chelghoum, Kamel
    Kacem, Imed
    Faiz, Rim
    PROCEEDINGS OF THE 2019 10TH IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT DATA ACQUISITION AND ADVANCED COMPUTING SYSTEMS - TECHNOLOGY AND APPLICATIONS (IDAACS), VOL. 2, 2019, : 854 - 858
  • [3] Word Embeddings for Arabic Sentiment Analysis
    Altowayan, A. Aziz
    Tao, Lixin
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 3820 - 3825
  • [4] Refined Global Word Embeddings Based on Sentiment Concept for Sentiment Analysis
    Wang, Yabing
    Huang, Guimin
    Li, Jun
    Li, Hui
    Zhou, Ya
    Jiang, Hua
    IEEE ACCESS, 2021, 9 : 37075 - 37085
  • [5] Sentiment Analysis in Turkish Based on Weighted Word Embeddings
    Onan, Aytug
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [6] Sentiment analysis with covariate-assisted word embeddings
    Xu, Shirong
    Dai, Ben
    Wang, Junhui
    ELECTRONIC JOURNAL OF STATISTICS, 2021, 15 (01): : 3015 - 3039
  • [7] Sentiment Embeddings with Applications to Sentiment Analysis
    Tang, Duyu
    Wei, Furu
    Qin, Bing
    Yang, Nan
    Liu, Ting
    Zhou, Ming
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (02) : 496 - 509
  • [8] Cross-domain sentiment aware word embeddings for review sentiment analysis
    Liu, Jun
    Zheng, Shuang
    Xu, Guangxia
    Lin, Mingwei
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (02) : 343 - 354
  • [9] Cross-domain sentiment aware word embeddings for review sentiment analysis
    Jun Liu
    Shuang Zheng
    Guangxia Xu
    Mingwei Lin
    International Journal of Machine Learning and Cybernetics, 2021, 12 : 343 - 354
  • [10] Learning emotional word embeddings for sentiment analysis
    Zeng, Qingtian
    Zhao, Xishi
    Hu, Xiaohui
    Duan, Hua
    Zhao, Zhongying
    Li, Chao
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (05) : 9515 - 9527