Cross-domain sentiment aware word embeddings for review sentiment analysis

被引:50
作者
Liu, Jun [1 ]
Zheng, Shuang [1 ]
Xu, Guangxia [1 ,2 ,3 ]
Lin, Mingwei [4 ]
机构
[1] Chongqing Univ Posts & Telecommun, Sch Software Engn, Chongqing, Peoples R China
[2] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Cyberspace & Informat Secur, Chongqing 400065, Peoples R China
[3] Chongqing Univ, Informat & Commun Engn Postdoctoral Res Stn, Chongqing, Peoples R China
[4] Fujian Normal Univ, Coll Math & Informat, Fuzhou 350117, Fujian, Peoples R China
基金
中国国家自然科学基金;
关键词
Word embeddings; Sentiment analysis; Deep learning; Domain adaptation; DEEP LEARNING-MODEL; NEURAL-NETWORK; CLASSIFICATION;
D O I
10.1007/s13042-020-01175-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning low-dimensional vector representations of words from a large corpus is one of the basic tasks in natural language processing (NLP). The existing universal word embedding model learns word vectors mainly through grammar and semantic information from the context, while ignoring the sentiment information contained in the words. Some approaches, although they model sentiment information in the reviews, do not consider certain words in different domains. In a case where the emotion changes, if the general word vector is directly applied to the review sentiment analysis task, then this will inevitably affect the performance of the sentiment classification. To solve this problem, this paper extends the CBoW (continuous bag-of-words) word vector model and proposes a cross-domain sentiment aware word embedding learning model, which can capture the sentiment information and domain relevance of a word at the same time. This paper conducts several experiments on Amazon user review data in different domains to evaluate the performance of the model. The experimental results show that the proposed model can obtain a nearly 2% accuracy improvement compared with the general word vector when modeling only the sentiment information of the context. At the same time, when the domain information and the sentiment information are both included, the accuracy and Macro-F1 value of the sentiment classification tasks are significantly improved compared with existing sentiment word embeddings.
引用
收藏
页码:343 / 354
页数:12
相关论文
共 30 条
  • [1] Arabic Word Segmentation With Long Short-Term Memory Neural Networks and Word Embedding
    Almuhareb, Abdulrahman
    Alsanie, Waleed
    Al-Thubaity, Abdulmohsen
    [J]. IEEE ACCESS, 2019, 7 : 12879 - 12887
  • [2] A Human-Inspired Recognition System for Pre-Modern Japanese Historical Documents
    Ann Duc Le
    Clanuwat, Tarin
    Kitamoto, Asanobu
    [J]. IEEE ACCESS, 2019, 7 : 84163 - 84169
  • [3] [Anonymous], [No title captured]
  • [4] [Anonymous], 2019, ICSES TRANSACTION NE
  • [5] Bengio Y, 2001, ADV NEUR IN, V13, P932
  • [6] Representation Learning: A Review and New Perspectives
    Bengio, Yoshua
    Courville, Aaron
    Vincent, Pascal
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) : 1798 - 1828
  • [7] Cross-Domain Sentiment Classification Using Sentiment Sensitive Embeddings
    Bollegala, Danushka
    Mu, Tingting
    Goulermas, John Yannis
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (02) : 398 - 410
  • [8] Automatic Algorithm to Classify and Locate Research Papers Using Natural Language
    Calvillo, E. A.
    Mendoza, R.
    Munoz, J.
    Martinez, J. C.
    Vargas, M.
    Rodriguez, L. C.
    [J]. IEEE LATIN AMERICA TRANSACTIONS, 2016, 14 (03) : 1367 - 1371
  • [9] Sentiment Lexicon Construction With Hierarchical Supervision Topic Model
    Deng, Dong
    Jing, Liping
    Yu, Jian
    Sun, Shaolong
    Ng, Michael K.
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (04) : 704 - 718
  • [10] Adaptive Multi-Compositionality for Recursive Neural Network Models
    Dong, Li
    Wei, Furu
    Xu, Ke
    Liu, Shixia
    Zhou, Ming
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (03) : 422 - 431