Sentiment Embeddings with Applications to Sentiment Analysis

被引:192
|
作者
Tang, Duyu [1 ]
Wei, Furu [2 ]
Qin, Bing [1 ]
Yang, Nan [2 ]
Liu, Ting [1 ]
Zhou, Ming [2 ]
机构
[1] Harbin Inst Technol, Harbin 150001, Peoples R China
[2] Microsoft Res, Beijing 100080, Peoples R China
基金
中国国家自然科学基金;
关键词
Natural language processing; word embeddings; sentiment analysis; neural networks; SPACES; MODELS;
D O I
10.1109/TKDE.2015.2489653
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose learning sentiment-specific word embeddings dubbed sentiment embeddings in this paper. Existing word embedding learning algorithms typically only use the contexts of words but ignore the sentiment of texts. It is problematic for sentiment analysis because the words with similar contexts but opposite sentiment polarity, such as good and bad, are mapped to neighboring word vectors. We address this issue by encoding sentiment information of texts (e.g., sentences and words) together with contexts of words in sentiment embeddings. By combining context and sentiment level evidences, the nearest neighbors in sentiment embedding space are semantically similar and it favors words with the same sentiment polarity. In order to learn sentiment embeddings effectively, we develop a number of neural networks with tailoring loss functions, and collect massive texts automatically with sentiment signals like emoticons as the training data. Sentiment embeddings can be naturally used as word features for a variety of sentiment analysis tasks without feature engineering. We apply sentiment embeddings to word-level sentiment analysis, sentence level sentiment classification, and building sentiment lexicons. Experimental results show that sentiment embeddings consistently outperform context-based embeddings on several benchmark datasets of these tasks. This work provides insights on the design of neural networks for learning task-specific word embeddings in other natural language processing tasks.
引用
收藏
页码:496 / 509
页数:14
相关论文
共 50 条
  • [21] Multi-channel word embeddings for sentiment analysis
    Jhe-Wei Lin
    Tran Duy Thanh
    Rong-Guey Chang
    Soft Computing, 2022, 26 : 12703 - 12715
  • [22] Sentiment Analysis with Contextual Embeddings and Self-attention
    Biesialska, Katarzyna
    Biesialska, Magdalena
    Rybinski, Henryk
    FOUNDATIONS OF INTELLIGENT SYSTEMS (ISMIS 2020), 2020, 12117 : 32 - 41
  • [23] Multi-channel word embeddings for sentiment analysis
    Lin, Jhe-Wei
    Thanh, Tran Duy
    Chang, Rong-Guey
    SOFT COMPUTING, 2022, 26 (22) : 12703 - 12715
  • [24] Sentiment Analysis using Topic-Document Embeddings
    Mitroi, Madalina
    Truica, Ciprian-Octavian
    Apostol, Elena-Simona
    Florea, Adina Magda
    2020 IEEE 16TH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTER COMMUNICATION AND PROCESSING (ICCP 2020), 2020, : 75 - 82
  • [25] Persian Sentiment Analysis without Training Data Using Cross-Lingual Word Embeddings
    Aliramezani, Mohammad
    Doostmohammadi, Ehsan
    Bokaei, Mohammad Hadi
    Sameti, Hossien
    2020 10TH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2020, : 78 - 82
  • [26] SENTIMENT ANALYSIS
    Das, Ayush
    2017 8TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2017,
  • [27] A Comparative Study of Pre-trained Word Embeddings for Arabic Sentiment Analysis
    Zouidine, Mohamed
    Khalil, Mohammed
    2022 IEEE 46TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2022), 2022, : 1243 - 1248
  • [28] Word Embeddings with Fuzzy Ontology Reasoning for Feature Learning in Aspect Sentiment Analysis
    Sweidan, Asmaa Hashem
    El-Bendary, Nashwa
    Al-Feel, Haytham
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT II, 2022, 13530 : 320 - 331
  • [29] Sentiment Analysis
    Stine, Robert A.
    ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, VOL 6, 2019, 6 : 287 - 308
  • [30] THE JOINT EFFECT OF SEMANTIC AND SYNTACTIC WORD EMBEDDINGS ON SENTIMENT ANALYSIS
    Chen, Shu
    Chen, Guang
    Wang, Wei
    PROCEEDINGS OF 2016 5TH IEEE INTERNATIONAL CONFERENCE ON NETWORK INFRASTRUCTURE AND DIGITAL CONTENT (IEEE IC-NIDC 2016), 2016, : 366 - 370