Context-Aware Sentiment Analysis using Tweet Expansion Method

被引:2
作者
Tahayna, Bashar [1 ]
Ayyasamy, Ramesh Kumar [1 ]
Akbar, Rehan [2 ]
机构
[1] Univ Tunku Abdul Rahman, Fac Informat & Commun Technol, Dept Informat Syst, Jalan Univ, Kampar 31900, Perak, Malaysia
[2] Univ Teknol Persiaran UTP, Dept Comp & Informat Sci, Seri Iskandar 32610, Perak, Malaysia
关键词
embedding; neural networks; sentiment analysis; tweet enrichment; deep learning;
D O I
10.5614/itbj.ict.res.appl.2022.16.2.3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The large source of information space produced by the plethora of social media platforms in general and microblogging in particular has spawned a slew of new applications and prompted the rise and expansion of sentiment analysis research. We propose a sentiment analysis technique that identifies the main parts to describe tweet intent and also enriches them with relevant words, phrases, or even inferred variables. We followed a state-of-the-art hybrid deep learning model to combine Convolutional Neural Network (CNN) and the Long Short-Term Memory network (LSTM) to classify tweet data based on their polarity. To preserve the latent relationships between tweet terms and their expanded representation, sentence encoding and contextualized word embeddings are utilized. To investigate the performance of tweet embeddings on the sentiment analysis task, we tested several context-free models (Word2Vec, Sentence2Vec, Glove, and FastText), a dynamic embedding model (BERT), deep contextualized word representations (ELMo), and an entity-based model (Wikipedia). The proposed method and results prove that text enrichment improves the accuracy of sentiment polarity classification with a notable percentage.
引用
收藏
页码:138 / 151
页数:14
相关论文
共 22 条
  • [1] Sentiment Analysis Using Common-Sense and Context Information
    Agarwal, Basant
    Mittal, Namita
    Bansal, Pooja
    Garg, Sonal
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2015, 2015
  • [2] [Anonymous], 2017, Eprint Arxiv
  • [3] Bojanowski P., 2017, Trans. Assoc. Comput. Linguistics, V5, P135, DOI [DOI 10.1162/TACLA00051, 10.1162/tacl_a_00051, DOI 10.1162/TACL_A_00051]
  • [4] Buccoliero L., 2020, Journal of Marketing Communications, V26, P88, DOI [DOI 10.1080/13527266.2018.1504228, https://doi.org/10.1080/13527266.2018.1504228]
  • [5] Combinatorial feature embedding based on CNN and LSTM for biomedical named entity recognition
    Cho, Minsoo
    Ha, Jihwan
    Park, Chihyun
    Park, Sanghyun
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 103
  • [6] Devlin J, 2019, Arxiv, DOI arXiv:1810.04805
  • [7] Peters ME, 2018, Arxiv, DOI arXiv:1802.05365
  • [8] Go Alec., 2009, CS224N project report 1.12
  • [9] Guggilla C., 2016, P COLING 2016 26 INT, P2740
  • [10] Encoding Syntactic Knowledge in Neural Networks for Sentiment Classification
    Huang, Minlie
    Qian, Qiao
    Zhu, Xiaoyan
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2017, 35 (03)