Comparing Sentiment Analysis and Document Representation Methods of Amazon Reviews

被引:0
作者
Katic, Tamara [1 ]
Milicevic, Nemanja [1 ]
机构
[1] Univ Novi Sad, Fac Tech Sci, Novi Sad, Serbia
来源
2018 IEEE 16TH INTERNATIONAL SYMPOSIUM ON INTELLIGENT SYSTEMS AND INFORMATICS (SISY 2018) | 2018年
关键词
sentiment analysis; bag-of-words; word embeddings; paragraph vector; convolutional neural networks; long-short term memory;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the last few years sentiment analysis has made much progress. Sentiment analysis has been used in several applications to identify the opinions of people, products, brands, services, etc., which can, for example, improve a company's business. Some of these applications claim to have more effective document representation models than merely Information Retrieval approaches like the bag-of-words representation. Document representation models have increased interest to solve some of the limitations that bag-of-words representation has. In this paper, the several sentiment analysis and document representation methods of Amazon reviews are compared. In this paper, traditional models such as a bag-of-words, bag-of-ngrams and their TF-IDF variants combined with linear classifiers such as Logistic Regression and SVM, and deep learning models such as word-based convolutional neural networks (ConvNets) and the simple long short-term memory (LSTM) recurrent neural network were used. Various document representation techniques such as Paragraph Vector or using pre-trained Word2Vec and Glove word embeddings to compute the vector for each word in the document were tested, and word vectors are aggregated using the element-wise mean. It is shown that deep learning models perform better on our large dataset than traditional models. LSTM resulted with the best accuracy of 95.55%. Deep learning models generally work better than traditional models as training set size increases. Our best performing model can be used for automatic sentiment classification for future product reviews in retail stores.
引用
收藏
页码:283 / 288
页数:6
相关论文
共 22 条
  • [1] Angiani G., 2016, KDWeb
  • [2] [Anonymous], 1999, MODERN INFORM RETRIE
  • [3] [Anonymous], 2017, ARXIV170904219
  • [4] [Anonymous], AC SPEECH SIGN PROC
  • [5] [Anonymous], PYENCHANT SPELL CHEC
  • [6] [Anonymous], 2014, INT C MACH LEARN ICM
  • [7] [Anonymous], 2014, P INT C INT C MACH L
  • [8] [Anonymous], 1988, FORSCHUNGSBERICHT DT
  • [9] [Anonymous], P IEEE C COMP VIS PA
  • [10] Chollet F., 2015, about us