Similarity Measurement for Sentiment Classification on Textual Reviews

被引:0
|
作者
Thongtan, Tan [1 ]
Phienthrakul, Tanasanee [2 ]
机构
[1] Mahidol Univ, Fac Engn, Dept Comp Engn, Mahidol Univ Int Coll, Nakhon Pathom, Thailand
[2] Mahidol Univ, Fac Engn, Dept Comp Engn, Nakhon Pathom, Thailand
来源
ISMSI 2018: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, METAHEURISTICS & SWARM INTELLIGENCE | 2018年
关键词
Similarity Measure; Sentiment Classification; Textual Reviews; Document Vector;
D O I
10.1145/3206185.3206204
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment classification on textual reviews refers to classifying textual reviews based on whether they are positive or negative. This research focuses on classifying movie reviews, and is benchmarked on the IMDB dataset, which consists of long movie reviews, using accuracy as the evaluation metric. In sentiment classification, each document must be mapped to a fixed length vector. Document embedding models map each document to a dense, low-dimensional vector in continuous vector space. This research proposes to train document embedding using cosine similarity instead of dot product. Experiments on the IMDB dataset show that accuracy is improved when using cosine similarity compared to using dot product, while using feature combination with Naive-Bayes weighted bag of n-grams achieves a new state of the art accuracy of 97.4%.
引用
收藏
页码:24 / 28
页数:5
相关论文
共 50 条
  • [1] How textual quality of online reviews affect classification performance: a case of deep learning sentiment analysis
    Li, Lin
    Goh, Tiong-Thye
    Jin, Dawei
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (09) : 4387 - 4415
  • [2] How textual quality of online reviews affect classification performance: a case of deep learning sentiment analysis
    Lin Li
    Tiong-Thye Goh
    Dawei Jin
    Neural Computing and Applications, 2020, 32 : 4387 - 4415
  • [3] Multilingual Sentiment Classification on Large Textual Data
    Polpinij, Jantima
    2014 IEEE FOURTH INTERNATIONAL CONFERENCE ON BIG DATA AND CLOUD COMPUTING (BDCLOUD), 2014, : 183 - 188
  • [4] Sentiment classification of customer reviews on electric products
    Miyoshi, Tetsuya
    Nakagaini, Yu
    2007 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-8, 2007, : 3558 - 3563
  • [5] Sentiment Analysis and classification for Software as a Service Reviews
    Alkalbani, Asma Musabah
    Ghamry, Ahmed Mohamed
    Hussain, Farookh Khadeer
    Hussain, Omar Khadeer
    IEEE 30TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS IEEE AINA 2016, 2016, : 53 - 58
  • [6] Experiments in sentiment classification of movie reviews in Spanish
    Cruz, Fermin L.
    Troyano, Jose A.
    Enriquez, Fernando
    Ortega, Javier
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2008, (41): : 73 - 80
  • [7] Intelligent Hybrid Feature Selection for Textual Sentiment Classification
    Khan, Jawad
    Alam, Aftab
    Lee, Youngmoon
    IEEE ACCESS, 2021, 9 : 140590 - 140608
  • [8] SENTIMENT CLASSIFICATION ON CHINESE REVIEWS BASED ON AMBIGUOUS SENTIMENT CONFINED LIBRARY
    Liu, Meijuan
    Yang, Shicai
    Chen, Qiaofen
    2012 IEEE 2nd International Conference on Cloud Computing and Intelligent Systems (CCIS) Vols 1-3, 2012, : 1470 - 1473
  • [9] A Hybrid Method for Sentiment Classification in Chinese Movie Reviews Based on Sentiment Labels
    Zhao, Kai
    Jin, Yaohong
    PROCEEDINGS OF 2015 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, 2015, : 86 - 89
  • [10] Intensified Sentiment Analysis of Customer Product Reviews Using Acoustic and Textual Features
    Govindaraj, Sureshkumar
    Gopalakrishnan, Kumaravelan
    ETRI JOURNAL, 2016, 38 (03) : 494 - 501