Similarity Measurement for Sentiment Classification on Textual Reviews

Authors
Thongtan, Tan [1]
Phienthrakul, Tanasanee [2]
Affiliations
[1] Mahidol Univ, Fac Engn, Dept Comp Engn, Mahidol Univ Int Coll, Nakhon Pathom, Thailand
[2] Mahidol Univ, Fac Engn, Dept Comp Engn, Nakhon Pathom, Thailand
Source
ISMSI 2018: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, METAHEURISTICS & SWARM INTELLIGENCE | 2018
Keywords
Similarity Measure; Sentiment Classification; Textual Reviews; Document Vector
DOI
10.1145/3206185.3206204
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Sentiment classification of textual reviews is the task of labeling each review as positive or negative. This research focuses on movie reviews and is benchmarked on the IMDB dataset, which consists of long movie reviews, with accuracy as the evaluation metric. For sentiment classification, each document must be mapped to a fixed-length vector. Document embedding models map each document to a dense, low-dimensional vector in a continuous vector space. This research proposes training document embeddings using cosine similarity instead of the dot product. Experiments on the IMDB dataset show that cosine similarity yields higher accuracy than the dot product, and that combining the resulting features with Naive Bayes weighted bag of n-grams achieves a new state-of-the-art accuracy of 97.4%.
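The difference between the two similarity measures named in the abstract can be illustrated with a minimal sketch. It assumes, purely for illustration, that the model scores a (document, n-gram) pair by passing a similarity value through a sigmoid; the abstract does not state the training objective, and the names `pair_probability` and `alpha` are hypothetical. The point shown is only that the dot product grows with vector length, while cosine similarity depends on direction alone.

```python
import math

def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

def cosine(u, v):
    """Cosine similarity: dot product divided by the product of the norms."""
    norm_u = math.sqrt(dot(u, u))
    norm_v = math.sqrt(dot(v, v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # convention chosen here for zero vectors
    return dot(u, v) / (norm_u * norm_v)

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def pair_probability(doc_vec, ngram_vec, use_cosine=True, alpha=1.0):
    """Hypothetical scoring function (an assumption, not taken from the paper).

    alpha is an illustrative scaling factor applied to the similarity score.
    """
    if use_cosine:
        score = cosine(doc_vec, ngram_vec)
    else:
        score = dot(doc_vec, ngram_vec)
    return sigmoid(alpha * score)

# Toy vectors: scaled_doc points in the same direction as doc but is 10x longer.
doc = [0.5, 1.0, -0.5]
ngram = [1.0, 2.0, -1.0]
scaled_doc = [10 * x for x in doc]

print(dot(doc, ngram), dot(scaled_doc, ngram))        # dot product changes with length
print(cosine(doc, ngram), cosine(scaled_doc, ngram))  # cosine is unchanged by scaling
```

Because cosine similarity is bounded in [-1, 1], the score cannot be inflated simply by increasing the magnitude of an embedding, which is one plausible reason it might behave differently from the dot product during training.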
Pages: 24-28
Page count: 5