Similarity Measurement for Sentiment Classification on Textual Reviews

被引:0
|
作者
Thongtan, Tan [1 ]
Phienthrakul, Tanasanee [2 ]
机构
[1] Mahidol Univ, Fac Engn, Dept Comp Engn, Mahidol Univ Int Coll, Nakhon Pathom, Thailand
[2] Mahidol Univ, Fac Engn, Dept Comp Engn, Nakhon Pathom, Thailand
来源
ISMSI 2018: PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, METAHEURISTICS & SWARM INTELLIGENCE | 2018年
关键词
Similarity Measure; Sentiment Classification; Textual Reviews; Document Vector;
D O I
10.1145/3206185.3206204
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment classification on textual reviews refers to classifying textual reviews based on whether they are positive or negative. This research focuses on classifying movie reviews, and is benchmarked on the IMDB dataset, which consists of long movie reviews, using accuracy as the evaluation metric. In sentiment classification, each document must be mapped to a fixed length vector. Document embedding models map each document to a dense, low-dimensional vector in continuous vector space. This research proposes to train document embedding using cosine similarity instead of dot product. Experiments on the IMDB dataset show that accuracy is improved when using cosine similarity compared to using dot product, while using feature combination with Naive-Bayes weighted bag of n-grams achieves a new state of the art accuracy of 97.4%.
引用
收藏
页码:24 / 28
页数:5
相关论文
共 50 条
  • [31] SNIPPET-BASED UNSUPERVISED APPROACH FOR SENTIMENT CLASSIFICATION OF CHINESE ONLINE REVIEWS
    Li, Yijun
    Ye, Qiang
    Zhang, Ziqiong
    Wang, Tienan
    INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2011, 10 (06) : 1097 - 1110
  • [32] Textual sentiment classification in tourism research: between manual computing model and machine learning
    Liu, Yi
    Han, Fangfei
    Meng, Lingkun
    Lai, Jun
    Gao, Xuan
    CURRENT ISSUES IN TOURISM, 2025,
  • [33] Sentiment Classification of Consumer-Generated Online Reviews Using Topic Modeling
    Calheiros, Ana Catarina
    Moro, Sergio
    Rita, Paulo
    JOURNAL OF HOSPITALITY MARKETING & MANAGEMENT, 2017, 26 (07) : 675 - 693
  • [34] Sentiment classification of online reviews to travel destinations by supervised machine learning approaches
    Ye, Qiang
    Zhang, Ziqiong
    Law, Rob
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (03) : 6527 - 6535
  • [35] RSCOEWR: Radical-Based Sentiment Classification of Online Education Website Reviews
    Li, Jie
    Sun, Guoying
    COMPUTER JOURNAL, 2023, 66 (12) : 3000 - 3014
  • [36] A multi-granularity fuzzy computing model for sentiment classification of Chinese reviews
    Wang, Bingkun
    Huang, Yongfeng
    Yuan, Zhigang
    Li, Xing
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2016, 30 (03) : 1445 - 1460
  • [37] Sentiment Classification of Chinese Movie Reviews in Micro-Blog Based on Context
    Mou, Xing
    Du, Yajun
    PROCEEDINGS OF 2016 IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA 2016), 2016, : 313 - 318
  • [38] SVM-Based Comments Classification and Mining of Virtual Community: For Case of Sentiment Classification of Hotel Reviews
    Xia, Huosong
    Peng, Liuyan
    2009 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION SYSTEMS AND APPLICATIONS, PROCEEDINGS, 2009, : 507 - 511
  • [39] Harnessing consumer reviews for marketing intelligence: a domain-adapted sentiment classification approach
    Chin-Sheng Yang
    Cheng-Hsiung Chen
    Pei-Chann Chang
    Information Systems and e-Business Management, 2015, 13 : 403 - 419
  • [40] A Rule-Based Sentiment Classification Framework for Health Reviews on Mobile Social Media
    Khan, Aurangzeb
    Asghar, Muhammad Zubair
    Ahmad, Hussain
    Kundi, Fazal Masud
    Ismail, Sadia
    JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2017, 7 (06) : 1445 - 1453