Estimating Sentiment via Probability and Information Theory

被引:9
|
作者
Labille, Kevin [1 ]
Alfarhood, Sultan [1 ]
Gauch, Susan [1 ]
机构
[1] Univ Arkansas, Dept Comp Sci & Comp Engn, Fayetteville, AR 72701 USA
来源
KDIR: PROCEEDINGS OF THE 8TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT - VOL. 1 | 2016年
关键词
Lexicons; Sentiment Analysis; Data Mining; Text Mining; Opinion Mining;
D O I
10.5220/0006072101210129
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Opinion detection and opinion analysis is a challenging but important task. Such sentiment analysis can be done using traditional supervised learning methods such as naive Bayes classification and support vector machines (SVM) or unsupervised approaches based on a lexicon may be employed. Because lexicon-based sentiment analysis methods make use of an opinion dictionary that is a list of opinion-bearing or sentiment words, sentiment lexicons play a key role. Our work focuses on the task of generating such a lexicon. We propose several novel methods to automatically generate a general-purpose sentiment lexicon using a corpus-based approach. While most existing methods generate a lexicon using a list of seed sentiment words and a domain corpus, our work differs from these by generating a lexicon from scratch using probabilistic techniques and information theoretical text mining techniques on a large diverse corpus. We conclude by presenting an ensemble method that combines the two approaches. We evaluate and demonstrate the effectiveness of our methods by utilizing the various automatically-generated lexicons during sentiment analysis. When used for sentiment analysis, our best single lexicon achieves an accuracy of 87.60% and the ensemble approach achieves 88.75% accuracy, both statistically significant improvements over 81.60% with a widely-used sentiment lexicon.
引用
收藏
页码:121 / 129
页数:9
相关论文
共 50 条
  • [1] Building a Restaurant-Specific Sentiment Lexicon via Probability Theory
    de Melo, Tiago
    PROCEEDINGS OF THE 27TH BRAZILIAN SYMPOSIUM ON MULTIMEDIA AND THE WEB (WEBMEDIA '21), 2021, : 129 - 132
  • [2] Sentiment Analysis of Teachers Using Social Information in Educational Platform Environments
    Spatiotis, Nikolaos
    Perikos, Isidoros
    Mporas, Losif
    Paraskevas, Michael
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2020, 29 (02)
  • [3] SentiDiff: Combining Textual Information and Sentiment Diffusion Patterns for Twitter Sentiment Analysis
    Wang, Lei
    Niu, Jianwei
    Yu, Shui
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (10) : 2026 - 2039
  • [4] Using information retrieval for sentiment polarity prediction
    Kauer, Anderson Uilian
    Moreira, Viviane P.
    EXPERT SYSTEMS WITH APPLICATIONS, 2016, 61 : 282 - 289
  • [5] Sentiment Analysis via Deep Multichannel Neural Networks With Variational Information Bottleneck
    Gu, Tong
    Xu, Guoliang
    Luo, Jiangtao
    IEEE ACCESS, 2020, 8 : 121014 - 121021
  • [6] Sentiment analysis on microblog utilizing appraisal theory
    Korenek, Peter
    Simko, Marian
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2014, 17 (04): : 847 - 867
  • [7] Sentiment analysis on microblog utilizing appraisal theory
    Peter Korenek
    Marián Šimko
    World Wide Web, 2014, 17 : 847 - 867
  • [8] Refining Word Embeddings with Sentiment Information for Sentiment Analysis
    Kasri M.
    Birjali M.
    Nabil M.
    Beni-Hssane A.
    El-Ansari A.
    El Fissaoui M.
    Journal of ICT Standardization, 2022, 10 (03): : 353 - 382
  • [9] Sentiment Analysis Based on Financial Tweets and Market Information
    Ao, Shen
    2018 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), 2018, : 321 - 326
  • [10] SentiMI: Introducing point-wise mutual information with SentiWordNet to improve sentiment polarity detection
    Khan, Farhan Hassan
    Qamar, Usman
    Bashir, Saba
    APPLIED SOFT COMPUTING, 2016, 39 : 140 - 153