Estimating Sentiment via Probability and Information Theory

被引:9
|
作者
Labille, Kevin [1 ]
Alfarhood, Sultan [1 ]
Gauch, Susan [1 ]
机构
[1] Univ Arkansas, Dept Comp Sci & Comp Engn, Fayetteville, AR 72701 USA
来源
KDIR: PROCEEDINGS OF THE 8TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT - VOL. 1 | 2016年
关键词
Lexicons; Sentiment Analysis; Data Mining; Text Mining; Opinion Mining;
D O I
10.5220/0006072101210129
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Opinion detection and opinion analysis is a challenging but important task. Such sentiment analysis can be done using traditional supervised learning methods such as naive Bayes classification and support vector machines (SVM) or unsupervised approaches based on a lexicon may be employed. Because lexicon-based sentiment analysis methods make use of an opinion dictionary that is a list of opinion-bearing or sentiment words, sentiment lexicons play a key role. Our work focuses on the task of generating such a lexicon. We propose several novel methods to automatically generate a general-purpose sentiment lexicon using a corpus-based approach. While most existing methods generate a lexicon using a list of seed sentiment words and a domain corpus, our work differs from these by generating a lexicon from scratch using probabilistic techniques and information theoretical text mining techniques on a large diverse corpus. We conclude by presenting an ensemble method that combines the two approaches. We evaluate and demonstrate the effectiveness of our methods by utilizing the various automatically-generated lexicons during sentiment analysis. When used for sentiment analysis, our best single lexicon achieves an accuracy of 87.60% and the ensemble approach achieves 88.75% accuracy, both statistically significant improvements over 81.60% with a widely-used sentiment lexicon.
引用
收藏
页码:121 / 129
页数:9
相关论文
共 50 条
  • [41] INFORMATION EXTRACTION AND SENTIMENT ANALYSIS OF HOTEL REVIEWS IN CROATIA
    Suman, Sabrina
    Vignjevic, Milorad
    Car, Tomislav
    ZBORNIK VELEUCILISTA U RIJECI-JOURNAL OF THE POLYTECHNICS OF RIJEKA, 2023, 11 (01): : 69 - 89
  • [42] Quantifying the effect of sentiment on information diffusion in social media
    Ferrara, Emilio
    Yang, Zeyao
    PEERJ COMPUTER SCIENCE, 2015, 2015 (09)
  • [43] Sentiment Analysis Method Review in Information Systems Research
    Tao, Youyou
    AMCIS 2014 PROCEEDINGS, 2014,
  • [44] Sentiment classification improvement using semantically enriched information
    Scheicher, Ricardo B.
    Sinoara, Roberta A.
    Felinto, Jonas C.
    Rezende, Solange O.
    DOCENG'19: PROCEEDINGS OF THE ACM SYMPOSIUM ON DOCUMENT ENGINEERING 2019, 2019,
  • [45] A Framework for Sentiment Analysis on Schema-based Research Content Via Lexica Analysis
    Goodarzi, Marjan
    Mahmoudi, Maryam Tayefeh
    Zamani, Ramin
    2014 7TH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2014, : 405 - 411
  • [46] Leveraging sentiment analysis via text mining to improve customer satisfaction in UK banks
    Ghadiridehkordi, Amirreza
    Shao, Jia
    Boojihawon, Roshan
    Wang, Qianxi
    Li, Hui
    INTERNATIONAL JOURNAL OF BANK MARKETING, 2025, 43 (04) : 780 - 802
  • [47] Fuzzy information granulation towards interpretable sentiment analysis
    Liu H.
    Cocea M.
    Granular Computing, 2017, 2 (4) : 289 - 302
  • [48] The Multimodal Sentiment Analysis of Online Product Marketing Information Using Text Mining and Big Data
    Fang, Zhuo
    Qian, Yufeng
    Su, Chang
    Miao, Yurong
    Li, Yanmin
    JOURNAL OF ORGANIZATIONAL AND END USER COMPUTING, 2022, 34 (01)
  • [49] Sentiment Analysis of Product Reviews to Identify Deceptive Rating Information in Social Media: A SentiDeceptive Approach
    Marwat, M. Irfan
    Khan, Javed Ali
    Alshehri, Mohammad Dahman
    Ali, Muhammad Asghar
    Hizbullah
    Ali, Haider
    Assam, Muhammad
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2022, 16 (03): : 830 - 860
  • [50] Automated measures of sentiment via transformer- and lexicon-based sentiment analysis (TLSA)
    Zhao, Xinyan
    Wong, Chau-Wai
    JOURNAL OF COMPUTATIONAL SOCIAL SCIENCE, 2024, 7 (01): : 145 - 170