Estimating Sentiment via Probability and Information Theory

Cited by: 9
Authors
Labille, Kevin [1]
Alfarhood, Sultan [1]
Gauch, Susan [1]
Affiliation
[1] Univ Arkansas, Dept Comp Sci & Comp Engn, Fayetteville, AR 72701 USA
Source
KDIR: PROCEEDINGS OF THE 8TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT - VOL. 1 | 2016
Keywords
Lexicons; Sentiment Analysis; Data Mining; Text Mining; Opinion Mining
DOI
10.5220/0006072101210129
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Opinion detection and opinion analysis are challenging but important tasks. Such sentiment analysis can be done using traditional supervised learning methods, such as naive Bayes classification and support vector machines (SVM), or using unsupervised approaches based on a lexicon. Because lexicon-based sentiment analysis methods rely on an opinion dictionary, that is, a list of opinion-bearing or sentiment words, sentiment lexicons play a key role. Our work focuses on the task of generating such a lexicon. We propose several novel methods to automatically generate a general-purpose sentiment lexicon using a corpus-based approach. While most existing methods generate a lexicon from a list of seed sentiment words and a domain corpus, our work differs by generating a lexicon from scratch, applying probabilistic techniques and information-theoretic text mining techniques to a large, diverse corpus. We conclude by presenting an ensemble method that combines the two approaches. We evaluate the effectiveness of our methods by using the automatically generated lexicons for sentiment analysis. When used for sentiment analysis, our best single lexicon achieves an accuracy of 87.60% and the ensemble approach achieves 88.75%, both statistically significant improvements over the 81.60% obtained with a widely used sentiment lexicon.
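The abstract does not spell out the scoring formulas, so the following is only a minimal sketch of the general idea of a corpus-based lexicon, not the authors' method. It assumes a corpus of tokenized documents that already carry a "pos" or "neg" label, and it scores each word by the difference PMI(word, pos) - PMI(word, neg), which reduces to log2(P(word | pos) / P(word | neg)). The function names `build_lexicon` and `classify`, the add-one smoothing, and the tie-breaking rule are all illustrative assumptions.

```python
import math
from collections import Counter


def build_lexicon(docs):
    """Score words from labeled documents (hypothetical helper).

    docs: iterable of (tokens, label) pairs, label is "pos" or "neg".
    Returns {word: score}, where score = PMI(word, pos) - PMI(word, neg)
    = log2(P(word | pos) / P(word | neg)), with add-one smoothing.
    """
    word_label = Counter()    # (word, label) -> occurrence count
    label_tokens = Counter()  # label -> total token count
    for tokens, label in docs:
        for tok in tokens:
            word_label[(tok, label)] += 1
            label_tokens[label] += 1

    vocab = {word for (word, _) in word_label}
    size = len(vocab)
    lexicon = {}
    for word in vocab:
        # Add-one smoothing keeps the ratio finite for one-sided words.
        p_pos = (word_label[(word, "pos")] + 1) / (label_tokens["pos"] + size)
        p_neg = (word_label[(word, "neg")] + 1) / (label_tokens["neg"] + size)
        lexicon[word] = math.log2(p_pos / p_neg)
    return lexicon


def classify(tokens, lexicon):
    """Lexicon-based classification: sum the word scores.

    Words missing from the lexicon contribute 0; a total of exactly 0
    is assigned to "pos" (an arbitrary tie-breaking assumption).
    """
    total = sum(lexicon.get(tok, 0.0) for tok in tokens)
    return "pos" if total >= 0 else "neg"


if __name__ == "__main__":
    toy_corpus = [
        (["great", "movie"], "pos"),
        (["awful", "movie"], "neg"),
    ]
    lex = build_lexicon(toy_corpus)
    print(sorted(lex.items(), key=lambda item: item[1]))
    print(classify(["awful", "plot"], lex))
```

In this toy corpus, "movie" appears equally often under both labels and so receives a neutral score, while "great" and "awful" receive scores of opposite sign. An ensemble in the spirit of the abstract could combine the scores of several such lexicons, but how the paper combines them is not stated here.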
Pages: 121-129
Page count: 9
Related papers
50 in total
  • [21] Creating sentiment dictionaries via triangulation
    Steinberger, Josef
    Ebrahim, Mohamed
    Ehrmann, Maud
    Hurriyetoglu, Ali
    Kabadjov, Mijail
    Lenkova, Polina
    Steinberger, Ralf
    Tanev, Hristo
    Vazquez, Silvia
    Zavarella, Vanni
    DECISION SUPPORT SYSTEMS, 2012, 53 (04) : 689 - 694
  • [22] A Process for Exploring Employees' Relationships via Social Network and Sentiment Analysis
    Barahona, Jeydels
    Sun, Hung-Min
    DATA MINING AND BIG DATA, DMBD 2017, 2017, 10387 : 3 - 8
  • [24] Fine-grained Sentiment Analysis of Reviews Using Shallow Semantic Information
    Shi, Hanxiao
    Zhang, Yahui
    Zou, Yi
    Li, Xiaojun
    PROCEEDINGS OF 2017 IEEE INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC 2017), 2017, : 235 - 239
  • [25] Sentiment analysis of vegan related tweets using mutual information for feature selection
    Shamoi, Elvina
    Turdybay, Akniyet
    Shamoi, Pakizar
    Akhmetov, Iskander
    Jaxylykova, Assel
    Pak, Alexandr
    PEERJ COMPUTER SCIENCE, 2022, 8
  • [26] Tourism Companies Assessment via Social Media Using Sentiment Analysis
    AL-Bakri, Nadia F.
    Yonan, Janan Farag
    Sadiq, Ahmed T.
    Abid, Ali Sami
    BAGHDAD SCIENCE JOURNAL, 2022, 19 (02) : 422 - 429
  • [27] Estimating Telecommuting Rates in the USA Using Twitter Sentiment Analysis
    Juan Acosta-Sequeda
    Motahare Mohammadi
    Sarthak Patipati
    Abolfazl Mohammadian
    Sybil Derrible
    Data Science for Transportation, 2024, 6 (3):
  • [28] SeCredISData 2018: Special Session on Sentiment, Emotion, and Credibility of Information in Social Data
    Benamara, Farah
    Bosco, Cristina
    Patti, Viviana
    Fersini, Elisabetta
    Pasi, Gabriella
    Viviani, Marco
    2018 IEEE 5TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2018, : 638 - 640
  • [29] Encoding Syntactic Information into Transformers for Aspect-Based Sentiment Triplet Extraction
    Yuan, Li
    Wang, Jin
    Yu, Liang-Chih
    Zhang, Xuejie
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2024, 15 (02) : 722 - 735
  • [30] Estimating Trust in Virtual Teams: A Framework Based on Sentiment Analysis
    Maldonado da Cruz, Guilherme A.
    Moriya Huzita, Elisa Hatsue
    Feltrim, Valeria D.
    PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL 1 (ICEIS), 2016, : 464 - 471