A model for sentiment and emotion analysis of unstructured social media text

被引:0
作者
Jitendra Kumar Rout
Kim-Kwang Raymond Choo
Amiya Kumar Dash
Sambit Bakshi
Sanjay Kumar Jena
Karen L. Williams
机构
[1] National Institute of Technology,Department of Computer Science
[2] University of Texas at San Antonio,Department of Information Systems and Cyber Security
来源
Electronic Commerce Research | 2018年 / 18卷
关键词
Sentiment analysis; Bag-of-words; Lexicon; Laplace smoothing; Parts-of-Speech (POS); Machine learning;
D O I
暂无
中图分类号
学科分类号
摘要
Sentiment analysis has applications in diverse contexts such as in the gathering and analysis of opinions of individuals about various products, issues, social, and political events. Understanding public opinion can help improve decision making. Opinion mining is a way of retrieving information via search engines, blogs, microblogs and social networks. Individual opinions are unique to each person, and Twitter tweets are an invaluable source of this type of data. However, the huge volume and unstructured nature of text/opinion data pose a challenge to analyzing the data efficiently. Accordingly, proficient algorithms/computational strategies are required for mining and condensing tweets as well as finding sentiment bearing words. Most existing computational methods/models/algorithms in the literature for identifying sentiments from such unstructured data rely on machine learning techniques with the bag-of-word approach as their basis. In this work, we use both unsupervised and supervised approaches on various datasets. Unsupervised approach is being used for the automatic identification of sentiment for tweets acquired from Twitter public domain. Different machine learning algorithms such as Multinomial Naive Bayes (MNB), Maximum Entropy and Support Vector Machines are applied for sentiment identification of tweets as well as to examine the effectiveness of various feature combinations. In our experiment on tweets, we achieve an accuracy of 80.68% using the proposed unsupervised approach, in comparison to the lexicon based approach (the latter gives an accuracy of 75.20%). In our experiments, the supervised approach where we combine unigram, bigram and Part-of-Speech as feature is efficient in finding emotion and sentiment of unstructured data. For short message services, using the unigram feature with MNB classifier allows us to achieve an accuracy of 67%.
引用
收藏
页码:181 / 199
页数:18
相关论文
共 50 条
[11]   Social Media Sentiment Analysis for Solar Eclipse with Text Mining [J].
Korkmaz, Adem ;
Bulut, Selma .
ACTA INFOLOGICA, 2023, 7 (01) :187-196
[12]   Virtual human on social media: Text mining and sentiment analysis [J].
Li, Sihong ;
Chen, Jinglong .
TECHNOLOGY IN SOCIETY, 2024, 78
[13]   Fine-Grained Sentiment Analysis of Social Media with Emotion Sensing [J].
Wang, Zhaoxia ;
Chong, Chee Seng ;
Lan, Landy ;
Yang, Yinping ;
Ho, Seng Beng ;
Tong, Joo Chuan .
PROCEEDINGS OF 2016 FUTURE TECHNOLOGIES CONFERENCE (FTC), 2016, :1361-1364
[14]   Understanding Environmental Posts: Sentiment and Emotion Analysis of Social Media Data [J].
Amangeldi, Daniyar ;
Usmanova, Aida ;
Shamoi, Pakizar .
IEEE ACCESS, 2024, 12 :33504-33523
[15]   Beyond Sentiment Analysis: A Review of Recent Trends in Text Based Sentiment Analysis and Emotion Detection [J].
Hung, Lai Po ;
Alias, Suraya .
JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2023, 27 (01) :84-95
[16]   Mixed emotion extraction analysis and visualisation of social media text [J].
Li, Yuming ;
Chan, Johnny ;
Peko, Gabrielle ;
Sundaram, David .
DATA & KNOWLEDGE ENGINEERING, 2023, 148
[17]   Emotion detection in text: advances in sentiment analysis [J].
Tamilkodi, R. ;
Sujatha, B. ;
Leelavathy, N. .
INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2025, 16 (02) :552-560
[18]   Emotion and sentiment analysis from Twitter text [J].
Sailunaz, Kashfia ;
Alhajj, Reda .
JOURNAL OF COMPUTATIONAL SCIENCE, 2019, 36
[19]   A survey of sentiment analysis in social media [J].
Lin Yue ;
Weitong Chen ;
Xue Li ;
Wanli Zuo ;
Minghao Yin .
Knowledge and Information Systems, 2019, 60 :617-663
[20]   SentiVerb system: classification of social media text using sentiment analysis [J].
Shailendra Kumar Singh ;
Manoj Kumar Sachan .
Multimedia Tools and Applications, 2019, 78 :32109-32136