A model for sentiment and emotion analysis of unstructured social media text

被引:0
作者
Jitendra Kumar Rout
Kim-Kwang Raymond Choo
Amiya Kumar Dash
Sambit Bakshi
Sanjay Kumar Jena
Karen L. Williams
机构
[1] National Institute of Technology,Department of Computer Science
[2] University of Texas at San Antonio,Department of Information Systems and Cyber Security
来源
Electronic Commerce Research | 2018年 / 18卷
关键词
Sentiment analysis; Bag-of-words; Lexicon; Laplace smoothing; Parts-of-Speech (POS); Machine learning;
D O I
暂无
中图分类号
学科分类号
摘要
Sentiment analysis has applications in diverse contexts such as in the gathering and analysis of opinions of individuals about various products, issues, social, and political events. Understanding public opinion can help improve decision making. Opinion mining is a way of retrieving information via search engines, blogs, microblogs and social networks. Individual opinions are unique to each person, and Twitter tweets are an invaluable source of this type of data. However, the huge volume and unstructured nature of text/opinion data pose a challenge to analyzing the data efficiently. Accordingly, proficient algorithms/computational strategies are required for mining and condensing tweets as well as finding sentiment bearing words. Most existing computational methods/models/algorithms in the literature for identifying sentiments from such unstructured data rely on machine learning techniques with the bag-of-word approach as their basis. In this work, we use both unsupervised and supervised approaches on various datasets. Unsupervised approach is being used for the automatic identification of sentiment for tweets acquired from Twitter public domain. Different machine learning algorithms such as Multinomial Naive Bayes (MNB), Maximum Entropy and Support Vector Machines are applied for sentiment identification of tweets as well as to examine the effectiveness of various feature combinations. In our experiment on tweets, we achieve an accuracy of 80.68% using the proposed unsupervised approach, in comparison to the lexicon based approach (the latter gives an accuracy of 75.20%). In our experiments, the supervised approach where we combine unigram, bigram and Part-of-Speech as feature is efficient in finding emotion and sentiment of unstructured data. For short message services, using the unigram feature with MNB classifier allows us to achieve an accuracy of 67%.
引用
收藏
页码:181 / 199
页数:18
相关论文
共 50 条
[41]   Supervised sentiment analysis in Czech social media [J].
Habernal, Ivan ;
Ptacek, Tomas ;
Steinberger, Josef .
INFORMATION PROCESSING & MANAGEMENT, 2014, 50 (05) :693-707
[42]   An Unsupervised Approach for Sentiment Analysis on Social Media Short Text Classification in Roman Urdu Sentiment analysis on short text classification in Roman Urdu [J].
Rana, Toqir A. ;
Shahzadi, Kiran ;
Rana, Tauseef ;
Arshad, Ahsan ;
Tubishat, Mohammad .
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2022, 21 (02)
[43]   Sentiment Analysis and Opinion Mining on Social Media using Machine Learning [J].
Charaan, Dilip R. M. ;
Ithayan, Vimala J. ;
Sankar, M. ;
Chithambaramani, R. ;
Sivaprakash, P. ;
Marichamy, D. .
2024 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT CYBER PHYSICAL SYSTEMS AND INTERNET OF THINGS, ICOICI 2024, 2024, :1176-1182
[44]   Bilingual Sentiment Analysis for a Code-mixed Punjabi English Social Media Text [J].
Yadav, Konark ;
Lamba, Aashish ;
Gupta, Dhruv ;
Gupta, Ansh ;
Karmakar, Purnendu ;
Saini, Sandeep .
PROCEEDINGS OF THE 2020 5TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND SECURITY (ICCCS-2020), 2020,
[45]   Sentiment Analysis for Code-Mixed Indian Social Media Text With Distributed Representation [J].
Shalini, K. ;
Ganesh, Barathi H. B. ;
Kumar, Anand M. ;
Soman, K. P. .
2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2018, :1126-1131
[46]   Emotion Predictor Using Social Media Text and Graphology [J].
Roy, Chhanda ;
Dey, Rishi ;
Chaudhuri, Chitrita ;
Das, Dipankar .
PROCEEDINGS OF THE 2019 IEEE 9TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (IACC 2019), 2019, :96-102
[47]   Integrating Emotion Detection with Sentiment Analysis for Enhanced Text Interpretation [J].
Ghosh, Arpan ;
Pandey, Naimish ;
AshokKumar, C. .
2024 SECOND INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTING AND INFORMATICS, ICICI 2024, 2024, :562-568
[48]   Automatic Indonesian Sentiment Lexicon Curation with Sentiment Valence Tuning for Social Media Sentiment Analysis [J].
Wijayanti, Rini ;
Arisal, Andria .
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 20 (01)
[49]   New Words Enlightened Sentiment Analysis in Social Media [J].
Cai, Chiyu ;
Li, Linjing ;
Zeng, Daniel .
IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS: CYBERSECURITY AND BIG DATA, 2016, :202-204
[50]   Applying Transfer Learning to Sentiment Analysis in Social Media [J].
de Arriba, Ariadna ;
Oriol, Marc ;
Franch, Xavier .
29TH IEEE INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE WORKSHOPS (REW 2021), 2021, :342-348