A model for sentiment and emotion analysis of unstructured social media text

Cited by: 0
Authors
Jitendra Kumar Rout
Kim-Kwang Raymond Choo
Amiya Kumar Dash
Sambit Bakshi
Sanjay Kumar Jena
Karen L. Williams
Affiliations
[1] National Institute of Technology, Department of Computer Science
[2] University of Texas at San Antonio, Department of Information Systems and Cyber Security
Source
Electronic Commerce Research | 2018 / Vol. 18
Keywords
Sentiment analysis; Bag-of-words; Lexicon; Laplace smoothing; Parts-of-Speech (POS); Machine learning
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
Sentiment analysis has applications in diverse contexts, such as gathering and analyzing individuals' opinions about products, issues, and social and political events. Understanding public opinion can help improve decision making. Opinion mining is a way of retrieving information via search engines, blogs, microblogs and social networks. Individual opinions are unique to each person, and Twitter tweets are an invaluable source of this type of data. However, the huge volume and unstructured nature of text/opinion data make it challenging to analyze efficiently. Accordingly, efficient algorithms and computational strategies are required for mining and condensing tweets as well as for finding sentiment-bearing words. Most existing computational methods in the literature for identifying sentiment in such unstructured data rely on machine learning techniques with the bag-of-words approach as their basis. In this work, we use both unsupervised and supervised approaches on various datasets. The unsupervised approach is used for the automatic identification of sentiment in tweets acquired from the Twitter public domain. Different machine learning algorithms, such as Multinomial Naive Bayes (MNB), Maximum Entropy and Support Vector Machines, are applied to identify the sentiment of tweets and to examine the effectiveness of various feature combinations. In our experiment on tweets, we achieve an accuracy of 80.68% using the proposed unsupervised approach, compared with 75.20% for the lexicon-based approach. In our experiments, the supervised approach that combines unigrams, bigrams and Part-of-Speech tags as features is efficient in finding the emotion and sentiment of unstructured data. For short message services, using the unigram feature with the MNB classifier allows us to achieve an accuracy of 67%.
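To make the unigram plus MNB combination named in the abstract more concrete, the following is a minimal sketch of a multinomial Naive Bayes classifier with Laplace (add-one) smoothing over bag-of-words unigram counts. It is not the authors' implementation: the function names `train_mnb` and `predict_mnb`, the whitespace tokenizer, and the four toy sentences are assumptions made purely for illustration, and the toy data has no connection to the datasets or accuracy figures reported above.

```python
import math
from collections import Counter, defaultdict

def train_mnb(docs, labels, alpha=1.0):
    """Collect per-class unigram counts (hypothetical helper, not from the paper)."""
    class_docs = Counter(labels)        # number of documents per class
    word_counts = defaultdict(Counter)  # class -> unigram counts
    vocab = set()
    for text, label in zip(docs, labels):
        tokens = text.lower().split()   # naive whitespace tokenizer (assumption)
        word_counts[label].update(tokens)
        vocab.update(tokens)
    return {
        "class_docs": class_docs,
        "word_counts": word_counts,
        "vocab": vocab,
        "alpha": alpha,                 # alpha = 1.0 gives Laplace smoothing
        "n_docs": len(docs),
    }

def predict_mnb(model, text):
    """Return the class with the highest smoothed log-posterior."""
    # Words never seen in training are skipped.
    tokens = [t for t in text.lower().split() if t in model["vocab"]]
    vocab_size = len(model["vocab"])
    alpha = model["alpha"]
    best_label, best_score = None, -math.inf
    for label, n_class_docs in model["class_docs"].items():
        counts = model["word_counts"][label]
        total = sum(counts.values())
        # log P(class) + sum of log P(word | class), smoothed by alpha
        score = math.log(n_class_docs / model["n_docs"])
        for tok in tokens:
            score += math.log((counts[tok] + alpha) / (total + alpha * vocab_size))
        if score > best_score:
            best_label, best_score = label, score
    return best_label

# Toy training data, invented for this sketch only.
TOY_DOCS = [
    "i love this phone",
    "great service and great food",
    "i hate this movie",
    "terrible service and awful food",
]
TOY_LABELS = ["pos", "pos", "neg", "neg"]

model = train_mnb(TOY_DOCS, TOY_LABELS)
prediction = predict_mnb(model, "love great food")
```

Smoothing matters here because a word such as "love" never occurs in the negative class; without the added `alpha`, its zero count would make the logarithm undefined for that class. Bigram or Part-of-Speech features, as mentioned in the abstract, would require extending the tokenization step and are not shown.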
Pages: 181-199
Page count: 18
Related papers
50 in total
  • [1] A model for sentiment and emotion analysis of unstructured social media text
    Rout, Jitendra Kumar
    Choo, Kim-Kwang Raymond
    Dash, Amiya Kumar
    Bakshi, Sambit
    Jena, Sanjay Kumar
    Williams, Karen L.
    ELECTRONIC COMMERCE RESEARCH, 2018, 18 (01) : 181 - 199
  • [2] An emotion feature highlighting method for sentiment analysis of social media text
    Shen, Zi-Qiang
    Song, Tao
    Mao, Qi-Rong
    Jiang, Zhen
    Journal of Computers (Taiwan), 2019, 30 (03) : 117 - 129
  • [3] A Hybrid Model for Social Media Sentiment Analysis for Indonesian Text
    Putra, Syopiansyah Jaya
    Khalil, Ismail
    Gunawan, Muhamad Nur
    Amin, Riva'I
    Sutabri, Tata
    IIWAS2018: THE 20TH INTERNATIONAL CONFERENCE ON INFORMATION INTEGRATION AND WEB-BASED APPLICATIONS & SERVICES, 2014, : 297 - 301
  • [4] ANALYSIS OF SENTIMENT IN UNSTRUCTURED TEXT
    Ministr, Jan
    Racek, Jaroslav
    IDIMT-2011: INTERDISCIPLINARITY IN COMPLEX SYSTEMS, 2011, 36 : 299 - +
  • [5] Sentiment Analysis on Social Media for Emotion Classification
    Tanna, Dilesh
    Dudhane, Manasi
    Sardar, Amrut
    Deshpande, Kiran
    Deshmukh, Neha
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS 2020), 2020, : 911 - 915
  • [6] Sentiment Analysis of Malay Social Media Text
    Chekima, Khalifa
    Alfred, Rayner
    COMPUTATIONAL SCIENCE AND TECHNOLOGY, ICCST 2017, 2018, 488 : 205 - 219
  • [7] Detection of Sentiment Polarity of Unstructured Multi-Language Text from Social Media
    Ahmed, Saad
    Hina, Saman
    Asif, Raheela
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (07) : 199 - 203
  • [8] Corpora For Sentiment Analysis Of Arabic Text In Social Media
    Itani, Maher
    Roast, Chris
    Al-Khayatt, Samir
    2017 8TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2017, : 64 - 69
  • [9] Statistical Text Analysis and Sentiment Classification in Social Media
    Cho, Sang-Hyun
    Kang, Hang-Bong
    PROCEEDINGS 2012 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2012, : 1112 - 1117
  • [10] Sentiment Miner: A Prototype for Sentiment Analysis of Unstructured Data and Text
    Shahbaz, Muhammad
    Guergachi, Aziz
    Rehman, Rana Tanzeel ur
    2014 IEEE 27TH CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2014,