A Hybrid Approach of Machine Learning and Lexicons to Sentiment Analysis: Enhanced Insights from Twitter Data of Natural Disasters

被引:0
|
作者
Shalak Mendon
Pankaj Dutta
Abhishek Behl
Stefan Lessmann
机构
[1] Electronic City,Wipro Limited
[2] Indian Institute of Technology Bombay,SJM School of Management
[3] Humboldt-Universität zu Berlin,Chair of Information Systems, School of Business and Economics
来源
Information Systems Frontiers | 2021年 / 23卷
关键词
Sentimental analysis; K-means clustering; Latent Dirichlet allocation; Machine learning; Twitter; Natural disasters;
D O I
暂无
中图分类号
学科分类号
摘要
The success factor of sentimental analysis lies in identifying the most occurring and relevant opinions among users relating to the particular topic. In this paper, we develop a framework to analyze users’ sentiments on Twitter on natural disasters using the data pre-processing techniques and a hybrid of machine learning, statistical modeling, and lexicon-based approach. We choose TF-IDF and K-means for sentiment classification among affinitive and hierarchical clustering. Latent Dirichlet Allocation, a pipeline of Doc2Vec and K-means used to capture themes, then perform multi-level polarity indices classification and its time series analysis. In our study, we draw insights from 243,746 tweets for Kerala’s 2018 natural disasters in India. The key findings of the study are the classification of sentiments based on similarity and polarity indices and identifying themes among the topics discussed on Twitter. We observe different sets of emotions and influencers, among others. Through this case example of Kerala floods, it shows how the government and other organizations could track the positive/negative sentiments concerning time and location; gain a better understanding of the topic of discussion trending among the public, and collaborate with crucial Twitter users/influencers to spread and figure out the gaps in the implementation of schemes in terms of design and execution. This research’s uniqueness is the streamlined and efficient combination of algorithms and techniques embedded in the framework used in achieving the above output, which can be integrated into a platform with GUI for further automation.
引用
收藏
页码:1145 / 1168
页数:23
相关论文
共 50 条
  • [1] A Hybrid Approach of Machine Learning and Lexicons to Sentiment Analysis: Enhanced Insights from Twitter Data of Natural Disasters
    Mendon, Shalak
    Dutta, Pankaj
    Behl, Abhishek
    Lessmann, Stefan
    INFORMATION SYSTEMS FRONTIERS, 2021, 23 (05) : 1145 - 1168
  • [2] Sentiment Analysis of Twitter Data: A Hybrid Approach
    Srivastava, Ankit
    Singh, Vijendra
    Drall, Gurdeep Singh
    INTERNATIONAL JOURNAL OF HEALTHCARE INFORMATION SYSTEMS AND INFORMATICS, 2019, 14 (02) : 1 - 16
  • [3] Exerting 2D-Space of Sentiment Lexicons with Machine Learning Techniques: A Hybrid Approach for Sentiment Analysis
    Khan, Muhammad Yaseen
    Junejo, Khurum Nazir
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (06) : 599 - 608
  • [4] A Hybrid Approach for the Sentiment Analysis of Turkish Twitter Data
    Shehu, H. A.
    Tokat, S.
    ARTIFICIAL INTELLIGENCE AND APPLIED MATHEMATICS IN ENGINEERING PROBLEMS, 2020, 43 : 182 - 190
  • [5] Sentiment Analysis of Twitter Data Using Machine Learning Approaches and Semantic Analysis
    Gautam, Geetika
    Yadav, Divakar
    2014 SEVENTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2014, : 437 - 442
  • [6] Machine learning tool for exploring sentiment analysis on twitter data
    Biradar, Shanta H.
    Gorabal, J. V.
    Gupta, Gaurav
    MATERIALS TODAY-PROCEEDINGS, 2022, 56 : 1927 - 1934
  • [7] Sentiment Analysis of Twitter Data with Hybrid Learning for Recommender Applications
    Gandhe, Ketaki
    Varde, Aparna S.
    Du, Xu
    2018 9TH IEEE ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2018, : 57 - 63
  • [8] Sentiment analysis of financial Twitter posts on Twitter with the machine learning classifiers
    Cam, Handan
    Cam, Alper Veli
    Demirel, Ugur
    Ahmed, Sana
    HELIYON, 2024, 10 (01)
  • [9] Sentiment Analysis for Tourism Insights: A Machine Learning Approach
    Charfaoui, Kenza
    Mussard, Stephane
    STATS, 2024, 7 (04):
  • [10] Machine Learning Techniques for Sentiment Analysis of COVID-19-Related Twitter Data
    Braig, Niklas
    Benz, Alina
    Voth, Soeren
    Breitenbach, Johannes
    Buettner, Ricardo
    IEEE ACCESS, 2023, 11 : 14778 - 14803