A Hybrid Approach of Machine Learning and Lexicons to Sentiment Analysis: Enhanced Insights from Twitter Data of Natural Disasters

被引:0
作者
Shalak Mendon
Pankaj Dutta
Abhishek Behl
Stefan Lessmann
机构
[1] Electronic City,Wipro Limited
[2] Indian Institute of Technology Bombay,SJM School of Management
[3] Humboldt-Universität zu Berlin,Chair of Information Systems, School of Business and Economics
来源
Information Systems Frontiers | 2021年 / 23卷
关键词
Sentimental analysis; K-means clustering; Latent Dirichlet allocation; Machine learning; Twitter; Natural disasters;
D O I
暂无
中图分类号
学科分类号
摘要
The success factor of sentimental analysis lies in identifying the most occurring and relevant opinions among users relating to the particular topic. In this paper, we develop a framework to analyze users’ sentiments on Twitter on natural disasters using the data pre-processing techniques and a hybrid of machine learning, statistical modeling, and lexicon-based approach. We choose TF-IDF and K-means for sentiment classification among affinitive and hierarchical clustering. Latent Dirichlet Allocation, a pipeline of Doc2Vec and K-means used to capture themes, then perform multi-level polarity indices classification and its time series analysis. In our study, we draw insights from 243,746 tweets for Kerala’s 2018 natural disasters in India. The key findings of the study are the classification of sentiments based on similarity and polarity indices and identifying themes among the topics discussed on Twitter. We observe different sets of emotions and influencers, among others. Through this case example of Kerala floods, it shows how the government and other organizations could track the positive/negative sentiments concerning time and location; gain a better understanding of the topic of discussion trending among the public, and collaborate with crucial Twitter users/influencers to spread and figure out the gaps in the implementation of schemes in terms of design and execution. This research’s uniqueness is the streamlined and efficient combination of algorithms and techniques embedded in the framework used in achieving the above output, which can be integrated into a platform with GUI for further automation.
引用
收藏
页码:1145 / 1168
页数:23
相关论文
共 50 条
  • [31] User-Level Twitter Sentiment Analysis with a Hybrid Approach
    Er, Meng Joo
    Liu, Fan
    Wang, Ning
    Zhang, Yong
    Pratama, Mahardhika
    ADVANCES IN NEURAL NETWORKS - ISNN 2016, 2016, 9719 : 426 - 433
  • [32] A domain transferable lexicon set for Twitter sentiment analysis using a supervised machine learning approach
    Ghiassi, M.
    Lee, S.
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 106 : 197 - 216
  • [33] Analysis of Various Machine Learning Algorithms for Enhanced Opinion Mining using Twitter Data Streams
    Kumar, Praveen
    Choudhury, Tanupriya
    Rawat, Seema
    Jayaraman, Shobhna
    2016 INTERNATIONAL CONFERENCE ON MICRO-ELECTRONICS AND TELECOMMUNICATION ENGINEERING (ICMETE), 2016, : 265 - 270
  • [34] Twitter Sentiment Analysis Based Public Emotion Detection using Machine Learning Algorithms
    Fahim, Safa
    Imran, Azhar
    Alzahrani, Abdulkareem
    Fahim, Marwa
    Alheeti, Khattab M. Ali
    Alfateh, Muhammad
    2022 17TH INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES (ICET'22), 2022, : 107 - 112
  • [35] A Prediction of South African Public Twitter Opinion using a Hybrid Sentiment Analysis Approach
    Shackleford, Matthew Brett
    Adeliyi, Timothy Temitope
    Joseph, Seena
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (10) : 156 - 165
  • [36] Sentiment analysis on product reviews on twitter using Machine Learning Approaches
    Jayakody, J. P. U. S. D.
    Kumara, B. T. G. S.
    2021 INTERNATIONAL CONFERENCE ON DECISION AID SCIENCES AND APPLICATION (DASA), 2021,
  • [37] Sentiment analysis of tweets through Altmetrics: A machine learning approach
    Hassan, Saeed-Ul
    Saleem, Aneela
    Soroya, Saira Hanif
    Safder, Iqra
    Iqbal, Sehrish
    Jamil, Saqib
    Bukhari, Faisal
    Aljohani, Naif Radi
    Nawaz, Raheel
    JOURNAL OF INFORMATION SCIENCE, 2021, 47 (06) : 712 - 726
  • [38] A new big data approach for topic classification and sentiment analysis of Twitter data
    Rodrigues, Anisha P.
    Chiplunkar, Niranjan N.
    EVOLUTIONARY INTELLIGENCE, 2022, 15 (02) : 877 - 887
  • [39] An enhanced discovery of multiple natural disasters using machine learning model
    Thirukrishna, J. T.
    EARTH SCIENCE INFORMATICS, 2025, 18 (03)
  • [40] A new big data approach for topic classification and sentiment analysis of Twitter data
    Anisha P. Rodrigues
    Niranjan N. Chiplunkar
    Evolutionary Intelligence, 2022, 15 : 877 - 887