Climate Change Sentiment Analysis Using Lexicon, Machine Learning and Hybrid Approaches

被引:23
|
作者
Sham, Nabila Mohamad [1 ]
Mohamed, Azlinah [2 ]
机构
[1] Univ Teknol MARA UiTM, Fac Comp & Math Sci, Shah Alam 40450, Malaysia
[2] Univ Teknol MARA UiTM, Inst Big Data Analyt & Artificial Intelligence, Shah Alam 40450, Malaysia
关键词
climate change; sentiment analysis; lexicon; machine learning; social media;
D O I
10.3390/su14084723
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The emissions of greenhouse gases, such as carbon dioxide, into the biosphere have the consequence of warming up the planet, hence the existence of climate change. Sentiment analysis has been a popular subject and there has been a plethora of research conducted in this area in recent decades, typically on social media platforms such as Twitter, due to the proliferation of data generated today during discussions on climate change. However, there is not much research on the performances of different sentiment analysis approaches using lexicon, machine learning and hybrid methods, particularly within this domain-specific sentiment. This study aims to find the most effective sentiment analysis approach for climate change tweets and related domains by performing a comparative evaluation of various sentiment analysis approaches. In this context, seven lexicon-based approaches were used, namely SentiWordNet, TextBlob, VADER, SentiStrength, Hu and Liu, MPQA, and WKWSCI. Meanwhile, three machine learning classifiers were used, namely Support Vector Machine, Naive Bayes, and Logistic Regression, by using two feature extraction techniques, which were Bag-of-Words and TF-IDF. Next, the hybridization between lexicon-based and machine learning-based approaches was performed. The results indicate that the hybrid method outperformed the other two approaches, with hybrid TextBlob and Logistic Regression achieving an F1-score of 75.3%; thus, this has been chosen as the most effective approach. This study also found that lemmatization improved the accuracy of machine learning and hybrid approaches by 1.6%. Meanwhile, the TF-IDF feature extraction technique was slightly better than BoW by increasing the accuracy of the Logistic Regression classifier by 0.6%. However, TF-IDF and BoW had an identical effect on SVM and NB. Future works will include investigating the suitability of deep learning approaches toward this domain-specific sentiment on social media platforms.
引用
收藏
页数:28
相关论文
共 50 条
  • [1] Sentiment Analysis of Student Feedback Using Machine Learning and Lexicon Based Approaches
    Nasim, Zarmeen
    Rajput, Quratulain
    Haider, Sajjad
    2017 5TH INTERNATIONAL CONFERENCE ON RESEARCH AND INNOVATION IN INFORMATION SYSTEMS (ICRIIS 2017): SOCIAL TRANSFORMATION THROUGH DATA SCIENCE, 2017,
  • [2] Sentiment analysis in Nepali: Exploring machine learning and lexicon-based approaches
    Piryani, Rajesh
    Piryani, Bhawna
    Singh, Vivek Kumar
    Pinto, David
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 39 (02) : 2201 - 2212
  • [3] Comparative Analysis of Lexicon and Machine Learning Approach for Sentiment Analysis
    Srivastava, Roopam
    Bharti, P. K.
    Verma, Parul
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (03) : 71 - 77
  • [4] An Analysis on Machine Learning Approaches for Sentiment Analysis
    Shrivash, Brajesh Kumar
    Verma, Dinesh Kumar
    Pandey, Prateek
    SMART SYSTEMS: INNOVATIONS IN COMPUTING (SSIC 2021), 2022, 235 : 499 - 513
  • [5] An empirical research on sentiment analysis using machine learning approaches
    Kabir M.
    Kabir M.M.J.
    Xu S.
    Badhon B.
    International Journal of Computers and Applications, 2021, 43 (10) : 1011 - 1019
  • [6] A Combination of Machine Learning and Lexicon Based Techniques for Sentiment Analysis
    Neshan, Seydeh Akram Saadat
    Akbari, Reza
    2020 6TH INTERNATIONAL CONFERENCE ON WEB RESEARCH (ICWR), 2020, : 8 - 14
  • [7] A Machine Learning-Based Lexicon Approach for Sentiment Analysis
    Sahu, Tirath Prasad
    Khandekar, Sarang
    INTERNATIONAL JOURNAL OF TECHNOLOGY AND HUMAN INTERACTION, 2020, 16 (02) : 8 - 22
  • [8] Context Deployed Sentiment Analysis Using Hybrid Lexicon
    John, Annet
    John, Anice
    Sheik, Reshma
    PROCEEDINGS OF 2019 1ST INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION AND COMMUNICATION TECHNOLOGY (ICIICT 2019), 2019,
  • [9] A Review on Lexicon-Based and Machine Learning Political Sentiment Analysis Using Tweets
    Britzolakis, Alexandros
    Kondylakis, Haridimos
    Papadakis, Nikolaos
    INTERNATIONAL JOURNAL OF SEMANTIC COMPUTING, 2020, 14 (04) : 517 - 563
  • [10] Sentiment Analysis Based on Multiple Reviews by using Machine learning approaches
    D'souza, Stephina Rodney
    Sonawane, Kavita
    PROCEEDINGS OF THE 2019 3RD INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2019), 2019, : 188 - 193