Leveraging distant supervision and deep learning for twitter sentiment and emotion classification

被引:4
|
作者
Kastrati, Muhamet [1 ]
Kastrati, Zenun [2 ]
Imran, Ali Shariq [3 ]
Biba, Marenglen [1 ]
机构
[1] Univ New York Tirana, Dept Comp Sci, Tirana 1046, Albania
[2] Linnaeus Univ, Dept Informat, S-35195 Vaxjo, Sweden
[3] Norwegian Univ Sci & Technol NTNU, Dept Comp Sci, N-2815 Gjovik, Norway
关键词
Distant supervision; Emotion detection; Sentiment analysis; Deep learning; Transformers; Twitter; Emojis;
D O I
10.1007/s10844-024-00845-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays, various applications across industries, healthcare, and security have begun adopting automatic sentiment analysis and emotion detection in short texts, such as posts from social media. Twitter stands out as one of the most popular online social media platforms due to its easy, unique, and advanced accessibility using the API. On the other hand, supervised learning is the most widely used paradigm for tasks involving sentiment polarity and fine-grained emotion detection in short and informal texts, such as Twitter posts. However, supervised learning models are data-hungry and heavily reliant on abundant labeled data, which remains a challenge. This study aims to address this challenge by creating a large-scale real-world dataset of 17.5 million tweets. A distant supervision approach relying on emojis available in tweets is applied to label tweets corresponding to Ekman's six basic emotions. Additionally, we conducted a series of experiments using various conventional machine learning models and deep learning, including transformer-based models, on our dataset to establish baseline results. The experimental results and an extensive ablation analysis on the dataset showed that BiLSTM with FastText and an attention mechanism outperforms other models in both classification tasks, achieving an F1-score of 70.92% for sentiment classification and 54.85% for emotion detection.
引用
收藏
页码:1045 / 1070
页数:26
相关论文
共 50 条
  • [21] Efficient Sentiment Classification of Twitter Feeds
    Chamansingh, Nicholas
    Hosein, Patrick
    2016 IEEE INTERNATIONAL CONFERENCE ON KNOWLEDGE ENGINEERING AND APPLICATIONS (ICKEA 2016), 2016, : 78 - 82
  • [22] Sentiment Analysis Using Machine Learning and Deep Learning on Covid 19 Vaccine Twitter Data with Hadoop MapReduce
    Kul, Seda
    Sayar, Ahmet
    6TH INTERNATIONAL CONFERENCE ON SMART CITY APPLICATIONS, 2022, 393 : 859 - 868
  • [23] Sentiment Analysis for Women in STEM using Twitter and Transfer Learning Models
    Fouad, Shereen
    Alkooheji, Ezzaldin
    2023 IEEE 17TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING, ICSC, 2023, : 227 - 234
  • [24] Twitter Sentiment Analysis Based Public Emotion Detection using Machine Learning Algorithms
    Fahim, Safa
    Imran, Azhar
    Alzahrani, Abdulkareem
    Fahim, Marwa
    Alheeti, Khattab M. Ali
    Alfateh, Muhammad
    2022 17TH INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES (ICET'22), 2022, : 107 - 112
  • [25] Sentiment classification of twitter data belonging to renewable energy using machine learning
    Jain, Achin
    Jain, Vanita
    JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2019, 40 (02) : 521 - 533
  • [26] A Study of Sentiment Analysis Using Deep Learning Techniques on Thai Twitter Data
    Vateekul, Peerapon
    Koomsubha, Thanabhat
    2016 13TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (JCSSE), 2016, : 70 - 75
  • [27] A Deep Learning Model Enhanced with Emotion Semantics for Microblog Sentiment Analysis
    He Y.-X.
    Sun S.-T.
    Niu F.-F.
    Li F.
    Jisuanji Xuebao/Chinese Journal of Computers, 2017, 40 (04): : 773 - 790
  • [28] A Deep Neural Architecture for Sentence-Level Sentiment Classification in Twitter Social Networking
    Huy Nguyen
    Minh-Le Nguyen
    COMPUTATIONAL LINGUISTICS, PACLING 2017, 2018, 781 : 15 - 27
  • [29] Twitter Sentiment Analysis Using Deep Convolutional Neural Network
    Stojanovski, Dario
    Strezoski, Gjorgji
    Madjarov, Gjorgji
    Dimitrovski, Ivica
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS (HAIS 2015), 2015, 9121 : 726 - 737
  • [30] A Deep Learning Approach for Sentiment Classification of COVID-19 Vaccination Tweets
    Said, Haidi
    Tawfik, BenBella S.
    Makhlouf, Mohamed A.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (04) : 530 - 538