Leveraging distant supervision and deep learning for twitter sentiment and emotion classification

被引:4
|
作者
Kastrati, Muhamet [1 ]
Kastrati, Zenun [2 ]
Imran, Ali Shariq [3 ]
Biba, Marenglen [1 ]
机构
[1] Univ New York Tirana, Dept Comp Sci, Tirana 1046, Albania
[2] Linnaeus Univ, Dept Informat, S-35195 Vaxjo, Sweden
[3] Norwegian Univ Sci & Technol NTNU, Dept Comp Sci, N-2815 Gjovik, Norway
关键词
Distant supervision; Emotion detection; Sentiment analysis; Deep learning; Transformers; Twitter; Emojis;
D O I
10.1007/s10844-024-00845-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays, various applications across industries, healthcare, and security have begun adopting automatic sentiment analysis and emotion detection in short texts, such as posts from social media. Twitter stands out as one of the most popular online social media platforms due to its easy, unique, and advanced accessibility using the API. On the other hand, supervised learning is the most widely used paradigm for tasks involving sentiment polarity and fine-grained emotion detection in short and informal texts, such as Twitter posts. However, supervised learning models are data-hungry and heavily reliant on abundant labeled data, which remains a challenge. This study aims to address this challenge by creating a large-scale real-world dataset of 17.5 million tweets. A distant supervision approach relying on emojis available in tweets is applied to label tweets corresponding to Ekman's six basic emotions. Additionally, we conducted a series of experiments using various conventional machine learning models and deep learning, including transformer-based models, on our dataset to establish baseline results. The experimental results and an extensive ablation analysis on the dataset showed that BiLSTM with FastText and an attention mechanism outperforms other models in both classification tasks, achieving an F1-score of 70.92% for sentiment classification and 54.85% for emotion detection.
引用
收藏
页码:1045 / 1070
页数:26
相关论文
共 50 条
  • [41] Learning Discriminative Sentiment Chunk Vectors for Twitter Sentiment Analysis
    Yan, Leiming
    Zheng, Wenying
    Zhang, Huajie
    Tao, Hao
    He, Ming
    JOURNAL OF INTERNET TECHNOLOGY, 2017, 18 (07): : 1605 - 1613
  • [42] A deep learning framework for sarcastic sentiment classification in opinion polls
    Fredrick Boafo
    Solomon Mensah
    Elizabeth Akuafum Dick
    Journal of Data, Information and Management, 2023, 5 (4): : 333 - 343
  • [43] Spanish Pre-Trained CaTrBETO Model for Sentiment Classification in Twitter
    Pijal, Washington
    Armijos, Arianna
    Llumiquinga, Jose
    Lalvay, Sebastian
    Allauca, Steven
    Cuenca, Erick
    2022 THIRD INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS AND SOFTWARE TECHNOLOGIES, ICI2ST, 2022, : 93 - 98
  • [44] Evaluating deep learning models for sentiment classification
    Karakus, Betul Ay
    Talo, Muhammed
    Hallac, Ibrahim Riza
    Aydin, Galip
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2018, 30 (21)
  • [45] Investigating the effectiveness of Twitter sentiment in cryptocurrency close price prediction by using deep learning
    Amirshahi, Bahareh
    Lahmiri, Salim
    EXPERT SYSTEMS, 2023,
  • [46] Investigating the effectiveness of Twitter sentiment in cryptocurrency close price prediction by using deep learning
    Amirshahi, Bahareh
    Lahmiri, Salim
    EXPERT SYSTEMS, 2025, 42 (01)
  • [47] Sentiment and Emotion on Twitter: The Case of the Global Consumer Electronics Industry
    Pezoa-Fuentes, Claudia
    Garcia-Rivera, Danilo
    Matamoros-Rojas, Sebastian
    JOURNAL OF THEORETICAL AND APPLIED ELECTRONIC COMMERCE RESEARCH, 2023, 18 (02): : 765 - 776
  • [48] Sentiment analysis of COVID-19 related social distancing using twitter data based on deep learning
    Lanxue Dang
    Chunyu Wang
    Ming-Hsiang Tsou
    Yan-e Hou
    Hongyu Han
    Multimedia Tools and Applications, 2024, 83 : 32587 - 32612
  • [49] Deep Learning and Machine Learning-Based Model for Conversational Sentiment Classification
    Ullah, Sami
    Talib, Muhammad Ramzan
    Rana, Toqir A.
    Hanif, Muhammad Kashif
    Awais, Muhammad
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (02): : 2323 - 2339
  • [50] Sentiment classification on product reviews using machine learning and deep learning techniques
    Singh, Neha
    Jaiswal, Umesh Chandra
    INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2024, 15 (12) : 5726 - 5741