Leveraging distant supervision and deep learning for Twitter sentiment and emotion classification

Cited by: 4
Authors
Kastrati, Muhamet [1]
Kastrati, Zenun [2]
Imran, Ali Shariq [3]
Biba, Marenglen [1]
Affiliations
[1] Univ New York Tirana, Dept Comp Sci, Tirana 1046, Albania
[2] Linnaeus Univ, Dept Informat, S-35195 Vaxjo, Sweden
[3] Norwegian Univ Sci & Technol NTNU, Dept Comp Sci, N-2815 Gjovik, Norway
Keywords
Distant supervision; Emotion detection; Sentiment analysis; Deep learning; Transformers; Twitter; Emojis
DOI
10.1007/s10844-024-00845-0
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Nowadays, various applications across industries, healthcare, and security have begun adopting automatic sentiment analysis and emotion detection in short texts, such as posts from social media. Twitter stands out as one of the most popular online social media platforms due to its easy, unique, and advanced accessibility using the API. On the other hand, supervised learning is the most widely used paradigm for tasks involving sentiment polarity and fine-grained emotion detection in short and informal texts, such as Twitter posts. However, supervised learning models are data-hungry and heavily reliant on abundant labeled data, which remains a challenge. This study aims to address this challenge by creating a large-scale real-world dataset of 17.5 million tweets. A distant supervision approach relying on emojis available in tweets is applied to label tweets corresponding to Ekman's six basic emotions. Additionally, we conducted a series of experiments using various conventional machine learning models and deep learning, including transformer-based models, on our dataset to establish baseline results. The experimental results and an extensive ablation analysis on the dataset showed that BiLSTM with FastText and an attention mechanism outperforms other models in both classification tasks, achieving an F1-score of 70.92% for sentiment classification and 54.85% for emotion detection.
Pages: 1045-1070
Number of pages: 26
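The emoji-based distant supervision described in the abstract can be illustrated with a minimal Python sketch. The emoji-to-emotion table and the helper names `distant_label` and `strip_label_emojis` are hypothetical and chosen only for illustration; the record does not give the authors' actual emoji lexicon or labeling rules. The sketch assumes a tweet receives a label only when all matched emojis point to the same one of Ekman's six basic emotions.

```python
from typing import Optional

# Hypothetical emoji lexicon (an assumption, not the paper's mapping).
# Each emoji is tied to one of Ekman's six basic emotions.
EMOJI_TO_EMOTION = {
    "😂": "joy",
    "😊": "joy",
    "😢": "sadness",
    "😡": "anger",
    "😱": "fear",
    "🤢": "disgust",
    "😲": "surprise",
}


def distant_label(tweet: str) -> Optional[str]:
    """Return an emotion label if every lexicon emoji in the tweet agrees.

    Tweets with no lexicon emoji, or with emojis that map to
    different emotions, are left unlabeled (None).
    """
    found = {
        emotion
        for emoji, emotion in EMOJI_TO_EMOTION.items()
        if emoji in tweet
    }
    return found.pop() if len(found) == 1 else None


def strip_label_emojis(tweet: str) -> str:
    """Remove the lexicon emojis so a classifier cannot read the label directly."""
    for emoji in EMOJI_TO_EMOTION:
        tweet = tweet.replace(emoji, "")
    # Collapse any whitespace left behind by the removed emojis.
    return " ".join(tweet.split())


if __name__ == "__main__":
    examples = [
        "Finally passed my exam 😊",
        "Missed the last train again 😢",
        "Not sure how I feel 😊😢",
        "No emoji here",
    ]
    for text in examples:
        print(distant_label(text), "|", strip_label_emojis(text))
```

Discarding tweets with conflicting emojis is one simple way to limit label noise; other policies, such as taking the most frequent emotion, would be equally plausible under this sketch.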