Improving Multi-Label Emotion Classification on Imbalanced Social Media Data With BERT and Clipped Asymmetric Loss

Cited by: 0

Authors:

Ramakrishnan, Sandhya [1]

Dhinesh Babu, L. D. [1]

Affiliations:

[1] Vellore Inst Technol, Sch Comp Sci Engn & Informat Syst SCORE, Vellore 632014, India

Source:

IEEE ACCESS | 2025 / Vol. 13

Keywords:

Social networking (online); Emotion recognition; Taxonomy; Multi label classification; Encoding; Bidirectional control; Adaptation models; Transformers; Data augmentation; Correlation; Asymmetric loss; BERT; emotion recognition; GoEmotions dataset; imbalanced data; multi-label classification; NLP; SENTIMENT ANALYSIS; RECOGNITION;

DOI:

10.1109/ACCESS.2025.3557091

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

This research addresses the challenge of multi-label emotion classification on imbalanced datasets using a BERT-based model. Emotion classification, essential for applications like social media analysis and sentiment monitoring, often suffers from class imbalance, which hinders the detection of rare emotions. To address this, our model incorporates a clipped asymmetric loss function to prioritize minority classes while mitigating the dominance of frequent classes. We conducted extensive experimentation on the GoEmotions and SemEval-2018 Task 1C datasets to demonstrate the model's effectiveness in achieving improved precision, recall, and F1-scores across various taxonomies, including GoEmotions, Ekman, and sentiment-grouped levels. Our approach achieved a notable improvement in macro-average F1-scores, increasing from 0.46 (baseline) to 0.54 on the GoEmotions dataset and 0.59 on the SemEval-2018 dataset. The results indicate significant advancements over standard BERT implementations and state-of-the-art models, particularly in recognizing rare emotions, making the model a robust solution for real-world, multi-label emotion classification tasks under imbalanced settings.
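The abstract names a clipped asymmetric loss but does not state its formula. As a rough illustration only, the sketch below follows the general asymmetric-loss idea for multi-label classification: positives and negatives get separate focusing exponents, and the predicted probability of a negative label is shifted down by a margin and clipped at zero so that easy negatives contribute nothing. The function name, the parameter names (`gamma_pos`, `gamma_neg`, `margin`), and their default values are assumptions for illustration, not the authors' reported configuration.

```python
import math


def sigmoid(z):
    """Map a raw logit to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def clipped_asymmetric_loss(logits, targets, gamma_pos=0.0, gamma_neg=4.0,
                            margin=0.05, eps=1e-8):
    """Sum of per-label asymmetric losses for one example (illustrative sketch).

    logits  : one raw score per emotion label
    targets : 1 if the label applies to the text, otherwise 0
    All hyperparameter defaults are assumed values, not taken from the paper.
    """
    total = 0.0
    for z, y in zip(logits, targets):
        p = sigmoid(z)
        if y == 1:
            # Positive label: focal-style term with exponent gamma_pos.
            total -= (1.0 - p) ** gamma_pos * math.log(max(p, eps))
        else:
            # Negative label: shift the probability by the margin and clip at 0,
            # so negatives predicted below the margin add no loss at all.
            p_m = max(p - margin, 0.0)
            total -= p_m ** gamma_neg * math.log(max(1.0 - p_m, eps))
    return total


# One rare positive label, one easy negative, one hard negative.
loss = clipped_asymmetric_loss([0.0, -5.0, 3.0], [1, 0, 0])
```

With a larger `gamma_neg` than `gamma_pos`, the many easy negatives of frequent-label-dominated data are down-weighted more strongly than the positives, which is the usual motivation for this family of losses under class imbalance.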


Pages: 60589-60601

Number of pages: 13

