A lightweight convolutional swin transformer with cutmix augmentation and CBAM attention for compound emotion recognition

Cited by: 1
Authors
Nidhi [1]
Verma, Bindu [1]
Affiliations
[1] Delhi Technol Univ, Dept Informat Technol, New Delhi 110042, India
Keywords
Swin transformer; CutMix augmentation; CBAM; Compound emotion recognition; Facial expression
DOI
10.1007/s10489-024-05598-5
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Facial emotion recognition is a complicated task because of individual variation in facial characteristics as well as racial and cultural differences. Psychological studies show that, beyond the basic emotions, there are complex expressions composed of two basic emotions, such as "Happily Disgusted", "Happily Surprised", and "Sadly Surprised". Compound emotion recognition is challenging because very few compound emotion datasets are publicly available, and those that exist are imbalanced. In this paper, we propose LSwin-CBAM for the classification of compound emotions. To address dataset imbalance, the proposed model uses the CutMix technique for data augmentation. It also incorporates the CBAM attention mechanism to emphasize the relevant features in an image, together with a Swin transformer that uses fewer Swin transformer blocks, which reduces computational complexity in terms of trainable parameters and also improves overall classification accuracy. Experimental results for LSwin-CBAM on the RAF-DB and EmotioNet datasets show that the proposed transformer-based network recognizes compound emotions well.
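The abstract names CutMix as the augmentation used against class imbalance but gives no settings. As a rough illustration only, the general CutMix idea (paste a random rectangular patch from one image into another and mix the labels in proportion to the patch area) can be sketched in NumPy as below. The function names `rand_bbox` and `cutmix`, the Beta parameter `alpha=1.0`, and the array layout are assumptions made for this sketch, not details taken from the paper.

```python
import numpy as np


def rand_bbox(height, width, lam, rng):
    """Sample a box covering roughly (1 - lam) of the image area (hypothetical helper)."""
    cut_ratio = np.sqrt(1.0 - lam)
    cut_h = int(height * cut_ratio)
    cut_w = int(width * cut_ratio)
    # Random box centre; the box is clipped to the image borders.
    cy = int(rng.integers(0, height))
    cx = int(rng.integers(0, width))
    top = max(cy - cut_h // 2, 0)
    bottom = min(cy + cut_h // 2, height)
    left = max(cx - cut_w // 2, 0)
    right = min(cx + cut_w // 2, width)
    return top, bottom, left, right


def cutmix(images, labels, alpha=1.0, rng=None):
    """Apply CutMix to a batch.

    images: float array of shape (N, H, W, C)
    labels: one-hot array of shape (N, K)
    alpha:  Beta distribution parameter (assumed value, not from the paper)
    """
    rng = np.random.default_rng() if rng is None else rng
    n, h, w, _ = images.shape
    lam = rng.beta(alpha, alpha)
    perm = rng.permutation(n)  # partner image for each sample
    top, bottom, left, right = rand_bbox(h, w, lam, rng)

    mixed = images.copy()
    mixed[:, top:bottom, left:right, :] = images[perm, top:bottom, left:right, :]

    # Recompute the mixing weight from the clipped box so labels match pixels.
    lam = 1.0 - (bottom - top) * (right - left) / (h * w)
    mixed_labels = lam * labels + (1.0 - lam) * labels[perm]
    return mixed, mixed_labels
```

Because the label weights are recomputed from the actual pasted area, each mixed label row still sums to one, so the output can be fed to a standard soft-label cross-entropy loss.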
Pages: 7793-7809
Page count: 17