A lightweight convolutional swin transformer with cutmix augmentation and CBAM attention for compound emotion recognition

被引:1
|
作者
Nidhi [1 ]
Verma, Bindu [1 ]
机构
[1] Delhi Technol Univ, Dept Informat Technol, New Delhi 110042, India
关键词
Swin transformer; CutMix augmentation; CBAM; Compound emotion recognition; FACIAL EXPRESSION;
D O I
10.1007/s10489-024-05598-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Facial emotion recognition has become a complicated task due to individual variations in facial characteristics, as well as racial and cultural variances. Different psychological studies show that there are complex expressions other than basic emotions which are made up of two basic emotions like"Happily Disgusted", "Happily Surprised", "Sadly Surprised", etc. Compound emotion recognition is challenging due to very less publicly available compound emotion datasets which are imbalanced too. In this paper, we have proposed an LSwin-CBAM for the classification of compound emotions. To address the problem of the imbalanced dataset, the proposed model exploits the cutmix augmentation technique for data augmentation. It also incorporates the CBAM attention mechanism to emphasize the relevant features in an image and swin transformer with fewer swin transformer blocks which leads to less computational complexity in terms of trainable parameters and improves the overall classification accuracy as well. The experimental results of LSwin-CBAM on RAF-DB and EmotioNet datasets show that the proposed transformer-based network can well recognize compound emotions.
引用
收藏
页码:7793 / 7809
页数:17
相关论文
共 50 条
  • [1] A Lightweight Transformer with Convolutional Attention
    Zeng, Kungan
    Paik, Incheon
    2020 11TH INTERNATIONAL CONFERENCE ON AWARENESS SCIENCE AND TECHNOLOGY (ICAST), 2020,
  • [2] EEG emotion recognition using attention-based convolutional transformer neural network
    Gong, Linlin
    Li, Mingyang
    Zhang, Tao
    Chen, Wanzhong
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 84
  • [3] Attention-Guided CutMix Data Augmentation Network for Fine-Grained Bird Recognition
    Guo, Wenming
    Wang, Yifei
    Han, Fang
    PROCEEDINGS OF 2021 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INFORMATION SYSTEMS (ICAIIS '21), 2021,
  • [4] GCFormer: A Graph Convolutional Transformer for Speech Emotion Recognition
    Gao, Yingxue
    Zhao, Huan
    Xiao, Yufeng
    Zhang, Zixing
    PROCEEDINGS OF THE 25TH INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, ICMI 2023, 2023, : 307 - 313
  • [5] Plant disease recognition using residual convolutional enlightened Swin transformer networks
    Kalpana, Ponugoti
    Anandan, R.
    Hussien, Abdelazim G.
    Migdady, Hazem
    Abualigah, Laith
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [6] Facial Emotion Recognition Combining Auxiliary Classifiers and Multiscale CBAM Attention Mechanisms
    Shang, Yujie
    Yan, Fei
    Liu, Yunqing
    Li, Qi
    Zhang, Qiong
    IEEE ACCESS, 2024, 12 : 57356 - 57365
  • [7] SAM: Self Attention Mechanism for Scene Text Recognition Based on Swin Transformer
    Shuai, Xiang
    Wang, Xiao
    Wang, Wei
    Yuan, Xin
    Xu, Xin
    MULTIMEDIA MODELING (MMM 2022), PT I, 2022, 13141 : 443 - 454
  • [8] Reparameterization of Lightweight Transformer for On-Device Speech Emotion Recognition
    Zhang, Zixing
    Dong, Zhongren
    Xu, Weixiang
    Han, Jing
    IEEE INTERNET OF THINGS JOURNAL, 2025, 12 (04): : 4169 - 4182
  • [9] SeLT: Sonar Echo Image Recognition for Small Targets using Lightweight Swin Transformer
    Xia, Sijia
    Hou, Mengyang
    Han, Yina
    Xiao, Ziyuan
    Guo, Zihao
    Liu, Qingyu
    Ma, Yuanliang
    OCEANS 2024 - SINGAPORE, 2024,
  • [10] A coordinate attention enhanced swin transformer for handwriting recognition of Parkinson's disease
    Wang, Nana
    Niu, Xuesen
    Yuan, Yiyang
    Sun, Yunze
    Li, Ran
    You, Guoliang
    Zhao, Aite
    IET IMAGE PROCESSING, 2023, 17 (09) : 2686 - 2697