VaBTFER: An Effective Variant Binary Transformer for Facial Expression Recognition

Cited by: 2
Authors
Shen, Lei [1 ]
Jin, Xing [1 ]
Affiliations
[1] Nanjing Forestry Univ, Coll Informat Sci & Technol, Nanjing 100190, Peoples R China
Keywords
facial expression recognition; spatial-channel feature relevance Transformer; lightweight variant Transformer; binary quantization mechanism; multilayer channel reduction self-attention; dynamic learnable information extraction; NEURAL-NETWORK;
DOI
10.3390/s24010147
Chinese Library Classification
O65 [Analytical Chemistry];
Discipline classification codes
070302 ; 081704 ;
Abstract
Existing Transformer-based models have achieved impressive success in facial expression recognition (FER) by modeling the long-range relationships among facial muscle movements. However, pure Transformer-based models typically have parameter counts in the millions, which makes them challenging to deploy. Moreover, the lack of inductive bias in Transformers usually makes them difficult to train from scratch on limited FER datasets. To address these problems, we propose an effective and lightweight variant Transformer for FER called VaTFER. In VaTFER, we first construct action unit (AU) tokens from action unit-based regions and their histogram of oriented gradient (HOG) features. Then, we present a novel spatial-channel feature relevance Transformer (SCFRT) module, which incorporates multilayer channel reduction self-attention (MLCRSA) and a dynamic learnable information extraction (DLIE) mechanism. MLCRSA models long-range dependencies among all tokens while decreasing the number of parameters. DLIE aims to alleviate the lack of inductive bias and improve the learning ability of the model. Furthermore, we use an excitation module to replace the vanilla multilayer perceptron (MLP) for accurate prediction. To further reduce computing and memory requirements, we introduce a binary quantization mechanism, yielding a novel lightweight Transformer model called variant binary Transformer for FER (VaBTFER). We conduct extensive experiments on several commonly used facial expression datasets, and the results attest to the effectiveness of our methods.
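The abstract does not say how the binary quantization mechanism is defined. As a rough illustration of the general idea only, the sketch below uses a common sign-plus-scale scheme: each weight row is approximated by a single scale `alpha` times a vector of +1/-1 signs. The function names `binarize_row` and `binary_dot` and the sample values are hypothetical and are not taken from the paper.

```python
def binarize_row(weights):
    """Approximate a weight row by alpha * sign(w).

    Assumption: a generic sign-based scheme, not necessarily the
    mechanism used in VaBTFER.
    """
    # One full-precision scale per row: the mean absolute weight.
    alpha = sum(abs(w) for w in weights) / len(weights)
    # Each weight keeps only its sign, which can be stored in one bit.
    signs = [1 if w >= 0 else -1 for w in weights]
    return alpha, signs


def binary_dot(alpha, signs, x):
    """Dot product with the binarized row: additions/subtractions, then one multiply."""
    return alpha * sum(s * xi for s, xi in zip(signs, x))


row = [0.5, -1.5, 1.0, -1.0]          # hypothetical full-precision weights
x = [1.0, 2.0, 3.0, 4.0]              # hypothetical input activations
alpha, signs = binarize_row(row)
approx = binary_dot(alpha, signs, x)
exact = sum(w * xi for w, xi in zip(row, x))
print(alpha, signs, approx, exact)
```

In a scheme like this, storage per weight drops from a full-precision float to one bit plus a shared scale, at the cost of an approximation error (here the binarized dot product differs from the exact one).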
Pages: 20