MVT-CEAM: a lightweight MobileViT with channel expansion and attention mechanism for facial expression recognition

被引:2
作者
Wang, Kunxia [1 ,2 ]
Yu, Wancheng [1 ,2 ]
Yamauchi, Takashi [3 ]
机构
[1] Anhui Jianzhu Univ, Sch Elect & Informat Engn, Hefei 230601, Peoples R China
[2] Anhui Jianzhu Univ, Anhui Int Joint Res Ctr Ancient Architecture Intel, Hefei 230601, Peoples R China
[3] Texas A&M Univ, Dept Psychol & Brain Sci, College Stn, TX 77845 USA
关键词
Expression recognition; Transformer; Channel expansion; Attention mechanism; TRANSFORMER; NETWORK;
D O I
10.1007/s11760-024-03356-1
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Facial expression recognition is a crucial area of study in psychology that can be applied to many fields, such as intelligent healthcare, human-computer interaction, fuzzy control and other domains. However, current deep learning models usually encounter high complexity, expensive computational requirements and outsized parameters. These obstacles hinder the deployment of applications on resource-constrained mobile terminals. This paper proposes an improved lightweight MobileViT with channel expansion and attention mechanism for facial expression recognition to address these challenges. In this model, we adopt a channel expansion strategy to effectively extract more critical facial expression feature information from multi-scale feature maps. Furthermore, we introduce a channel attention module within the model to improve feature extraction performance. Compared with typical lightweight models, our proposed model significantly improves the accuracy rate while maintaining a lightweight network. Our proposed model achieves 94.35 and 87.41% accuracy on the KDEF and RAF-DB datasets, respectively, demonstrating superior recognition performance.
引用
收藏
页码:6853 / 6865
页数:13
相关论文
共 44 条
[11]   Spatio-Temporal Facial Expression Recognition Using Convolutional Neural Networks and Conditional Random Fields [J].
Hasani, Behzad ;
Mahoor, Mohammad H. .
2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017, :790-795
[12]  
Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/TPAMI.2019.2913372, 10.1109/CVPR.2018.00745]
[13]   A novel facial expression recognition algorithm using geometry β-skeleton in fusion based on deep CNN [J].
Jabbooree, Abbas Issa ;
Khanli, Leyli Mohammad ;
Salehpour, Pedram ;
Pourbahrami, Shahin .
IMAGE AND VISION COMPUTING, 2023, 134
[14]   Do Deep Neural Networks Learn Facial Action Units When Doing Expression Recognition? [J].
Khorrami, Pooya ;
Le Paine, Tom ;
Huang, Thomas S. .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOP (ICCVW), 2015, :19-27
[15]   Early detection of depression through facial expression recognition and electroencephalogram-based artificial intelligence-assisted graphical user interface [J].
Kumar, Gajendra ;
Das, Tanaya ;
Singh, Kuldeep .
NEURAL COMPUTING & APPLICATIONS, 2024, 36 (12) :6937-6954
[16]   Automatic facial expression recognition combining texture and shape features from prominent facial regions [J].
Kumar, Naveen H. N. ;
Kumar, A. Suresh ;
Prasad, Guru M. S. ;
Shah, Mohd Asif .
IET IMAGE PROCESSING, 2023, 17 (04) :1111-1125
[17]   PMVT: a lightweight vision transformer for plant disease identification on mobile devices [J].
Li, Guoqiang ;
Wang, Yuchao ;
Zhao, Qing ;
Yuan, Peiyan ;
Chang, Baofang .
FRONTIERS IN PLANT SCIENCE, 2023, 14
[18]   Deep Facial Expression Recognition: A Survey [J].
Li, Shan ;
Deng, Weihong .
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2022, 13 (03) :1195-1215
[19]   Reliable Crowdsourcing and Deep Locality-Preserving Learning for Unconstrained Facial Expression Recognition [J].
Li, Shan ;
Deng, Weihong .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (01) :356-370
[20]   Facial Expression Recognition in the Wild Using Multi-Level Features and Attention Mechanisms [J].
Li, Yingjian ;
Lu, Guangming ;
Li, Jinxing ;
Zhang, Zheng ;
Zhang, David .
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (01) :451-462