MVT-CEAM: a lightweight MobileViT with channel expansion and attention mechanism for facial expression recognition

被引：2

作者：

Wang, Kunxia ^{[1
,2
]}

Yu, Wancheng ^{[1
,2
]}

Yamauchi, Takashi ^{[3
]}

机构：

[1] Anhui Jianzhu Univ, Sch Elect & Informat Engn, Hefei 230601, Peoples R China

[2] Anhui Jianzhu Univ, Anhui Int Joint Res Ctr Ancient Architecture Intel, Hefei 230601, Peoples R China

[3] Texas A&M Univ, Dept Psychol & Brain Sci, College Stn, TX 77845 USA

来源：

SIGNAL IMAGE AND VIDEO PROCESSING | 2024年 / 18卷 / 10期

关键词：

Expression recognition; Transformer; Channel expansion; Attention mechanism; TRANSFORMER; NETWORK;

D O I：

10.1007/s11760-024-03356-1

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Facial expression recognition is a crucial area of study in psychology that can be applied to many fields, such as intelligent healthcare, human-computer interaction, fuzzy control and other domains. However, current deep learning models usually encounter high complexity, expensive computational requirements and outsized parameters. These obstacles hinder the deployment of applications on resource-constrained mobile terminals. This paper proposes an improved lightweight MobileViT with channel expansion and attention mechanism for facial expression recognition to address these challenges. In this model, we adopt a channel expansion strategy to effectively extract more critical facial expression feature information from multi-scale feature maps. Furthermore, we introduce a channel attention module within the model to improve feature extraction performance. Compared with typical lightweight models, our proposed model significantly improves the accuracy rate while maintaining a lightweight network. Our proposed model achieves 94.35 and 87.41% accuracy on the KDEF and RAF-DB datasets, respectively, demonstrating superior recognition performance.

引用

页码：6853 / 6865

页数：13

共 44 条

[1] Facial Expression Recognition with High Response-Based Local Directional Pattern (HR-LDP) Network [J].

Alphonse, Sherly ;

Verma, Harshit .

CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 78 (02) :2067-2086

[2] MSM-ViT: A multi-scale MobileViT for pulmonary nodule classification using CT images [J].

Cao, Keyan ;

Tao, Hangbo ;

Wang, Zhiqiong ;

Jin, Xi .

JOURNAL OF X-RAY SCIENCE AND TECHNOLOGY, 2023, 31 (04) :731-744

[3] Self-supervised vision transformer-based few-shot learning for facial expression recognition [J].

Chen, Xuanchi ;

Zheng, Xiangwei ;

Sun, Kai ;

Liu, Weilong ;

Zhang, Yuang .

INFORMATION SCIENCES, 2023, 634 :206-226

[4] Drone Detection Method Based on MobileViT and CA-PANet [J].

Cheng, Qianqing ;

Li, Xiuhe ;

Zhu, Bin ;

Shi, Yingchun ;

Xie, Bo .

ELECTRONICS, 2023, 12 (01)

[5] Deep learning-based facial emotion recognition for human-computer interaction applications [J].

Chowdary, M. Kalpana ;

Nguyen, Tu N. ;

Hemanth, D. Jude .

NEURAL COMPUTING & APPLICATIONS, 2023, 35 (32) :23311-23328

[6]

Chu X., 2021, P INT C LEARNING REP, DOI DOI 10.48550/ARXIV.2102.10882

[7]

Dosovitskiy A., 2021, 9 INT C LEARN REPR I

[8] Fine-Tuning Swin Transformer and Multiple Weights Optimality-Seeking for Facial Expression Recognition [J].

Feng, Hongqi ;

Huang, Weikai ;

Zhang, Denghui ;

Zhang, Bangze .

IEEE ACCESS, 2023, 11 :9995-10003

[9] Classroom Facial Expression Recognition Method Based on Conv3D-ConvLSTM-SEnet in Online Education Environment [J].

Fu, Rong ;

Tian, Mijuan .

JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2024, 33 (07)

[10] A Survey on Vision Transformer [J].

Han, Kai ;

Wang, Yunhe ;

Chen, Hanting ;

Chen, Xinghao ;

Guo, Jianyuan ;

Liu, Zhenhua ;

Tang, Yehui ;

Xiao, An ;

Xu, Chunjing ;

Xu, Yixing ;

Yang, Zhaohui ;

Zhang, Yiman ;

Tao, Dacheng .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) :87-110

← 1 2 3 4 5 →