A Dual-Direction Attention Mixed Feature Network for Facial Expression Recognition

被引:27
作者
Zhang, Saining [1 ]
Zhang, Yuhang [1 ]
Zhang, Ye [2 ]
Wang, Yufei [3 ,4 ]
Song, Zhigang [4 ]
机构
[1] Beijing Inst Technol, Sch Comp Sci Technol, Beijing 100081, Peoples R China
[2] Beijing Informat Sci & Technol Univ, Sch Automat, Beijing 100192, Peoples R China
[3] Univ Chinese Acad Sci, Coll Mat Sci & Optoelect Technol, Beijing 100049, Peoples R China
[4] Chinese Acad Sci, Inst Semicond, Beijing 100083, Peoples R China
基金
中国国家自然科学基金;
关键词
MobileFaceNets; coordinate attention; facial expression recognition; MixConv;
D O I
10.3390/electronics12173595
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, facial expression recognition (FER) has garnered significant attention within the realm of computer vision research. This paper presents an innovative network called the Dual-Direction Attention Mixed Feature Network (DDAMFN) specifically designed for FER, boasting both robustness and lightweight characteristics. The network architecture comprises two primary components: the Mixed Feature Network (MFN) serving as the backbone, and the Dual-Direction Attention Network (DDAN) functioning as the head. To enhance the network's capability in the MFN, resilient features are extracted by utilizing mixed-size kernels. Additionally, a new Dual-Direction Attention (DDA) head that generates attention maps in two orientations is proposed, enabling the model to capture long-range dependencies effectively. To further improve the accuracy, a novel attention loss mechanism for the DDAN is introduced with different heads focusing on distinct areas of the input. Experimental evaluations on several widely used public datasets, including AffectNet, RAF-DB, and FERPlus, demonstrate the superiority of the DDAMFN compared to other existing models, which establishes that the DDAMFN as the state-of-the-art model in the field of FER.
引用
收藏
页数:17
相关论文
共 37 条
[31]   Region Attention Networks for Pose and Occlusion Robust Facial Expression Recognition [J].
Wang, Kai ;
Peng, Xiaojiang ;
Yang, Jianfei ;
Meng, Debin ;
Qiao, Yu .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :4057-4069
[32]   Distract Your Attention: Multi-Head Cross Attention Network for Facial Expression Recognition [J].
Wen, Zhengyao ;
Lin, Wenzhong ;
Wang, Tao ;
Xu, Ge .
BIOMIMETICS, 2023, 8 (02)
[33]   CBAM: Convolutional Block Attention Module [J].
Woo, Sanghyun ;
Park, Jongchan ;
Lee, Joon-Young ;
Kweon, In So .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :3-19
[34]  
Xue F., 2021, P IEEE CVF INT C COM, P3601
[35]   Learning a Facial Expression Embedding Disentangled from Identity [J].
Zhang, Wei ;
Ji, Xianpeng ;
Chen, Keyu ;
Ding, Yu ;
Fan, Changjie .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :6755-6764
[36]   Graph-Preserving Sparse Nonnegative Matrix Factorization With Application to Facial Expression Recognition [J].
Zhi, Ruicong ;
Flierl, Markus ;
Ruan, Qiuqi ;
Kleijn, W. Bastiaan .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2011, 41 (01) :38-52
[37]  
Zhong L, 2012, PROC CVPR IEEE, P2562, DOI 10.1109/CVPR.2012.6247974