A Dual-Direction Attention Mixed Feature Network for Facial Expression Recognition

Cited by: 27
Authors
Zhang, Saining [1]
Zhang, Yuhang [1]
Zhang, Ye [2]
Wang, Yufei [3, 4]
Song, Zhigang [4]
Affiliations
[1] Beijing Inst Technol, Sch Comp Sci Technol, Beijing 100081, Peoples R China
[2] Beijing Informat Sci & Technol Univ, Sch Automat, Beijing 100192, Peoples R China
[3] Univ Chinese Acad Sci, Coll Mat Sci & Optoelect Technol, Beijing 100049, Peoples R China
[4] Chinese Acad Sci, Inst Semicond, Beijing 100083, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
MobileFaceNets; coordinate attention; facial expression recognition; MixConv
DOI
10.3390/electronics12173595
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In recent years, facial expression recognition (FER) has attracted significant attention in computer vision research. This paper presents the Dual-Direction Attention Mixed Feature Network (DDAMFN), a network designed specifically for FER that is both robust and lightweight. The architecture comprises two primary components: the Mixed Feature Network (MFN), which serves as the backbone, and the Dual-Direction Attention Network (DDAN), which functions as the head. In the MFN, mixed-size convolution kernels are used to extract resilient features. In addition, a new Dual-Direction Attention (DDA) head is proposed that generates attention maps in two orientations, enabling the model to capture long-range dependencies effectively. To further improve accuracy, a novel attention loss for the DDAN is introduced that encourages different heads to focus on distinct areas of the input. Experimental evaluations on several widely used public datasets, including AffectNet, RAF-DB, and FERPlus, show that the DDAMFN outperforms existing models, establishing it as a state-of-the-art model for FER.
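The idea of generating attention maps in two orientations can be illustrated with a minimal, dependency-free sketch. It is not the authors' implementation: the function name `directional_attention`, the single-channel input, and the use of plain average pooling followed by a sigmoid are assumptions made for illustration, and the learned convolutional transforms that a real attention head would apply between pooling and gating are omitted.

```python
import math


def directional_attention(feature):
    """Reweight one H x W feature map with row-wise and column-wise attention.

    Hypothetical illustration only: a trained head would apply learned
    transforms to the pooled descriptors before the sigmoid gate.
    """
    h, w = len(feature), len(feature[0])

    # Pool along the width: one descriptor per row (vertical orientation).
    row_desc = [sum(row) / w for row in feature]
    # Pool along the height: one descriptor per column (horizontal orientation).
    col_desc = [sum(feature[i][j] for i in range(h)) / h for j in range(w)]

    def sigmoid(x):
        return 1.0 / (1.0 + math.exp(-x))

    # Squash each descriptor into (0, 1) to obtain the two attention maps.
    row_att = [sigmoid(v) for v in row_desc]
    col_att = [sigmoid(v) for v in col_desc]

    # Each position is scaled by the attention of its row and of its column,
    # so a single weight depends on values spanning the whole row and column.
    out = [[feature[i][j] * row_att[i] * col_att[j] for j in range(w)]
           for i in range(h)]
    return out, row_att, col_att


if __name__ == "__main__":
    fmap = [[1.0, 2.0, 3.0],
            [0.0, 0.0, 0.0]]
    reweighted, rows, cols = directional_attention(fmap)
    print(rows)
    print(cols)
    print(reweighted)
```

Because every output value depends on a full row and a full column of the input, this kind of pooling gives each position access to context far from its own location, which is one way to read the abstract's claim about long-range dependencies.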
Pages: 17