TriCAFFNet: A Tri-Cross-Attention Transformer with a Multi-Feature Fusion Network for Facial Expression Recognition

Cited by: 1
Authors
Tian, Yuan [1]
Wang, Zhao [1]
Chen, Di [1]
Yao, Huang [1]
Affiliations
[1] Central China Normal University, Faculty of Artificial Intelligence in Education, Wuhan 430079, People's Republic of China
Keywords
facial expression recognition; vision transformer; multi-feature; tri-cross attention; classification; scale
DOI
10.3390/s24165391
Chinese Library Classification number
O65 [Analytical Chemistry]
Discipline classification codes
070302; 081704
Abstract
Facial expression recognition methods have advanced significantly in recent years, but recognition in real-world environments remains an open problem. This paper proposes TriCAFFNet, a tri-cross-attention transformer with a multi-feature fusion network, to improve facial expression recognition under challenging conditions. The model combines LBP (Local Binary Pattern), HOG (Histogram of Oriented Gradients), landmark, and CNN (convolutional neural network) features extracted from facial images, giving it a richer input and improving its ability to discern subtle differences between images. In addition, tri-cross-attention blocks exchange information between the different features, so that the features guide one another in attending to salient information. Extensive experiments on several widely used datasets show that TriCAFFNet achieves state-of-the-art performance: 92.17% on RAF-DB, 67.40% on AffectNet (7 classes), and 63.49% on AffectNet (8 classes).
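The idea of features guiding one another through cross-attention can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not say how the four feature types are grouped into three streams, so the stream names, token counts, and dimensions below are assumptions, and the learned query/key/value projections of a real transformer block are omitted.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(query_tokens, context_tokens):
    """Scaled dot-product attention with queries from one stream and
    keys/values from another. Learned projections are left out."""
    d = query_tokens.shape[-1]
    scores = query_tokens @ context_tokens.T / np.sqrt(d)  # (Nq, Nc)
    weights = softmax(scores, axis=-1)                      # rows sum to 1
    return weights @ context_tokens, weights

def tri_cross_attention(a, b, c):
    """Hypothetical three-way exchange: each stream attends to the
    concatenated tokens of the other two, with a residual connection."""
    outputs = []
    for query, others in ((a, (b, c)), (b, (a, c)), (c, (a, b))):
        context = np.concatenate(others, axis=0)
        attended, _ = cross_attention(query, context)
        outputs.append(query + attended)
    return tuple(outputs)

# Placeholder token sequences (assumed shapes: 49 tokens, 64 dimensions).
rng = np.random.default_rng(0)
texture = rng.normal(size=(49, 64))    # stands in for LBP/HOG-derived tokens
landmark = rng.normal(size=(49, 64))   # stands in for landmark-derived tokens
cnn = rng.normal(size=(49, 64))        # stands in for CNN feature-map tokens

t_out, l_out, c_out = tri_cross_attention(texture, landmark, cnn)
```

Each output keeps the shape of its input stream, so such a block could in principle be stacked; whether the paper stacks its blocks this way is not stated in the abstract.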
Pages: 16