MCCA-VNet: A Vit-Based Deep Learning Approach for Micro-Expression Recognition Based on Facial Coding

被引:0
作者
Zhang, Dehao [1 ,2 ]
Zhang, Tao [1 ,2 ]
Sun, Haijiang [1 ]
Tang, Yanhui [1 ]
Liu, Qiaoyuan [1 ]
机构
[1] Chinese Acad Sci, Changchun Inst Opt Fine Mech & Phys, Changchun 130033, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
关键词
micro-expression; optical flow method; facial coding; MCCA-VNET; vision transformer; CNN;
D O I
10.3390/s24237549
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
In terms of facial expressions, micro-expressions are more realistic than macro-expressions and provide more valuable information, which can be widely used in psychological counseling and clinical diagnosis. In the past few years, deep learning methods based on optical flow and Transformer have achieved excellent results in this field, but most of the current algorithms are mainly concentrated on establishing a serialized token through the self-attention model, and they do not take into account the spatial relationship between facial landmarks. For the locality and changes in the micro-facial conditions themselves, we propose the deep learning model MCCA-VNET on the basis of Transformer. We effectively extract the changing features as the input of the model, fusing channel attention and spatial attention into Vision Transformer to capture correlations between features in different dimensions, which enhances the accuracy of the identification of micro-expressions. In order to verify the effectiveness of the algorithm mentioned, we conduct experimental testing in the SAMM, CAS (ME) II, and SMIC datasets and compared the results with other former best algorithms. Our algorithms can improve the UF1 score and UAR score to, respectively, 0.8676 and 0.8622 for the composite dataset, and they are better than other algorithms on multiple indicators, achieving the best comprehensive performance.
引用
收藏
页数:18
相关论文
共 50 条
[1]  
Ali A., 2021, Adv. Neural Inf. Process. Syst., V34, DOI DOI 10.48550/ARXIV.2106.09681
[2]  
Ballester P, 2016, AAAI CONF ARTIF INTE, P1124
[3]   Video-Based Facial Micro-Expression Analysis: A Survey of Datasets, Features and Algorithms [J].
Ben, Xianye ;
Ren, Yi ;
Zhang, Junping ;
Wang, Su-Jing ;
Kpalma, Kidiyo ;
Meng, Weixiao ;
Liu, Yong-Jin .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) :5826-5846
[4]   CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification [J].
Chen, Chun-Fu ;
Fan, Quanfu ;
Panda, Rameswar .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :347-356
[5]  
Davison A.K., 1869, P 2015 IEEE INT C SY
[6]   SAMM: A Spontaneous Micro-Facial Movement Dataset [J].
Davison, Adrian K. ;
Lansley, Cliff ;
Costen, Nicholas ;
Tan, Kevin ;
Yap, Moi Hoon .
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2018, 9 (01) :116-129
[7]   OFF-ApexNet on micro-expression recognition system [J].
Gan, Y. S. ;
Liong, Sze-Teng ;
Yau, Wei-Chuen ;
Huang, Yen-Chang ;
Tan, Lit-Ken .
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2019, 74 :129-139
[8]   SL-Swin: A Transformer-Based Deep Learning Approach for Macro- and Micro-Expression Spotting on Small-Size Expression Datasets [J].
He, Erheng ;
Chen, Qianru ;
Zhong, Qinghua .
ELECTRONICS, 2023, 12 (12)
[9]   Attention on Attention for Image Captioning [J].
Huang, Lun ;
Wang, Wenmin ;
Chen, Jie ;
Wei, Xiao-Yong .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :4633-4642
[10]   Spontaneous facial micro-expression analysis using Spatiotemporal Completed Local Quantized Patterns [J].
Huang, Xiaohua ;
Zhao, Guoying ;
Hong, Xiaopeng ;
Zheng, Wenming ;
Pietikainen, Matti .
NEUROCOMPUTING, 2016, 175 :564-578