Boosting Micro-Expression Recognition via Self-Expression Reconstruction and Memory Contrastive Learning

被引:7
作者
Bao, Yongtang [1 ]
Wu, Chenxi [1 ]
Zhang, Peng [1 ]
Shan, Caifeng [2 ,3 ]
Qi, Yue [4 ,5 ]
Ben, Xianye [6 ]
机构
[1] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China
[2] Shandong Univ Sci & Technol, Coll Elect Engn & Automat, Qingdao 266590, Peoples R China
[3] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210023, Peoples R China
[4] Beihang Univ, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
[5] Beihang Univ Qingdao Res Inst, Virtual Real Res Inst, Qingdao 266590, Peoples R China
[6] Shandong Univ, Sch Informat Sci & Engn, Jinan 250100, Peoples R China
基金
中国国家自然科学基金;
关键词
Image reconstruction; Task analysis; Convolutional neural networks; Transformers; Solid modeling; Representation learning; Prototypes; Micro-expression recognition; self-expression reconstruction; random patch dropout; memory contrastive learning; generalization;
D O I
10.1109/TAFFC.2024.3397701
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Micro-expression (ME) is an instinctive reaction that is not controlled by thoughts. It reveals one's inner feelings, which is significant in sentiment analysis and lie detection. Since micro-expression is expressed as subtle facial changes within particular facial action units, learning discriminative and generalized features for Micro-expression Recognition (MER) is challenging. To achieve the purpose, this paper proposes a novel MER framework that simultaneously integrates supervised Prototype-based Memory Contrastive Learning (PMCL) for discriminative feature mining and adds Self-expression Reconstruction (SER) as an auxiliary task and regularization for better generalization. In particular, the proposed SER module is forced as a regularization by reconstructing input ME from the randomly dropped patch-wise features in the bottleneck. And, the PMCL module globally compares historical and current cluster agents learned from training instances to enhance intra-class compactness and inter-class separability. Extensive experiments are conducted on three benchmarks, e.g., SMIC, CASME II, and SAMM, under evaluation criteria of both Composite Database Evaluation (CDE) and Single Database Evaluation (SDE) protocols. The results show our method surpasses other state-of-the-art approaches under various evaluation metrics, achieving overall 86.30% unweighed F1-score and 88.30% unweighed average recall on the composite dataset. Furthermore, the ablation studies verify the effectiveness of our SER for better generalization and PMCL for better discrimination in learning feature representation from limited micro-expression samples.
引用
收藏
页码:2083 / 2096
页数:14
相关论文
共 60 条
[1]  
Ahonen T, 2004, LECT NOTES COMPUT SC, V3021, P469
[2]  
[Anonymous], 2003, Unmasking the Face
[3]   PERFORMANCE OF OPTICAL-FLOW TECHNIQUES [J].
BARRON, JL ;
FLEET, DJ ;
BEAUCHEMIN, SS .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 1994, 12 (01) :43-77
[4]  
Belghazi MI, 2018, PR MACH LEARN RES, V80
[5]   Video-Based Facial Micro-Expression Analysis: A Survey of Datasets, Features and Algorithms [J].
Ben, Xianye ;
Ren, Yi ;
Zhang, Junping ;
Wang, Su-Jing ;
Kpalma, Kidiyo ;
Meng, Weixiao ;
Liu, Yong-Jin .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) :5826-5846
[6]   Learning effective binary descriptors for micro-expression recognition transferred by macro-information [J].
Ben, Xianye ;
Jia, Xitong ;
Yan, Rui ;
Zhang, Xin ;
Meng, Weixiao .
PATTERN RECOGNITION LETTERS, 2018, 107 :50-58
[7]   Block Division Convolutional Network With Implicit Deep Features Augmentation for Micro-Expression Recognition [J].
Chen, Bin ;
Liu, Kun-Hong ;
Xu, Yong ;
Wu, Qing-Qiang ;
Yao, Jun-Feng .
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 :1345-1358
[8]  
Dai Zuozhuo, 2023, Computer Vision - ACCV 2022: 16th Asian Conference on Computer Vision, Proceedings. Lecture Notes in Computer Science (13846), P319, DOI 10.1007/978-3-031-26351-4_20
[9]   SAMM: A Spontaneous Micro-Facial Movement Dataset [J].
Davison, Adrian K. ;
Lansley, Cliff ;
Costen, Nicholas ;
Tan, Kevin ;
Yap, Moi Hoon .
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2018, 9 (01) :116-129
[10]   Variational Prototype Learning for Deep Face Recognition [J].
Deng, Jiankang ;
Guo, Jia ;
Yang, Jing ;
Lattas, Alexandros ;
Zafeiriou, Stefanos .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :11901-11910