MERCoL: video-based facial micro-expression recognition via bimodal contrastive learning

被引:1
作者
Song, Yanxin [1 ]
Wang, Pengyu [1 ]
Sun, Hao [1 ]
Chen, Lei [1 ]
Ben, Xianye [1 ]
机构
[1] Shandong Univ, Sch Informat Sci & Engn, Qingdao, Shandong, Peoples R China
关键词
micro-expression recognition; contrastive representative learning; optical flow; facial features; spotting; OPTICAL-FLOW FEATURE;
D O I
10.1504/IJCAT.2023.132402
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Micro-expressions are brief, subtle, involuntary facial gestures revealing genuine mental activity, with numerous real-world applications. Owing to their transient nature, low intensity, and capture difficulty, many existing recognition algorithms based on handcrafted features and deep learning methods lack accuracy. We propose a micro-expression recognition framework called MERCoL, which utilises bimodal contrastive learning to extract common and distinctive features from limited dataset samples. The network comprises three modules: bimodal feature extraction, bimodal contrastive learning fusion, and classification. First, micro-expression sequences are divided into RGB and optical flow sequences, with a contrastive learning-based loss function for learning common bimodal features. Second, bimodal features are then fused and labelled data is used to optimise the network for distinctive feature learning. At last, experiments on CASME II, SAMM, and MMEW datasets demonstrate our algorithm's superiority compared to state-of-the-art methods.
引用
收藏
页码:311 / 320
页数:11
相关论文
共 28 条
[1]   Video-Based Facial Micro-Expression Analysis: A Survey of Datasets, Features and Algorithms [J].
Ben, Xianye ;
Ren, Yi ;
Zhang, Junping ;
Wang, Su-Jing ;
Kpalma, Kidiyo ;
Meng, Weixiao ;
Liu, Yong-Jin .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) :5826-5846
[2]   Gait recognition and micro-expression recognition based on maximum margin projection with tensor representation [J].
Ben, Xianye ;
Zhang, Peng ;
Yan, Rui ;
Yang, Mingqiang ;
Ge, Guodong .
NEURAL COMPUTING & APPLICATIONS, 2016, 27 (08) :2629-2646
[3]  
Chen Ting, 2019, PMLR
[4]   Dermoscopic image segmentation method based on convolutional neural networks [J].
Dang Ngoc Hoang Thanh ;
Le Thi Thanh ;
Erkan, Ugur ;
Khamparia, Aditya ;
Prasath, V. B. Surya .
INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2021, 66 (02) :89-99
[5]   SAMM: A Spontaneous Micro-Facial Movement Dataset [J].
Davison, Adrian K. ;
Lansley, Cliff ;
Costen, Nicholas ;
Tan, Kevin ;
Yap, Moi Hoon .
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2018, 9 (01) :116-129
[6]  
Ekman P., 2009, Telling Lies: Clues to Deceit in the Marketplace, Politics, and Marriage, Vrevised ed.
[7]   Two-frame motion estimation based on polynomial expansion [J].
Farnebäck, G .
IMAGE ANALYSIS, PROCEEDINGS, 2003, 2749 :363-370
[8]  
Fathy W. E., 2018, J IMAGE GRAPHICS, V6
[9]   A Magnitude and Angle Combined Optical Flow Feature for Microexpression Spotting [J].
Guo, Yifei ;
Li, Bing ;
Ben, Xianye ;
Ren, Yi ;
Zhang, Junping ;
Yan, Rui ;
Li, Yujun .
IEEE MULTIMEDIA, 2021, 28 (02) :29-39
[10]  
Gutmann M., 2010, P 13 INT C ARTIFICIA, P297