Investigating LSTM for Micro-Expression Recognition

被引:13
作者
Bai, Mengjiong [1 ]
Goecke, Roland [1 ]
机构
[1] Univ Canberra, Fac Sci & Technol, Human Centred Technol, Canberra, ACT, Australia
来源
COMPANION PUBLICATON OF THE 2020 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION (ICMI '20 COMPANION) | 2020年
关键词
Micro-expression; Deep Learning; Long Short-Term Memory;
D O I
10.1145/3395035.3425248
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This study investigates the utility of Long Short-Term Memory (LSTM) networks for modelling spatial-temporal patterns for micro-expression recognition (MER). Micro-expressions are involuntary, short facial expressions, often of low intensity. RNNs have attracted a lot of attention in recent years for modelling temporal sequences. The RNN-LSTM combination to be highly effective results in many application areas. The proposed method combines the recent VGGFace2 model, basically a ResNet-50 CNN trained on the VGGFace2 dataset, with uni-directional and bi-directional LSTM to explore different ways modelling spatial-temporal facial patterns for MER. The Grad-CAM heat map visualisation is used in the training stages to determine the most appropriate layer of the VGGFace2 model for retraining. Experiments are conducted with pure VGGFace2, VGGFace2 + uni-directional LSTM, and VGGFace2 + Bi-directional LSTM on the SMIC database using 5-fold cross-validation.
引用
收藏
页码:7 / 11
页数:5
相关论文
共 32 条
[1]  
Ngo ACL, 2016, INT CONF ACOUST SPEE, P1243, DOI 10.1109/ICASSP.2016.7471875
[2]   Cross-validation methods [J].
Browne, MW .
JOURNAL OF MATHEMATICAL PSYCHOLOGY, 2000, 44 (01) :108-132
[3]   VGGFace2: A dataset for recognising faces across pose and age [J].
Cao, Qiong ;
Shen, Li ;
Xie, Weidi ;
Parkhi, Omkar M. ;
Zisserman, Andrew .
PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, :67-74
[4]   Long-Term Recurrent Convolutional Networks for Visual Recognition and Description [J].
Donahue, Jeff ;
Hendricks, Lisa Anne ;
Rohrbach, Marcus ;
Venugopalan, Subhashini ;
Guadarrama, Sergio ;
Saenko, Kate ;
Darrell, Trevor .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (04) :677-691
[5]   NONVERBAL LEAKAGE AND CLUES TO DECEPTION [J].
EKMAN, P ;
FRIESEN, WV .
PSYCHIATRY, 1969, 32 (01) :88-+
[6]  
Franke M., 2009, ELECT TECHNOLOGY ISS, P1
[7]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[8]  
Hochreiter S, 1997, NEURAL COMPUT, V9, P1735, DOI [10.1162/neco.1997.9.8.1735, 10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
[9]   Spontaneous facial micro-expression analysis using Spatiotemporal Completed Local Quantized Patterns [J].
Huang, Xiaohua ;
Zhao, Guoying ;
Hong, Xiaopeng ;
Zheng, Wenming ;
Pietikainen, Matti .
NEUROCOMPUTING, 2016, 175 :564-578
[10]   Facial Micro-Expression Recognition using Spatiotemporal Local Binary Pattern with Integral Projection [J].
Huang, Xiaohua ;
Wang, Su-Jing ;
Zhao, Guoying ;
Pietikainen, Matti .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOP (ICCVW), 2015, :1-9