Micro-Expression Recognition with Expression-State Constrained Spatio-Temporal Feature Representations

被引:146
作者
Kim, Dae Hoe [1 ]
Baddar, Wissam J. [1 ]
Ro, Yong Man [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Image & Video Syst Lab, Daejeon, South Korea
来源
MM'16: PROCEEDINGS OF THE 2016 ACM MULTIMEDIA CONFERENCE | 2016年
关键词
Micro-Expression Recognition; Recurrent Neural Networks; Long Short Term Memory;
D O I
10.1145/2964284.2967247
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recognizing spontaneous micro-expression in video sequences is a challenging problem. In this paper, we propose a new method of small scale spatio-temporal feature learning. The proposed learning method consists of two parts. First, the spatial features of micro-expressions at different expression-states (i.e., onset, onset to apex transition, apex, apex to offset transition and offset) are encoded using convolutional neural networks (CNN). The expression-states are taken into account in the objective functions, to improve the expression class separability of the learned feature representation. Next, the learned spatial features with expression-state constraints are transferred to learn temporal features of micro-expression. The temporal feature learning encodes the temporal characteristics of the different states of the micro-expression using long short-term memory (LSTM) recurrent neural networks. Extensive and comprehensive experiments have been conducted on the publically available CASME II micro-expression dataset. The experimental results showed that the proposed method outperformed state-of-the-art micro-expression recognition methods in terms of recognition accuracy.
引用
收藏
页码:382 / 386
页数:5
相关论文
共 17 条
[1]  
Ngo ACL, 2015, 2015 IEEE INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), P1246, DOI 10.1109/ICDSP.2015.7252080
[2]  
[Anonymous], 2014, ACCV, DOI DOI 10.1007/978-3-319-16865-4_34
[3]   Face recognition by independent component analysis [J].
Bartlett, MS ;
Movellan, JR ;
Sejnowski, TJ .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (06) :1450-1464
[4]   Feature design for offline Arabic handwriting recognition: handcrafted vs automated? [J].
Chherawala, Youssouf ;
Roy, Partha Pratim ;
Cheriet, Mohamed .
2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, :290-294
[5]   Learning precise timing with LSTM recurrent networks [J].
Gers, FA ;
Schraudolph, NN ;
Schmidhuber, J .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (01) :115-143
[6]   Spontaneous facial micro-expression analysis using Spatiotemporal Completed Local Quantized Patterns [J].
Huang, Xiaohua ;
Zhao, Guoying ;
Hong, Xiaopeng ;
Zheng, Wenming ;
Pietikainen, Matti .
NEUROCOMPUTING, 2016, 175 :564-578
[7]   Facial Micro-Expression Recognition using Spatiotemporal Local Binary Pattern with Integral Projection [J].
Huang, Xiaohua ;
Wang, Su-Jing ;
Zhao, Guoying ;
Pietikainen, Matti .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOP (ICCVW), 2015, :1-9
[8]   Do Deep Neural Networks Learn Facial Action Units When Doing Expression Recognition? [J].
Khorrami, Pooya ;
Le Paine, Tom ;
Huang, Thomas S. .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOP (ICCVW), 2015, :19-27
[9]  
Mignon A, 2012, PROC CVPR IEEE, P2666, DOI 10.1109/CVPR.2012.6247987
[10]  
Nair V., 2010, P 27 INT C MACH LEAR, P807