Deep Structure Inference Network for Facial Action Unit Recognition

被引:87
作者
Corneanu, Ciprian [1 ,2 ]
Madadi, Meysam [2 ]
Escalera, Sergio [1 ,2 ]
机构
[1] Univ Barcelona, Barcelona, Spain
[2] Comp Vis Ctr, Barcelona, Spain
来源
COMPUTER VISION - ECCV 2018, PT XII | 2018年 / 11216卷
关键词
Computer vision; Machine learning; Deep learning; Facial expression analysis; Facial action units; Structure inference;
D O I
10.1007/978-3-030-01258-8_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Facial expressions are combinations of basic components called Action Units (AU). Recognizing AUs is key for general facial expression analysis. Recently, efforts in automatic AU recognition have been dedicated to learning combinations of local features and to exploiting correlations between AUs. We propose a deep neural architecture that tackles both problems by combining learned local and global features in its initial stages and replicating a message passing algorithm between classes similar to a graphical model inference approach in later stages. We show that by training the model end-to-end with increased supervision we improve state-of-the-art by 5.3% and 8.2% performance on BP4D and DISFA datasets, respectively.
引用
收藏
页码:309 / 324
页数:16
相关论文
共 28 条
[1]  
Bakkes Sander, 2012, P SE 8 AUSTR C INT E, P1, DOI 10.1145/2336727.2336731
[2]  
Chu X., 2016, P 30 INT C NEUR INF, P316
[3]   Structure Inference Machines: Recurrent Neural Networks for Analyzing Relations in Group Activity Recognition [J].
Deng, Zhiwei ;
Vandat, Arash ;
Hu, Hexiang ;
Mori, Greg .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :4772-4781
[4]  
DeVault D, 2014, AAMAS'14: PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, P1061
[5]  
Ekman P., 2002, Facs manual. A human face
[6]   Multi-conditional Latent Variable Model for Joint Facial Action Unit Detection [J].
Eleftheriadis, Stefanos ;
Rudovic, Ognjen ;
Pantic, Maja .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :3792-3800
[7]  
Fabian B, 2017, IEEE ICC, DOI 10.1109/ICC.2017.7996828
[8]  
Jaiswal S, 2016, IEEE WINT CONF APPL
[9]   Automatic prediction of frustration [J].
Kapoor, Ashish ;
Burleson, Winslow ;
Picard, Rosalind W. .
INTERNATIONAL JOURNAL OF HUMAN-COMPUTER STUDIES, 2007, 65 (08) :724-736
[10]   One Millisecond Face Alignment with an Ensemble of Regression Trees [J].
Kazemi, Vahid ;
Sullivan, Josephine .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :1867-1874