Deep Structure Inference Network for Facial Action Unit Recognition

被引:87
作者
Corneanu, Ciprian [1 ,2 ]
Madadi, Meysam [2 ]
Escalera, Sergio [1 ,2 ]
机构
[1] Univ Barcelona, Barcelona, Spain
[2] Comp Vis Ctr, Barcelona, Spain
来源
COMPUTER VISION - ECCV 2018, PT XII | 2018年 / 11216卷
关键词
Computer vision; Machine learning; Deep learning; Facial expression analysis; Facial action units; Structure inference;
D O I
10.1007/978-3-030-01258-8_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Facial expressions are combinations of basic components called Action Units (AU). Recognizing AUs is key for general facial expression analysis. Recently, efforts in automatic AU recognition have been dedicated to learning combinations of local features and to exploiting correlations between AUs. We propose a deep neural architecture that tackles both problems by combining learned local and global features in its initial stages and replicating a message passing algorithm between classes similar to a graphical model inference approach in later stages. We show that by training the model end-to-end with increased supervision we improve state-of-the-art by 5.3% and 8.2% performance on BP4D and DISFA datasets, respectively.
引用
收藏
页码:309 / 324
页数:16
相关论文
共 28 条
[11]  
Kulkarni K., 2017, ARXIV170704061
[12]   Action Unit Detection with Region Adaptation, Multi-labeling Learning and Optimal Temporal Fusing [J].
Li, Wei ;
Abtahi, Farnaz ;
Zhu, Zhigang .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6766-6775
[13]   DISFA: A Spontaneous Facial Action Intensity Database [J].
Mavadati, S. Mohammad ;
Mahoor, Mohammad H. ;
Bartlett, Kevin ;
Trinh, Philip ;
Cohn, Jeffrey F. .
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2013, 4 (02) :151-160
[14]  
Selvaraju RR, 2020, INT J COMPUT VISION, V128, P336, DOI [10.1109/ICCV.2017.74, 10.1007/s11263-019-01228-7]
[15]  
Simonyan K, 2015, Arxiv, DOI arXiv:1409.1556
[16]   Fisher Vector Faces in the Wild [J].
Simonyan, Karen ;
Parkhi, Omkar M. ;
Vedaldi, Andrea ;
Zisserman, Andrew .
PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2013, 2013,
[17]   DeepFace: Closing the Gap to Human-Level Performance in Face Verification [J].
Taigman, Yaniv ;
Yang, Ming ;
Ranzato, Marc'Aurelio ;
Wolf, Lior .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :1701-1708
[18]   Social signal processing: Survey of an emerging domain [J].
Vinciarelli, Alessandro ;
Pantic, Maja ;
Bourlard, Herve .
IMAGE AND VISION COMPUTING, 2009, 27 (12) :1743-1759
[19]   Deep Structured Learning for Facial Action Unit Intensity Estimation [J].
Walecki, Robert ;
Rudovic, Ognjen ;
Pavlovic, Vladimir ;
Schuller, Bjoern ;
Pantic, Maja .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5709-5718
[20]   Capturing Global Semantic Relationships for Facial Action Unit Recognition [J].
Wang, Ziheng ;
Li, Yongqiang ;
Wang, Shangfei ;
Ji, Qiang .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :3304-3311