FACSCaps: Pose-Independent Facial Action Coding with Capsules

被引:14
作者
Ertugrul, Itir Onal [1 ]
Jeni, Laszlo A. [1 ]
Cohn, Jeffrey F. [1 ,2 ]
机构
[1] Carnegie Mellon Univ, Robot Inst, Pittsburgh, PA 15213 USA
[2] Univ Pittsburgh, Dept Psychol, Pittsburgh, PA 15260 USA
来源
PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW) | 2018年
基金
美国国家卫生研究院;
关键词
D O I
10.1109/CVPRW.2018.00287
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most automated facial expression analysis methods treat the face as a 2D object, flat like a sheet of paper. That works well provided images are frontal or nearly so. In real-world conditions, moderate to large head rotation is common and system performance to recognize expression degrades. Multi-view Convolutional Neural Networks (CNNs) have been proposed to increase robustness to pose, but they require greater model sizes and may generalize poorly across views that are not included in the training set. We propose FACSCaps architecture to handle multi-view and multi-label facial action unit (AU) detection within a single model that can generalize to novel views. Additionally, FACSCaps's ability to synthesize faces enables insights into what is leaned by the model. FACSCaps models video frames using matrix capsules, where hierarchical pose relationships between face parts are built into internal representations. The model is trained by jointly optimizing a multi-label loss and the reconstruction accuracy. FACSCaps was evaluated using the FERA 2017 facial expression dataset that includes spontaneous facial expressions in a wide range of head orientations. FACSCaps outperformed both state-of-the-art CNNs and their temporal extensions.
引用
收藏
页码:2211 / 2220
页数:10
相关论文
共 38 条
[1]  
[Anonymous], SENTIMENT ANAL CAPSU
[2]  
[Anonymous], 2018, ARXIV180210200
[3]  
[Anonymous], P IEEE C COMP VIS PA
[4]  
[Anonymous], 2016, P IEEE WINT C APPL C
[5]  
[Anonymous], 2017, P IEEE INT C COMP VI
[6]  
[Anonymous], 2018, ARXIV180206167
[7]  
[Anonymous], 2015, 2015 11 IEEE INT C W
[8]  
[Anonymous], 2016, APPL COMP VIS WACV 2
[9]  
[Anonymous], ARXIV171203480
[10]  
[Anonymous], 2018, MATRIX CAPSULES EM R