Hierarchical Committee of Deep CNNs with Exponentially-Weighted Decision Fusion for Static Facial Expression Recognition

被引:90
作者
Kim, Bo-Kyeong [1 ]
Lee, Hwaran [1 ]
Roh, Jihyeon [1 ]
Lee, Soo-Young [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Dept Elect Engn, Daejeon, South Korea
来源
ICMI'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION | 2015年
关键词
Hierarchical Committee; Exponentially-Weighted Decision Fusion; Deep Convolutional Neural Network; CLASSIFICATION; CLASSIFIERS; EXPERTS;
D O I
10.1145/2818346.2830590
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We present a pattern recognition framework to improve committee machines of deep convolutional neural networks (deep CNNs) and its application to static facial expression recognition in the wild (SFEW). In order to generate enough diversity of decisions, we trained multiple deep CNNs by varying network architectures, input normalization, and weight initialization as well as by adopting several learning strategies to use large external databases. Moreover, with these deep models, we formed hierarchical committees using the validation-accuracy-based exponentially-weighted average (VA-Expo-WA) rule. Through extensive experiments, the great strengths of our committee machines were demonstrated in both structural and decisional ways. On the SFEW2.0 dataset released for the 3rd Emotion Recognition in the Wild (EmotiW) sub-challenge, a test accuracy of 57.3% was obtained from the best single deep CNN, while the single-level committees yielded 58.3% and 60.5% with the simple average rule and with the VA-Expo-WA rule, respectively. Our final submission based on the 3-level hierarchy using the VA-Expo-WA achieved 61.6%, significantly higher than the SFEW baseline of 39.1%.
引用
收藏
页码:427 / 434
页数:8
相关论文
共 35 条
[1]  
Agostinelli F., 2013, Advances in Neural Information Processing Systems (NIPS), V1, P1493
[2]   Using diversity of errors for selecting members of a committee classifier [J].
Aksela, M ;
Laaksonen, J .
PATTERN RECOGNITION, 2006, 39 (04) :608-623
[3]  
[Anonymous], ECCV 2014 WORKSH
[4]  
[Anonymous], ICML 2013 WORKSH REP
[5]  
[Anonymous], ICMI 2015 IN PRESS
[6]  
[Anonymous], ABS14124564 CORR
[7]   On combining classifier mass functions for text categorization [J].
Bell, DA ;
Guan, JW ;
Bi, YX .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (10) :1307-1319
[8]   Microarray-based classification and clinical predictors: on combined classifiers and additional predictive value [J].
Boulesteix, Anne-Laure ;
Porzelius, Christine ;
Daumer, Martin .
BIOINFORMATICS, 2008, 24 (15) :1698-1706
[9]  
Ciresan D, 2012, PROC CVPR IEEE, P3642, DOI 10.1109/CVPR.2012.6248110
[10]   Multi-column deep neural network for traffic sign classification [J].
Ciresan, Dan ;
Meier, Ueli ;
Masci, Jonathan ;
Schmidhuber, Juergen .
NEURAL NETWORKS, 2012, 32 :333-338