Surrogate network-based sparseness hyper-parameter optimization for deep expression recognition

被引:19
作者
Xie, Weicheng [1 ,2 ,3 ]
Chen, Wenting [1 ,2 ,3 ]
Shen, Linlin [1 ,2 ,3 ]
Duan, Jinming [4 ]
Yang, Meng [5 ]
机构
[1] Shenzhen Univ, Sch Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
[2] Shenzhen Inst Artificial Intelligence & Robot Soc, Shenzhen, Peoples R China
[3] Shenzhen Univ, Guangdong Key Lab Intelligent Informat Proc, Shenzhen, Peoples R China
[4] Univ Birmingham, Sch Comp Sci, Birmingham, W Midlands, England
[5] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou, Guangdong, Peoples R China
关键词
Expression recognition; Deep sparseness strategies; Hyper-parameter optimization; Surrogate network; Heuristic optimizer;
D O I
10.1016/j.patcog.2020.107701
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For facial expression recognition, the sparseness constraints of the features or weights can improve the generalization ability of a deep network. However, the optimization of the hyper-parameters in fusing different sparseness strategies demands much computation, when the traditional gradient-based algorithms are used. In this work, an iterative framework with surrogate network is proposed for the optimization of hyper-parameters in fusing different sparseness strategies. In each iteration, a network with significantly smaller model complexity is fitted to the original large network based on four Euclidean losses, where the hyper-parameters are optimized with heuristic optimizers. Since the surrogate network uses the same deep metrics and embeds the same hyper-parameters as the original network, the optimized hyper-parameters are then used for the training of the original deep network in the next iteration. While the performance of the proposed algorithm is justified with a tiny model, i.e. LeNet on the FER2013 database, our approach achieved competitive performances on six publicly available expression datasets, i.e., FER2013, CK+, Oulu-CASIA, MMI, AFEW and AffectNet. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:14
相关论文
共 42 条
[11]   Dynamic Facial Expression Recognition With Atlas Construction and Sparse Representation [J].
Guo, Yimo ;
Zhao, Guoying ;
Pietikainen, Matti .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (05) :1977-1992
[12]  
Hasani B., 2020, IEEE T AFFECT COMPUT, V99, P1
[13]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[14]  
Hoyer PO, 2004, J MACH LEARN RES, V5, P1457
[15]  
Ilievski I, 2017, AAAI CONF ARTIF INTE, P822
[16]   Joint Fine-Tuning in Deep Neural Networks for Facial Expression Recognition [J].
Jung, Heechul ;
Lee, Sihaeng ;
Yim, Junho ;
Park, Sunjeong ;
Kim, Junmo .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2983-2991
[17]  
Kanade T., 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580), P46, DOI 10.1109/AFGR.2000.840611
[18]   Fusing Aligned and Non-Aligned Face Information for Automatic Affect Recognition in the Wild: A Deep Learning Approach [J].
Kim, Bo-Kyeong ;
Dong, Suh-Yeon ;
Roh, Jihyeon ;
Kim, Geonmin ;
Lee, Soo-Young .
PROCEEDINGS OF 29TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, (CVPRW 2016), 2016, :1499-1508
[19]   Autotune: A Derivative-free Optimization Framework for Hyperparameter Tuning [J].
Koch, Patrick ;
Golovidov, Oleg ;
Gardner, Steven ;
Wujek, Brett ;
Griffin, Joshua ;
Xu, Yan .
KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, :443-452
[20]   Gradient-based learning applied to document recognition [J].
Lecun, Y ;
Bottou, L ;
Bengio, Y ;
Haffner, P .
PROCEEDINGS OF THE IEEE, 1998, 86 (11) :2278-2324