Surrogate network-based sparseness hyper-parameter optimization for deep expression recognition

被引：19

作者：

Xie, Weicheng ^{[1
,2
,3
]}

Chen, Wenting ^{[1
,2
,3
]}

Shen, Linlin ^{[1
,2
,3
]}

Duan, Jinming ^{[4
]}

Yang, Meng ^{[5
]}

机构：

[1] Shenzhen Univ, Sch Comp Sci & Software Engn, Shenzhen 518060, Peoples R China

[2] Shenzhen Inst Artificial Intelligence & Robot Soc, Shenzhen, Peoples R China

[3] Shenzhen Univ, Guangdong Key Lab Intelligent Informat Proc, Shenzhen, Peoples R China

[4] Univ Birmingham, Sch Comp Sci, Birmingham, W Midlands, England

[5] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou, Guangdong, Peoples R China

来源：

PATTERN RECOGNITION | 2021年 / 111卷

关键词：

Expression recognition; Deep sparseness strategies; Hyper-parameter optimization; Surrogate network; Heuristic optimizer;

D O I：

10.1016/j.patcog.2020.107701

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

For facial expression recognition, the sparseness constraints of the features or weights can improve the generalization ability of a deep network. However, the optimization of the hyper-parameters in fusing different sparseness strategies demands much computation, when the traditional gradient-based algorithms are used. In this work, an iterative framework with surrogate network is proposed for the optimization of hyper-parameters in fusing different sparseness strategies. In each iteration, a network with significantly smaller model complexity is fitted to the original large network based on four Euclidean losses, where the hyper-parameters are optimized with heuristic optimizers. Since the surrogate network uses the same deep metrics and embeds the same hyper-parameters as the original network, the optimized hyper-parameters are then used for the training of the original deep network in the next iteration. While the performance of the proposed algorithm is justified with a tiny model, i.e. LeNet on the FER2013 database, our approach achieved competitive performances on six publicly available expression datasets, i.e., FER2013, CK+, Oulu-CASIA, MMI, AFEW and AffectNet. (C) 2020 Elsevier Ltd. All rights reserved.

引用

页数：14

共 42 条

[11] Dynamic Facial Expression Recognition With Atlas Construction and Sparse Representation [J].

Guo, Yimo ;

Zhao, Guoying ;

Pietikainen, Matti .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (05) :1977-1992

[12]

Hasani B., 2020, IEEE T AFFECT COMPUT, V99, P1

[13] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[14]

Hoyer PO, 2004, J MACH LEARN RES, V5, P1457

[15]

Ilievski I, 2017, AAAI CONF ARTIF INTE, P822

[16] Joint Fine-Tuning in Deep Neural Networks for Facial Expression Recognition [J].

Jung, Heechul ;

Lee, Sihaeng ;

Yim, Junho ;

Park, Sunjeong ;

Kim, Junmo .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2983-2991

[17]

Kanade T., 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580), P46, DOI 10.1109/AFGR.2000.840611

[18] Fusing Aligned and Non-Aligned Face Information for Automatic Affect Recognition in the Wild: A Deep Learning Approach [J].

Kim, Bo-Kyeong ;

Dong, Suh-Yeon ;

Roh, Jihyeon ;

Kim, Geonmin ;

Lee, Soo-Young .

PROCEEDINGS OF 29TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, (CVPRW 2016), 2016, :1499-1508

[19] Autotune: A Derivative-free Optimization Framework for Hyperparameter Tuning [J].

Koch, Patrick ;

Golovidov, Oleg ;

Gardner, Steven ;

Wujek, Brett ;

Griffin, Joshua ;

Xu, Yan .

KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, :443-452

[20] Gradient-based learning applied to document recognition [J].

Lecun, Y ;

Bottou, L ;

Bengio, Y ;

Haffner, P .

PROCEEDINGS OF THE IEEE, 1998, 86 (11) :2278-2324

← 1 2 3 4 5 →