Robust and energy-efficient expression recognition based on improved deep ResNets

被引：4

作者：

Chen, Yunhua ^{[1
]}

Du, Jin ^{[1
]}

Liu, Qian ^{[2
]}

Zhang, Ling ^{[1
]}

Zeng, Yanjun ^{[3
]}

机构：

[1] Guangdong Univ Technol, Sch Comp, Guangzhou 510006, Guangdong, Peoples R China

[2] aiCTX AG, Thurgauerstr 40, CH-8050 Zurich, Switzerland

[3] Beijing Univ Technol, Biomed Engn Ctr, Beijing 100124, Peoples R China

来源：

BIOMEDICAL ENGINEERING-BIOMEDIZINISCHE TECHNIK | 2019年 / 64卷 / 05期

关键词：

Convolutional Neural Networks; deep residual networks; facial expression recognition; leaky integrate- and-fire (LIF) neurons; Noisy Softplus; MODEL;

D O I：

10.1515/bmt-2018-0027

中图分类号：

R318 [生物医学工程];

学科分类号：

0831 ;

摘要：

To improve the robustness and to reduce the energy consumption of facial expression recognition, this study proposed a facial expression recognition method based on improved deep residual networks (ResNets). Residual learning has solved the degradation problem of deep Convolutional Neural Networks (CNNs); therefore, in theory, a ResNet can consist of infinite number of neural layers. On the one hand, ResNets benefit from better performance on artificial intelligence (AI) tasks, thanks to its deeper network structure; meanwhile, on the other hand, it faces a severe problem of energy consumption, especially on mobile devices. Hence, this study employs a novel activation function, the Noisy Softplus (NSP), to replace rectified linear units (ReLU) to get improved ResNets. NSP is a biologically plausible activation function, which was first proposed in training Spiking Neural Networks (SNNs); thus, NSP-trained models can be directly implemented on ultra-low-power neuromorphic hardware. We built an 18-layered ResNet using NSP to perform facial expression recognition across datasets Cohn-Kanade (CK+), Karolinska Directed Emotional Faces (KDEF) and GENKI-4K. The results achieved better antinoise ability than ResNet using the activation function ReLU and showed low energy consumption running on neuromorphic hardware. This study not only contributes a solution for robust facial expression recognition, but also consolidates the low energy cost of their implementation on neuromorphic devices, which could pave the way for high-performance, noise-robust and energy-efficient vision applications on mobile hardware.

引用

页码：519 / 528

页数：10

共 40 条

[21] Deeply Learning Deformable Facial Action Parts Model for Dynamic Expression Analysis
Liu, Mengyi
Li, Shaoxin
Shan, Shiguang
Wang, Ruiping
Chen, Xilin
[J]. COMPUTER VISION - ACCV 2014, PT IV, 2015, 9006 : 143 - 157
[22] Noisy Softplus: A Biology Inspired Activation Function
Liu, Qian
Furber, Steve
[J]. NEURAL INFORMATION PROCESSING, ICONIP 2016, PT IV, 2016, 9950 : 405 - 412
[23] Liu Qian., 2018, Deep spiking neural networks
[24] Lucey P., 2010, 2010 IEEE COMP SOC C, P94
[25] Unsupervised learning of visual features through spike timing dependent plasticity
Masquelier, Timothee
Thorpe, Simon J.
[J]. PLOS COMPUTATIONAL BIOLOGY, 2007, 3 (02) : 247 - 257
[26] Mehrotra S, 2010, IEEE INT CON MULTI, P1, DOI 10.1109/ICME.2010.5582531
[27] Paugam-Moisy H., 2012, Handbook of Natural Computing, Vvol 1, P1
[28] Perez-Carrasco J. A., 2010, Proceedings of the 2010 20th International Conference on Pattern Recognition (ICPR 2010), P3085, DOI 10.1109/ICPR.2010.756
[29] Reece JB., 2011, Campbell biology, V9
[30] Ruiz-Garcia A, 2017, IEEE IJCNN, P1586, DOI 10.1109/IJCNN.2017.7966040

← 1 2 3 4 →