Robust and energy-efficient expression recognition based on improved deep ResNets

被引:4
作者
Chen, Yunhua [1 ]
Du, Jin [1 ]
Liu, Qian [2 ]
Zhang, Ling [1 ]
Zeng, Yanjun [3 ]
机构
[1] Guangdong Univ Technol, Sch Comp, Guangzhou 510006, Guangdong, Peoples R China
[2] aiCTX AG, Thurgauerstr 40, CH-8050 Zurich, Switzerland
[3] Beijing Univ Technol, Biomed Engn Ctr, Beijing 100124, Peoples R China
来源
BIOMEDICAL ENGINEERING-BIOMEDIZINISCHE TECHNIK | 2019年 / 64卷 / 05期
关键词
Convolutional Neural Networks; deep residual networks; facial expression recognition; leaky integrate- and-fire (LIF) neurons; Noisy Softplus; MODEL;
D O I
10.1515/bmt-2018-0027
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
To improve the robustness and to reduce the energy consumption of facial expression recognition, this study proposed a facial expression recognition method based on improved deep residual networks (ResNets). Residual learning has solved the degradation problem of deep Convolutional Neural Networks (CNNs); therefore, in theory, a ResNet can consist of infinite number of neural layers. On the one hand, ResNets benefit from better performance on artificial intelligence (AI) tasks, thanks to its deeper network structure; meanwhile, on the other hand, it faces a severe problem of energy consumption, especially on mobile devices. Hence, this study employs a novel activation function, the Noisy Softplus (NSP), to replace rectified linear units (ReLU) to get improved ResNets. NSP is a biologically plausible activation function, which was first proposed in training Spiking Neural Networks (SNNs); thus, NSP-trained models can be directly implemented on ultra-low-power neuromorphic hardware. We built an 18-layered ResNet using NSP to perform facial expression recognition across datasets Cohn-Kanade (CK+), Karolinska Directed Emotional Faces (KDEF) and GENKI-4K. The results achieved better antinoise ability than ResNet using the activation function ReLU and showed low energy consumption running on neuromorphic hardware. This study not only contributes a solution for robust facial expression recognition, but also consolidates the low energy cost of their implementation on neuromorphic devices, which could pave the way for high-performance, noise-robust and energy-efficient vision applications on mobile hardware.
引用
收藏
页码:519 / 528
页数:10
相关论文
共 40 条
  • [21] Deeply Learning Deformable Facial Action Parts Model for Dynamic Expression Analysis
    Liu, Mengyi
    Li, Shaoxin
    Shan, Shiguang
    Wang, Ruiping
    Chen, Xilin
    [J]. COMPUTER VISION - ACCV 2014, PT IV, 2015, 9006 : 143 - 157
  • [22] Noisy Softplus: A Biology Inspired Activation Function
    Liu, Qian
    Furber, Steve
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2016, PT IV, 2016, 9950 : 405 - 412
  • [23] Liu Qian., 2018, Deep spiking neural networks
  • [24] Lucey P., 2010, 2010 IEEE COMP SOC C, P94
  • [25] Unsupervised learning of visual features through spike timing dependent plasticity
    Masquelier, Timothee
    Thorpe, Simon J.
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2007, 3 (02) : 247 - 257
  • [26] Mehrotra S, 2010, IEEE INT CON MULTI, P1, DOI 10.1109/ICME.2010.5582531
  • [27] Paugam-Moisy H., 2012, Handbook of Natural Computing, Vvol 1, P1
  • [28] Perez-Carrasco J. A., 2010, Proceedings of the 2010 20th International Conference on Pattern Recognition (ICPR 2010), P3085, DOI 10.1109/ICPR.2010.756
  • [29] Reece JB., 2011, Campbell biology, V9
  • [30] Ruiz-Garcia A, 2017, IEEE IJCNN, P1586, DOI 10.1109/IJCNN.2017.7966040