Dual integrated convolutional neural network for real-time facial expression recognition in the wild

被引:0
作者
Sumeet Saurav
Prashant Gidde
Ravi Saini
Sanjay Singh
机构
[1] Academy of Scientific and Innovative Research,
[2] CSIR-Central Electronics Engineering Research Institute,undefined
来源
The Visual Computer | 2022年 / 38卷
关键词
Deep convolutional neural network; Embedded implementation; CNN optimization; Facial expression recognition;
D O I
暂无
中图分类号
学科分类号
摘要
Automatic recognition of facial expressions in the wild is a challenging problem and has drawn a lot of attention from the computer vision and pattern recognition community. Since their emergence, the deep learning techniques have proved their efficacy in facial expression recognition (FER) tasks. However, these techniques are parameter intensive, and thus, could not be deployed on resource-constrained embedded platforms for real-world applications. To mitigate these limitations of the deep learning inspired FER systems, in this paper, we present an efficient dual integrated convolution neural network (DICNN) model for the recognition of facial expressions in the wild in real-time, running on an embedded platform. The designed DICNN model with just 1.08M parameters and 5.40 MB memory storage size achieves optimal performance by maintaining a proper balance between recognition accuracy and computational efficiency. We evaluated the DICNN model on four FER benchmark datasets (FER2013, FERPlus, RAF-DB, and CKPlus) using different performance evaluation metrics, namely the recognition accuracy, precision, recall, and F1-score. Finally, to provide a portable solution with high throughput inference, we optimized the designed DICNN model using TensorRT SDK and deployed it on an Nvidia Xavier embedded platform. Comparative analysis results with the other state-of-the-art methods revealed the effectiveness of the designed FER system, which achieved competitive accuracy with multi-fold improvement in the execution speed.
引用
收藏
页码:1083 / 1096
页数:13
相关论文
共 165 条
  • [1] Zhao J(2019)Speech emotion recognition using deep 1D & 2D cnn lstm networks Biomed. Signal Process. Control 47 312-323
  • [2] Mao X(2019)3D cnn-based speech emotion recognition using k-means clustering and spectrograms Entropy 21 479-2881
  • [3] Chen L(2019)Sae+ lstm: a new framework for emotion recognition from multi-channel eeg Front. Neurorobot. 13 37-4536
  • [4] Hajarolasvadi N(2019)Eeg based emotion recognition by combining functional connectivity network and local activations IEEE Trans. Biomed. Eng. 66 2869-985
  • [5] Demirel H(2017)Facial expression recognition utilizing local direction-based robust features and deep belief network IEEE Access 5 4525-29
  • [6] Xing X(2019)Facial emotion recognition using an ensemble of multi-level convolutional neural networks Int. J. Pattern Recognit Artif. Intell. 33 1940015-134
  • [7] Li Z(2019)Audiovisual emotion recognition in wild Mach. Vis. Appl. 30 975-94011
  • [8] Xu T(2020)The design of cnn architectures for optimal six basic emotion classification using multiple physiological signals Sensors 20 866-498
  • [9] Shu L(2019)Automatic detection of students’ affective states in classroom environment using hybrid convolutional neural networks Educ. Inf. Technol. 4 1-1475
  • [10] Hu B(2020)Deep convolution network based emotion analysis towards mental health care Neurocomputing 3 10-8