Expression Recognition Based on Residual Attention Mechanism and Pyramid Convolution

被引:0
|
作者
Bao Z. [1 ]
Chen H. [1 ]
机构
[1] Faculty of Electrical Engineering and Computer Science, Ningbo University, Ningbo
基金
中国国家自然科学基金;
关键词
Expression Recognition; Multiscale Feature; Pyramid Convolution; Residual Attention Mechanism;
D O I
10.16451/j.cnki.issn1003-6059.202206002
中图分类号
学科分类号
摘要
With the widespread application of deep learning, facial expression recognition technology develops rapidly. However, how to extract multi-scale features and utilize key features efficiently is still a challenge for facial expression recognition network. To solve these problems, pyramid convolution is employed to extract multi-scale features effectively, and spatial channel attention mechanism is introduced to enhance the expression of key features. An expression recognition network based on residual attention mechanism and pyramidal convolution is constructed to improve the recognition accuracy. Multi-task convolutional neural network is utilized for face detection, face clipping and face alignment, and then the preprocessed images are sent to the feature extraction network. Meanwhile, the network is trained by combining Softmax Loss and the Center Loss to narrow the difference between the same expressions and enlarge the distance between different expressions. Experiments show that the accuracy of the proposed network on Fer2013 dataset and CK+ dataset is high, the number of network parameters is small and the proposed method is more suitable for the application of realistic scenarios of expression recognition. © 2022 Journal of Pattern Recognition and Artificial Intelligence. All rights reserved.
引用
收藏
页码:497 / 506
页数:9
相关论文
共 28 条
  • [1] PANTIC M, ROTHKRANTZ L J M., Expert System for Automatic Analysis of Facial Expressions, Image and Vision Computing, 18, 11, pp. 881-905, (2000)
  • [2] LI T H, DU C F, NAREN T, Et al., Using Feature Points and Angles between Them to Recognise Facial Expressions by a Neural Network Approach, IET Image Processing, 12, 11, pp. 1951-1955, (2018)
  • [3] BARTLETT M S, LITTLEWORT G, FASEL I, Et al., Real Time Face Detection and Facial Expression Recognition: Development and Applications to Human Computer Interaction, Proc of the Conference on Computer Vision and Pattern Recognition Workshop, (2003)
  • [4] MASE K, PENTLAND A., Automatic Lipreading by Optical-Flow Analysis, Systems and Computers in Japan, 22, 6, pp. 67-76, (1991)
  • [5] TANG Y C., Deep Learning Using Linear Support Vector Machines
  • [6] JUNG H, LEE S, YIM J, Et al., Joint Fine-Tuning in Deep Neural Networks for Facial Expression Recognition, Proc of the IEEE International Conference on Computer Vision, pp. 2983-2991, (2015)
  • [7] LIU Y Y, YUAN X H, GONG X, Et al., Conditional Convolution Neural Network Enhanced Random Forest for Facial Expression Recognition, Pattern Recognition, 84, pp. 251-261, (2018)
  • [8] HARIRI W, FARAH N., Recognition of 3D Emotional Facial Expression Based on Handcrafted and Deep Feature Combination, Pattern Recognition Letters, 148, pp. 84-91, (2021)
  • [9] LI M, XU H, HUANG X C, Et al., Facial Expression Recognition with Identity and Emotion Joint Learning, IEEE Transactions on Affective Computing, 12, 2, pp. 544-550, (2021)
  • [10] HU J, SHEN L, ALBANIE S, Et al., Squeeze-and-Excitation Networks, IEEE Transactions on Pattern Analysis and Machine Intelligence, 42, 8, pp. 2011-2023, (2020)