Improving the Accuracy of Automatic Facial Expression Recognition in Speaking Subjects with Deep Learning

被引:15
|
作者
Bursic, Sathya [1 ,2 ]
Boccignone, Giuseppe [1 ]
Ferrara, Alfio [2 ]
D'Amelio, Alessandro [1 ]
Lanzarotti, Raffaella [1 ]
机构
[1] Univ Milan, Dept Comp Sci, PHuSe Lab, Via Giovanni Celoria 18, I-20133 Milan, Italy
[2] Univ Milan, Dept Comp Sci, ISLab, Via Giovanni Celoria 18, I-20133 Milan, Italy
来源
APPLIED SCIENCES-BASEL | 2020年 / 10卷 / 11期
关键词
facial expression recognition; speaking effect; emotion recognition; affective computing; deep learning;
D O I
10.3390/app10114002
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
When automatic facial expression recognition is applied to video sequences of speaking subjects, the recognition accuracy has been noted to be lower than with video sequences of still subjects. This effect known as the speaking effect arises during spontaneous conversations, and along with the affective expressions the speech articulation process influences facial configurations. In this work we question whether, aside from facial features, other cues relating to the articulation process would increase emotion recognition accuracy when added in input to a deep neural network model. We develop two neural networks that classify facial expressions in speaking subjects from the RAVDESS dataset, a spatio-temporal CNN and a GRU cell RNN. They are first trained on facial features only, and afterwards both on facial features and articulation related cues extracted from a model trained for lip reading, while varying the number of consecutive frames provided in input as well. We show that using DNNs the addition of features related to articulation increases classification accuracy up to 12%, the increase being greater with more consecutive frames provided in input to the model.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Local Learning With Deep and Handcrafted Features for Facial Expression Recognition
    Georgescu, Mariana-Iuliana
    Ionescu, Radu Tudor
    Popescu, Marius
    IEEE ACCESS, 2019, 7 : 64827 - 64836
  • [42] 3D Facial Expression Recognition Using Multi-channel Deep Learning Framework
    R. Ramya
    K. Mala
    S. Selva Nidhyananthan
    Circuits, Systems, and Signal Processing, 2020, 39 : 789 - 804
  • [43] A dynamic fusion of features from deep learning and the HOG-TOP algorithm for facial expression recognition
    Chouhayebi, Hajar
    Mahraz, Mohamed Adnane
    Riffi, Jamal
    Tairi, Hamid
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (11) : 32993 - 33017
  • [44] Facial expression recognition via learning deep sparse autoencoders
    Zeng, Nianyin
    Zhang, Hong
    Song, Baoye
    Liu, Weibo
    Li, Yurong
    Dobaie, Abdullah M.
    NEUROCOMPUTING, 2018, 273 : 643 - 649
  • [45] Deep Disturbance-Disentangled Learning for Facial Expression Recognition
    Ruan, Delian
    Yan, Yan
    Chen, Si
    Xue, Jing-Hao
    Wang, Hanzi
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 2833 - 2841
  • [46] Automatic facial expression recognition: feature extraction and selection
    Lajevardi, Seyed Mehdi
    Hussain, Zahir M.
    SIGNAL IMAGE AND VIDEO PROCESSING, 2012, 6 (01) : 159 - 169
  • [47] Automatic facial expression recognition: feature extraction and selection
    Seyed Mehdi Lajevardi
    Zahir M. Hussain
    Signal, Image and Video Processing, 2012, 6 : 159 - 169
  • [48] Deep Learning Based Transfer Learning for Possible Facial Psychological Expression Recognition
    Li, Mi
    Cao, Lei
    Liu, Dachao
    Li, Leilei
    Lu, Shengfu
    JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2018, 8 (07) : 1478 - 1485
  • [49] AUTOMATIC FACIAL FEATURE DETECTION FOR FACIAL EXPRESSION RECOGNITION
    Danisman, Taner
    Bilasco, Marius
    Lhaddadene, Nacim
    Djeraba, Chabane
    VISAPP 2010: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTER VISION THEORY AND APPLICATIONS, VOL 2, 2010, : 407 - 412
  • [50] Deep learning enhanced attributes conditional random forest for robust facial expression recognition
    Haibin Liao
    Dianhua Wang
    Ping Fan
    Ling Ding
    Multimedia Tools and Applications, 2021, 80 : 28627 - 28645