Improving the Accuracy of Automatic Facial Expression Recognition in Speaking Subjects with Deep Learning

Cited by: 15

Authors:

Bursic, Sathya [1,2]

Boccignone, Giuseppe [1]

Ferrara, Alfio [2]

D'Amelio, Alessandro [1]

Lanzarotti, Raffaella [1]

Affiliations:

[1] Univ Milan, Dept Comp Sci, PHuSe Lab, Via Giovanni Celoria 18, I-20133 Milan, Italy

[2] Univ Milan, Dept Comp Sci, ISLab, Via Giovanni Celoria 18, I-20133 Milan, Italy

Source:

APPLIED SCIENCES-BASEL | 2020 / Vol. 10 / Issue 11

Keywords:

facial expression recognition; speaking effect; emotion recognition; affective computing; deep learning;

DOI:

10.3390/app10114002

Chinese Library Classification (CLC) number:

O6 [Chemistry];

Discipline classification code:

0703;

Abstract:

When automatic facial expression recognition is applied to video sequences of speaking subjects, recognition accuracy has been noted to be lower than with video sequences of still subjects. This effect, known as the speaking effect, arises during spontaneous conversations, where the speech articulation process influences facial configurations along with the affective expressions. In this work we ask whether, aside from facial features, other cues relating to the articulation process would increase emotion recognition accuracy when added as input to a deep neural network model. We develop two neural networks that classify facial expressions in speaking subjects from the RAVDESS dataset: a spatio-temporal CNN and a GRU cell RNN. They are first trained on facial features only, and afterwards on both facial features and articulation-related cues extracted from a model trained for lip reading, while also varying the number of consecutive frames provided as input. We show that, using DNNs, the addition of features related to articulation increases classification accuracy by up to 12%, with the increase being greater when more consecutive frames are provided as input to the model.
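The input preparation described in the abstract (per-frame facial features combined with articulation-related cues, grouped into runs of consecutive frames) could be sketched roughly as follows. This is only an illustration under stated assumptions: the abstract does not say how the two feature sets are combined, so simple per-frame concatenation is assumed here, and the function name `fuse_and_window` and its parameters are hypothetical, not taken from the paper.

```python
from typing import List, Sequence

Vector = List[float]


def fuse_and_window(
    face_feats: Sequence[Sequence[float]],
    artic_feats: Sequence[Sequence[float]],
    num_frames: int,
) -> List[List[Vector]]:
    """Concatenate per-frame feature vectors and cut sliding windows.

    Assumption (not from the paper): fusion is plain concatenation of the
    facial feature vector and the articulation feature vector of each frame.
    Each returned window holds `num_frames` consecutive fused frames and
    could serve as one input sequence for a recurrent or spatio-temporal model.
    """
    if len(face_feats) != len(artic_feats):
        raise ValueError("both feature streams must cover the same frames")
    if num_frames < 1:
        raise ValueError("num_frames must be at least 1")

    # One fused vector per video frame.
    fused = [list(f) + list(a) for f, a in zip(face_feats, artic_feats)]

    # Sliding windows with stride 1; empty if the clip is shorter than a window.
    return [fused[i:i + num_frames] for i in range(len(fused) - num_frames + 1)]


if __name__ == "__main__":
    # Toy clip: 4 frames, 2 facial features and 1 articulation feature each.
    face = [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6], [0.7, 0.8]]
    artic = [[1.0], [2.0], [3.0], [4.0]]
    windows = fuse_and_window(face, artic, num_frames=3)
    print(len(windows), len(windows[0]), len(windows[0][0]))
```

Varying `num_frames` in such a sketch corresponds to varying the number of consecutive frames given to the model, which the abstract reports as affecting the size of the accuracy gain.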

Pages: 15
