The limitations for expression recognition in computer vision introduced by facial masks

被引:14
作者
Abate, Andrea Francesco [1 ]
Cimmino, Lucia [1 ]
Mocanu, Bogdan-Costel [2 ]
Narducci, Fabio [1 ]
Pop, Florin [2 ]
机构
[1] Univ Salerno, Dept Comp Sci, Via Giovanni Paolo II 132, I-8484 Salerno, Italy
[2] Univ Politehn Bucuresti, Fac Automat Control & Comp, Dept Syst Engn, Splaiul Independentei 313, RO-060042 Bucharest, Romania
关键词
Expression recognition; Masked face analysis; Deep learning; FACE;
D O I
10.1007/s11042-022-13559-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Facial Expression recognition is a computer vision problem that took relevant benefit from the research in deep learning. Recent deep neural networks achieved superior results, demonstrating the feasibility of recognizing the expression of a user from a single picture or a video recording the face dynamics. Research studies reveal that the most discriminating portions of the face surfaces that contribute to the recognition of facial expressions are located on the mouth and the eyes. The restrictions for COVID pandemic reasons have also revealed that state-of-the-art solutions for the analysis of the face can severely fail due to the occlusions of using the facial masks. This study explores to what extend expression recognition can deal with occluded faces in presence of masks. To a fairer comparison, the analysis is performed in different occluded scenarios to effectively assess if the facial masks can really imply a decrease in the recognition accuracy. The experiments performed on two public datasets show that some famous top deep classifiers expose a significant reduction in accuracy in presence of masks up to half of the accuracy achieved in non-occluded conditions. Moreover, a relevant decrease in performance is also reported also in the case of occluded eyes but the overall drop in performance is not as severe as in presence of the facial masks, thus confirming that, like happens for face biometric recognition, occluded faces by facial mask still represent a challenging limitation for computer vision solutions.
引用
收藏
页码:11305 / 11319
页数:15
相关论文
共 28 条
[11]   Joint Fine-Tuning in Deep Neural Networks for Facial Expression Recognition [J].
Jung, Heechul ;
Lee, Sihaeng ;
Yim, Junho ;
Park, Sunjeong ;
Kim, Junmo .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2983-2991
[12]   A Brief Review of Facial Emotion Recognition Based on Visual Information [J].
Ko, Byoung Chul .
SENSORS, 2018, 18 (02)
[13]   ImageNet Classification with Deep Convolutional Neural Networks [J].
Krizhevsky, Alex ;
Sutskever, Ilya ;
Hinton, Geoffrey E. .
COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90
[14]   Reliable Crowdsourcing and Deep Locality-Preserving Learning for Expression Recognition in the Wild [J].
Li, Shan ;
Deng, Weihong ;
Du, JunPing .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2584-2593
[15]   Cropping and attention based approach for masked face recognition [J].
Li, Yande ;
Guo, Kun ;
Lu, Yonggang ;
Liu, Li .
APPLIED INTELLIGENCE, 2021, 51 (05) :3012-3025
[16]   Multiple Spatio-temporal Feature Learning for Video-based Emotion Recognition in the Wild [J].
Lu, Cheng ;
Zheng, Wenming ;
Li, Chaolong ;
Tang, Chuangao ;
Liu, Suyuan ;
Yan, Simeng ;
Zong, Yuan .
ICMI'18: PROCEEDINGS OF THE 20TH ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2018, :646-652
[17]  
Luan P, 2020, PYTORCH
[18]   Facial Expression Recognition Using Residual Masking Network [J].
Luan Pham ;
The Huynh Vu ;
Tuan Anh Tran .
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, :4513-4519
[19]   Convolutional neural networks: Computer vision-based workforce activity assessment in construction [J].
Luo, Hanbin ;
Xiong, Chaohua ;
Fang, Weili ;
Love, Peter E. D. ;
Zhang, Bowen ;
Ouyang, Xi .
AUTOMATION IN CONSTRUCTION, 2018, 94 :282-289
[20]  
Mackey L., 2021, arXiv