Progressive Spatio-Temporal Bilinear Network with Monte Carlo Dropout for Landmark-based Facial Expression Recognition with Uncertainty Estimation

被引:5
作者
Heidari, Negar [1 ]
Iosifidis, Alexandros [1 ]
机构
[1] Aarhus Univ, Dept Elect & Comp Engn, Aarhus, Denmark
来源
IEEE MMSP 2021: 2021 IEEE 23RD INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP) | 2021年
基金
欧盟地平线“2020”;
关键词
D O I
10.1109/MMSP53017.2021.9733455
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Deep neural networks have been widely used for feature learning in facial expression recognition systems. However, small datasets and large intra-class variability can lead to overfitting. In this paper, we propose a method which learns an optimized compact network topology for real-time facial expression recognition utilizing localized facial landmark features. Our method employs a spatio-temporal bilinear layer as backbone to capture the motion of facial landmarks during the execution of a facial expression effectively. Besides, it takes advantage of Monte Carlo Dropout to capture the model's uncertainty which is of great importance to analyze and treat uncertain cases. The performance of our method is evaluated on three widely used datasets and it is comparable to that of video-based state-of-the-art methods while it has much less complexity.
引用
收藏
页数:6
相关论文
共 30 条
[1]   Collecting Large, Richly Annotated Facial-Expression Databases from Movies [J].
Dhall, Abhinav ;
Goecke, Roland ;
Lucey, Simon ;
Gedeon, Tom .
IEEE MULTIMEDIA, 2012, 19 (03) :34-41
[2]  
Dhall Abhinav, 2014, P ACM INT C MULT INT
[3]   Style Aggregated Network for Facial Landmark Detection [J].
Dong, Xuanyi ;
Yan, Yan ;
Ouyang, Wanli ;
Yang, Yi .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :379-388
[4]  
Gal Y, 2016, PR MACH LEARN RES, V48
[5]   Landmark guidance independent spatio-channel attention and complementary context information based facial expression recognition [J].
Gera, Darshan ;
Balasubramanian, S. .
PATTERN RECOGNITION LETTERS, 2021, 145 :58-66
[6]   PROGRESSIVE SPATIO-TEMPORAL GRAPH CONVOLUTIONAL NETWORK FOR SKELETON-BASED HUMAN ACTION RECOGNITION [J].
Heidari, Negar ;
Iosifidis, Alexandros .
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, :3220-3224
[7]  
Heidari Negar, 2020, ARXIV PREPRINT ARXIV
[8]  
Hu P, 2017, PROCEEDINGS OF THE 19TH ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, ICMI 2017, P553, DOI 10.1145/3136755.3143009
[9]   Joint Fine-Tuning in Deep Neural Networks for Facial Expression Recognition [J].
Jung, Heechul ;
Lee, Sihaeng ;
Yim, Junho ;
Park, Sunjeong ;
Kim, Junmo .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2983-2991
[10]  
Kanade T., 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580), P46, DOI 10.1109/AFGR.2000.840611