Convolutional relation network for facial expression recognition in the wild with few-shot learning

被引:44
作者
Zhu, Qing [1 ]
Mao, Qirong [1 ]
Jia, Hongjie [1 ]
Noi, Ocquaye Elias Nii [1 ]
Tu, Juanjuan [2 ]
机构
[1] Jiangsu Univ, Sch Comp Sci & Commun Engn, Zhenjiang 212013, Jiangsu, Peoples R China
[2] Jiangsu Univ Sci & Technol, Sch Comp Sci & Engn, Zhenjiang 212003, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Facial expression recognition; Few-shot learning; Discriminative feature analysis; Feature learning;
D O I
10.1016/j.eswa.2021.116046
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent deep learning based facial expression recognition (FER) methods are mostly driven by the availability of large amount of training data. However, availability of such data is not always possible for FER in the wild where the infeasibility of obtaining sufficient training samples for each emotion category. Therefore, in this paper, we introduce the few-shot learning to construct a deep learning model named Convolutional Relation Network (CRN) for FER in the wild, which is learnt by exploiting a feature similarity comparison among the sufficient samples of the emotion categories to identify new classes with few samples. Specifically, our method learns a metric space in which classification can be performed by computing distances to capitalize on powerful discriminative ability of deep expression features to generalize the predictive power of the network. To achieve this, the features are constrained to maximize the distance between the features of different classes and discover the commonality of the same classes. Extensive experiments on three challenging in-the-wild datasets demonstrate that the proposed model significantly outperforms state-of-the-art methods.
引用
收藏
页数:9
相关论文
共 44 条
[1]  
Acevedo D, 2016, INT C PATT RECOG, P4124, DOI 10.1109/ICPR.2016.7900280
[2]   Enhanced Gabor (E-Gabor), Hypersphere-based normalization and Pearson General Kernel-based discriminant analysis for dimension reduction and classification of facial emotions [J].
Alphonse, A. Sherly ;
Dharma, Dejey .
EXPERT SYSTEMS WITH APPLICATIONS, 2017, 90 :127-145
[3]  
Andrychowicz M, 2016, ADV NEUR IN, V29
[4]  
[Anonymous], 2013, ARXIV
[5]   Detecting Regions of Maximal Divergence for Spatio-Temporal Anomaly Detection [J].
Barz, Bjorn ;
Rodner, Erik ;
Garcia, Yanira Guanche ;
Denzler, Joachim .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (05) :1088-1101
[6]   Selective Transfer Machine for Personalized Facial Expression Analysis [J].
Chu, Wen-Sheng ;
De la Torre, Fernando ;
Cohn, Jeffrey F. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (03) :529-545
[7]  
Dhall A, 2011, 2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCV WORKSHOPS)
[8]  
Finn C, 2017, PR MACH LEARN RES, V70
[9]   Dynamic Few-Shot Visual Learning without Forgetting [J].
Gidaris, Spyros ;
Komodakis, Nikos .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4367-4375
[10]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587