A new multi-feature fusion based convolutional neural network for facial expression recognition

被引:32
作者
Zou, Wei [1 ]
Zhang, Dong [1 ]
Lee, Dah-Jye [2 ]
机构
[1] Sun Yat Sen Univ, Sch Elect & Informat Technol, Guangzhou 510006, Guangdong, Peoples R China
[2] Brigham Young Univ, Dept Elect & Comp Engn, Provo, UT 84602 USA
关键词
Facial expression recognition; Multi-feature fusion convolutional neural network; Feature selection; Joint tuning; FACE;
D O I
10.1007/s10489-021-02575-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Using lightweight networks for facial expression recognition (FER) is becoming an important research topic in recent years. The key to the success of FER with lightweight networks is to explore the potentials of expression features in distinct abstract levels and regions, and design robust features to characterize the facial appearance. This paper proposes a lightweight network called Multi-feature Fusion Based Convolutional Neural Network (MFF-CNN), for image-based FER. The proposed model uses the Image Branch to extract both mid-level and high-level global features from the whole input image and utilizes the Patch Branch to extract local features from sixteen image patches of the original image. In MFF-CNN, feature selection based on L2 norm is performed to obtain more discriminative local features. Joint tuning is employed to integrate the two branches and fuse features. Experiment results on three widely used datasets, CK+, JAFFE and Oulu-CASIA show the proposed MFF-CNN outperforms the state-of-the-art methods in terms of average recognition accuracy. Compared to other competitive models with similar or larger number of parameters, our MFF-CNN improves the average recognition accuracy by 9.80% to 15.05%.
引用
收藏
页码:2918 / 2929
页数:12
相关论文
共 37 条
[1]   Enhanced Gabor (E-Gabor), Hypersphere-based normalization and Pearson General Kernel-based discriminant analysis for dimension reduction and classification of facial emotions [J].
Alphonse, A. Sherly ;
Dharma, Dejey .
EXPERT SYSTEMS WITH APPLICATIONS, 2017, 90 :127-145
[2]   Aggregating Deep Convolutional Features for Image Retrieval [J].
Babenko, Artem ;
Lempitsky, Victor .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1269-1277
[3]   A Face-to-Face Neural Conversation Model [J].
Chu, Hang ;
Li, Daiqing ;
Fidler, Sanja .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7113-7121
[4]   RetinaFace: Single-shot Multi-level Face Localisation in the Wild [J].
Deng, Jiankang ;
Guo, Jia ;
Ververas, Evangelos ;
Kotsia, Irene ;
Zafeiriou, Stefanos .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :5202-5211
[5]   Video and Image based Emotion Recognition Challenges in the Wild: EmotiW 2015 [J].
Dhall, Abhinav ;
Murthy, O. V. Ramana ;
Goecke, Roland ;
Joshi, Jyoti ;
Gedeon, Tom .
ICMI'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2015, :423-426
[6]   FaceNet2ExpNet: Regularizing a Deep Face Recognition Net for Expression Recognition [J].
Ding, Hui ;
Zhou, Shaohua Kevin ;
Chellappa, Rama .
2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017, :118-126
[7]   CONSTANTS ACROSS CULTURES IN FACE AND EMOTION [J].
EKMAN, P ;
FRIESEN, WV .
JOURNAL OF PERSONALITY AND SOCIAL PSYCHOLOGY, 1971, 17 (02) :124-&
[8]  
Ekman P., 1978, FACIAL ACTION CODING
[9]   Facial Expression Recognition Using a Temporal Ensemble of Multi-Level Convolutional Neural Networks [J].
Hai-Duong Nguyen ;
Kim, Sun-Hee ;
Lee, Guee-Sang ;
Yang, Hyung-Jeong ;
Na, In-Seop ;
Kim, Soo-Hyung .
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2022, 13 (01) :226-237
[10]  
Hamester D, 2015, IEEE IJCNN