A ROI-guided deep architecture for robust facial expressions recognition

被引:26
作者
Sun, Xiao [1 ]
Xia, Pingping [1 ]
Zhang, Luming [2 ]
Shao, Ling [3 ]
机构
[1] Hefei Univ Technol, Sch Comp & Informat, Hefei, Peoples R China
[2] Zhejiang Univ, Coll Comp Sci, Hangzhou, Zhejiang, Peoples R China
[3] Inst Artificial Intelligence, Abu Dhabi, U Arab Emirates
基金
中国国家自然科学基金;
关键词
Convolutional neural network; Facial expression recognition; Region of interest; Artificial face; REPRESENTATION; ENSEMBLE;
D O I
10.1016/j.ins.2020.02.047
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This work proposes a robust facial expression recognition framework, focusing on discovering the region of interest (ROI) to train an effective face-specific of convolutional neural networks (CNN). By exploiting the relationships among ROI areas, the proposed deep architecture can improve the reliability of predicted targets. Our designed deep model is fine-tuned based on a pre-specified deep CNN instead of a new one trained from scratch. To increase the face expressions toward a robust deep CNN training, a novel data augmentation strategy called artificial face is designed. The performance of our deep architecture is evaluated on state-of-the-art databases such as CK+. To demonstrate the high generalizability of our approach, cross-database validations are conducted on the JAFFE and our own compiled Wild database. Comprehensive experiments have demonstrated the superiority of the method, i.e., achieving a recognition accuracy of 94.67% on the CK+ database, 53.77% on the JAFFE cross-database, 40.13% on the FER-2013 cross-database, and 37.25% on the Wild cross-database respectively. (C) 2020 Elsevier Inc. All rights reserved.
引用
收藏
页码:35 / 48
页数:14
相关论文
共 44 条
  • [21] Do Deep Neural Networks Learn Facial Action Units When Doing Expression Recognition?
    Khorrami, Pooya
    Le Paine, Tom
    Huang, Thomas S.
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOP (ICCVW), 2015, : 19 - 27
  • [22] Hierarchical Committee of Deep CNNs with Exponentially-Weighted Decision Fusion for Static Facial Expression Recognition
    Kim, Bo-Kyeong
    Lee, Hwaran
    Roh, Jihyeon
    Lee, Soo-Young
    [J]. ICMI'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2015, : 427 - 434
  • [23] King DE, 2009, J MACH LEARN RES, V10, P1755
  • [24] A new approach to the detection of moving objects
    Li, CCP
    Kurz, L
    [J]. INFORMATION SCIENCES, 1997, 103 (1-4) : 115 - 134
  • [25] Facial Skin Beautification Using Adaptive Region-Aware Masks
    Liang, Lingyu
    Jin, Lianwen
    Li, Xuelong
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2014, 44 (12) : 2600 - 2612
  • [26] Unsupervised Local Feature Hashing for Image Similarity Search
    Liu, Li
    Yu, Mengyang
    Shao, Ling
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2016, 46 (11) : 2548 - 2558
  • [27] AU-inspired Deep Networks for Facial Expression Feature Learning
    Liu, Mengyi
    Li, Shaoxin
    Shan, Shiguang
    Chen, Xilin
    [J]. NEUROCOMPUTING, 2015, 159 : 126 - 136
  • [28] Facial Expression Recognition via a Boosted Deep Belief Network
    Liu, Ping
    Han, Shizhong
    Meng, Zibo
    Tong, Yan
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 1805 - 1812
  • [29] Facial expression recognition with Convolutional Neural Networks: Coping with few data and the training sample order
    Lopes, Andre Teixeira
    de Aguiar, Edilson
    De Souza, Alberto F.
    Oliveira-Santos, Thiago
    [J]. PATTERN RECOGNITION, 2017, 61 : 610 - 628
  • [30] Lucey P., 2010, P 2010 IEEE COMP VIS, P94, DOI [DOI 10.1109/CVPRW.2010.5543262, 10.1109/CVPRW.2010.5543262]