Discriminant Distribution-Agnostic Loss for Facial Expression Recognition in the Wild

被引:52
作者
Farzaneh, Amir Hossein [1 ]
Qi, Xiaojun [1 ]
机构
[1] Utah State Univ, Dept Comp Sci, Logan, UT 84322 USA
来源
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020) | 2020年
关键词
D O I
10.1109/CVPRW50498.2020.00211
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Facial Expression Recognition (FER) has demonstrated remarkable progress due to the advancement of deep Convolutional Neural Networks (CNNs). FER's goal as a visual recognition problem is to learn a mapping from the facial embedding space to a set of fixed expression categories using a supervised learning algorithm. Softmax loss as the de facto standard in practice fails to learn discriminative features for efficient learning. Center loss and its variants as promising solutions increase deep feature discriminability in the embedding space and enable efficient learning. They fundamentally aim to maximize intra-class similarity and inter-class separation in the embedding space. However, center loss and its variants ignore the underlying extreme class imbalance in challenging wild FER datasets. As a result, they lead to a separation bias toward majority classes and leave minority classes overlapped in the embedding space. In this paper, we propose a novel Discriminant Distribution-Agnostic loss (DDA loss) to optimize the embedding space for extreme class imbalance scenarios. Specifically, DDA loss enforces inter-class separation of deep features for both majority and minority classes. Any CNN model can be trained with the DDA loss to yield well separated deep feature clusters in the embedding space. We conduct experiments on two popular large-scale wild FER datasets (RAF-DB and AffectNet) to show the discriminative power of the proposed loss function.
引用
收藏
页码:1631 / 1639
页数:9
相关论文
共 33 条
[1]  
[Anonymous], 2019, ACML, DOI DOI 10.1145/3357384.3358137
[2]  
[Anonymous], IEEE I CONF COMP VIS, DOI DOI 10.1109/ICCV.2015.123
[3]   Training Deep Networks for Facial Expression Recognition with Crowd-Sourced Label Distribution [J].
Barsoum, Emad ;
Zhang, Cha ;
Ferrer, Cristian Canton ;
Zhang, Zhengyou .
ICMI'16: PROCEEDINGS OF THE 18TH ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2016, :279-283
[4]   Island Loss for Learning Discriminative Features in Facial Expression Recognition [J].
Cai, Jie ;
Meng, Zibo ;
Khan, Ahmed Shehab ;
Li, Zhiyuan ;
O'Reilly, James ;
Tong, Yan .
PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, :302-309
[5]  
Florea C., 2019, BMVC, P12
[6]   Local Learning With Deep and Handcrafted Features for Facial Expression Recognition [J].
Georgescu, Mariana-Iuliana ;
Ionescu, Radu Tudor ;
Popescu, Marius .
IEEE ACCESS, 2019, 7 :64827-64836
[7]   Challenges in representation learning: A report on three machine learning contests [J].
Goodfellow, Ian J. ;
Erhan, Dumitru ;
Carrier, Pierre Luc ;
Courville, Aaron ;
Mirza, Mehdi ;
Hamner, Ben ;
Cukierski, Will ;
Tang, Yichuan ;
Thaler, David ;
Lee, Dong-Hyun ;
Zhou, Yingbo ;
Ramaiah, Chetan ;
Feng, Fangxiang ;
Li, Ruifan ;
Wang, Xiaojie ;
Athanasakis, Dimitris ;
Shawe-Taylor, John ;
Milakov, Maxim ;
Park, John ;
Ionescu, Radu ;
Popescu, Marius ;
Grozea, Cristian ;
Bergstra, James ;
Xie, Jingjing ;
Romaszko, Lukasz ;
Xu, Bing ;
Chuang, Zhang ;
Bengio, Yoshua .
NEURAL NETWORKS, 2015, 64 :59-63
[8]   Gaussian Affinity for Max-margin Class Imbalanced Learning [J].
Hayat, Munawar ;
Khan, Salman ;
Zamir, Syed Waqas ;
Shen, Jianbing ;
Shao, Ling .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6478-6488
[9]   Learning from Imbalanced Data [J].
He, Haibo ;
Garcia, Edwardo A. .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2009, 21 (09) :1263-1284
[10]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778