Handling Ambiguous Annotations for Facial Expression Recognition in the Wild

被引:0
作者
Gera, Darshan [1 ]
Vikas, G. N. [2 ]
Balasubramanian, S. [2 ]
机构
[1] SSSIHL, Brindavan Campus, Bengaluru, Karnataka, India
[2] SSSIHL, Prasanthi Nilayam Campus Anantpur, Anantapur, Andhra Pradesh, India
来源
PROCEEDINGS OF THE TWELFTH INDIAN CONFERENCE ON COMPUTER VISION, GRAPHICS AND IMAGE PROCESSING, ICVGIP 2021 | 2021年
关键词
Ambiguous annotations; Consistency; Strong augmentation; Weak-augmentation; Facial Expression Recognition; DEEP;
D O I
10.1145/3490035.3490289
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Annotation ambiguity due to subjectivity of annotators, crowd-sourcing, inter-class similarity and poor quality of facial expression images has been a key challenge towards robust Facial Expression Recognition (FER). Recent deep learning (DL) solutions for this problem select clean samples for training by using two or more networks simultaneously. Based on the observation that wrongly annotated samples have inconsistent predictions compared to clean samples when transformed using different augmentations, we propose a simple and effective single network FER framework robust to noisy annotations. Specifically, we qualify an image to be clean (correctly labeled) if the Jenson-Shannon (JS) divergence between its ground truth distribution and the predicted distribution for its weak augmented version is smaller than a threshold. The threshold is dynamically tuned. The qualified clean samples facilitate supervision during training. Further, to learn hard samples (correctly labeled but difficult to classify), we enforce consistency between the predicted distributions of weak and strong augmented versions of every training image through a consistency loss. Comprehensive experiments on FER datasets like RAFDB, FERPlus, curated FEC and AffectNet in the presence of both synthetic and real noisy annotation settings demonstrate the robustness of the proposed method. The source codes are publicly available at https://github.com/1980x/HandlingAmbigiousFERAnnotations.
引用
收藏
页数:9
相关论文
共 48 条
[11]   Local Learning With Deep and Handcrafted Features for Facial Expression Recognition [J].
Georgescu, Mariana-Iuliana ;
Ionescu, Radu Tudor ;
Popescu, Marius .
IEEE ACCESS, 2019, 7 :64827-64836
[12]   Landmark guidance independent spatio-channel attention and complementary context information based facial expression recognition [J].
Gera, Darshan ;
Balasubramanian, S. .
PATTERN RECOGNITION LETTERS, 2021, 145 :58-66
[13]  
Gera Darshan, 2021, International Journal of Engineering Trends and Technology, V69, P244, DOI DOI 10.14445/22315381/IJETT-V69I7P231
[14]  
Ghosh A, 2017, AAAI CONF ARTIF INTE, P1919
[15]  
Gold JR, 2017, PLAN HIST ENVIRON SE, P1
[16]   MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition [J].
Guo, Yandong ;
Zhang, Lei ;
Hu, Yuxiao ;
He, Xiaodong ;
Gao, Jianfeng .
COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 :87-102
[17]   Co-teaching: Robust Training of Deep Neural Networks with Extremely Noisy Labels [J].
Han, Bo ;
Yao, Quanming ;
Yu, Xingrui ;
Niu, Gang ;
Xu, Miao ;
Hu, Weihua ;
Tsang, Ivor W. ;
Sugiyama, Masashi .
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[18]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[19]   HERO: Human Emotions Recognition for Realizing Intelligent Internet of Things [J].
Hua, Wentao ;
Dai, Fei ;
Huang, Liya ;
Xiong, Jian ;
Gui, Guan .
IEEE ACCESS, 2019, 7 :24321-24332
[20]   Facial Expression Recognition with Inconsistently Annotated Datasets [J].
Zeng, Jiabei ;
Shan, Shiguang ;
Chen, Xilin .
COMPUTER VISION - ECCV 2018, PT XIII, 2018, 11217 :227-243