WGAN-Based Robust Occluded Facial Expression Recognition

被引：36

作者：

Lu, Yang ^{[1
]}

Wang, Shigang ^{[1
]}

Zhao, Wenting ^{[1
]}

Zhao, Yan ^{[1
]}

机构：

[1] Jilin Univ, Coll Commun Engn, Changchun 130012, Jilin, Peoples R China

来源：

IEEE ACCESS | 2019年 / 7卷

基金：

中国国家自然科学基金;

关键词：

Facial expression recognition; partial occlusion; image complementation; Wasserstein generative adversarial network; SPARSE REPRESENTATION; GAUSSIAN-PROCESSES; FACE DETECTION; FEATURES; CONTEXT; VECTOR; SYSTEM; GABOR; LBP;

D O I：

10.1109/ACCESS.2019.2928125

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Research on facial expression recognition (FER) technology can promote the development of theoretical and practical applications for our daily life. Currently, most of the related works on this technology are focused on un-occluded FER. However, in real life, facial expression images often have partial occlusion; therefore, the accurate recognition of occluded facial expression images is a topic that should be explored. In this paper, we proposed a novel Wasserstein generative adversarial network-based method to perform occluded FER. After complementing the face occlusion image with complex facial expression information, the recognition is achieved by learning the facial expression features of the images. This method consists of a generator G and two discriminators D-1 and D-2. The generator naturally complements occlusion in the expression image under the triple constraints of weighted reconstruction loss l(wr), triplet loss l(t), and adversarial loss l(a). We optimize the discriminator D1 to distinguish between real and fake by constructing an adversarial loss l(a) between the generated complementing images, original un-occluded images, and small-scale-occluded images based on the Wasserstein distance. Finally, the FER is completed by introducing classification loss l(c) into D-2. To verify the effectiveness of the proposed method, an experimental analysis was performed on the AffectNet and RAF-DB datasets. The visual occlusion complementing results, comparison of recognition rates of facial expression images with and without de-occlusion processing, and T-distributed stochastic neighbor embedding visual analysis of facial expression features all prove the effectiveness of the proposed method. The experimental results show that the proposed method is better than the existing state-of-the-art methods.

引用

页码：93594 / 93610

页数：17

共 73 条

[1] Facial expression recognition and synthesis based on an appearance model
Abboud, B
Davoine, F
Dang, M
[J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2004, 19 (08) : 723 - 740
[2] [Anonymous], 2016, INT C LEARNING REPRE
[3] [Anonymous], 14091556 ARXIV
[4] [Anonymous], SIGNAL PROCESS
[5] [Anonymous], ACM T GRAPH
[6] Arjovsky M., 2017, ARXIV170107875
[7] CVAE-GAN: Fine-Grained Image Generation through Asymmetric Training
Bao, Jianmin
Chen, Dong
Wen, Fang
Li, Houqiang
Hua, Gang
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2764 - 2773
[8] Detecting Driver Drowsiness A survey of system designs and technology
Chacon-Murguia, Mario I.
Prieto-Resendiz, Claudia
[J]. IEEE CONSUMER ELECTRONICS MAGAZINE, 2015, 4 (04) : 107 - 119
[9] Chen YJ, 2018, 2018 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SAFETY FOR ROBOTICS (ISR), P319, DOI 10.1109/IISR.2018.8535644
[10] Child's Perception of Robot's Emotions: Effects of Platform, Context and Experience
Cohen, I.
Looije, R.
Neerincx, M. A.
[J]. INTERNATIONAL JOURNAL OF SOCIAL ROBOTICS, 2014, 6 (04) : 507 - 518

← 1 2 3 4 5 6 7 8 →