Dual Encoder-Decoder Based Generative Adversarial Networks for Disentangled Facial Representation Learning

被引：8

作者：

Hu, Cong ^{[1
,2
,3
]}

Feng, Zhenhua ^{[4
,5
]}

Wu, Xiaojun ^{[1
,2
]}

Kittler, Josef ^{[5
]}

机构：

[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Jiangsu, Peoples R China

[2] Jiangnan Univ, Jiangsu Prov Engn Lab Pattern Recognit & Computat, Wuxi 214122, Jiangsu, Peoples R China

[3] Minjiang Univ, Fujian Prov Key Lab Informat Proc & Intelligent C, Fuzhou 350121, Peoples R China

[4] Univ Surrey, Dept Comp Sci, Guildford GU2 7XH, Surrey, England

[5] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford GU2 7XH, Surrey, England

来源：

IEEE ACCESS | 2020年 / 8卷

基金：

英国工程与自然科学研究理事会; 中国国家自然科学基金;

关键词：

Face; Gallium nitride; Generative adversarial networks; Training; Generators; Face recognition; Task analysis; Disentangled representation learning; encoder-decoder; generative adversarial networks; face synthesis; pose invariant face recognition; FACE RECOGNITION;

D O I：

10.1109/ACCESS.2020.3009512

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

To learn disentangled representations of facial images, we present a Dual Encoder-Decoder based Generative Adversarial Network (DED-GAN). In the proposed method, both the generator and discriminator are designed with deep encoder-decoder architectures as their backbones. To be more specific, the encoder-decoder structured generator is used to learn a pose disentangled face representation, and the encoder-decoder structured discriminator is tasked to perform real/fake classification, face reconstruction, determining identity and estimating face pose. We further improve the proposed network architecture by minimizing the additional pixel-wise loss defined by the Wasserstein distance at the output of the discriminator so that the adversarial framework can be better trained. Additionally, we consider face pose variation to be continuous, rather than discrete in existing literature, to inject richer pose information into our model. The pose estimation task is formulated as a regression problem, which helps to disentangle identity information from pose variations. The proposed network is evaluated on the tasks of pose-invariant face recognition (PIFR) and face synthesis across poses. An extensive quantitative and qualitative evaluation carried out on several controlled and in-the-wild benchmarking datasets demonstrates the superiority of the proposed DED-GAN method over the state-of-the-art approaches.

引用

页码：130159 / 130171

页数：13

共 50 条

[41] Frequency-Based Motion Representation for Video Generative Adversarial Networks
Hyun, Sangeek
Lew, Jaihyun
Chung, Jiwoo
Kim, Euiyeon
Heo, Jae-Pil
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 3949 - 3963
[42] Multimodal Encoder-Decoder Attention Networks for Visual Question Answering
Chen, Chongqing
Han, Dezhi
Wang, Jun
IEEE ACCESS, 2020, 8 : 35662 - 35671
[43] JigsawGAN: Auxiliary Learning for Solving Jigsaw Puzzles With Generative Adversarial Networks
Li, Ru
Liu, Shuaicheng
Wang, Guangfu
Liu, Guanghui
Zeng, Bing
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 513 - 524
[44] Generative Adversarial Networks Based on Transformer Encoder and Convolution Block for Hyperspectral Image Classification
Bai, Jing
Lu, Jiawei
Xiao, Zhu
Chen, Zheng
Jiao, Licheng
REMOTE SENSING, 2022, 14 (14)
[45] Generative Adversarial Network and Auto Encoder based Anomaly Detection in Distributed IoT Networks
Tian Zixu
Liyanage, Kushan Sudheera Kalupahana
Gurusamy, Mohan
2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
[46] CDE-GAN: Cooperative Dual Evolution-Based Generative Adversarial Network
Chen, Shiming
Wang, Wenjie
Xia, Beihao
You, Xinge
Peng, Qinmu
Cao, Zehong
Ding, Weiping
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2021, 25 (05) : 986 - 1000
[47] Semi-Supervised Representation Learning for Remote Sensing Image Classification Based on Generative Adversarial Networks
Yan, Peiyao
He, Feng
Yang, Yajie
Hu, Fei
IEEE ACCESS, 2020, 8 : 54135 - 54144
[48] Collaborative Learning of Generative Adversarial Networks
Tsukahara, Takuya
Hirakawa, Tsubasa
Yamashita, Takayoshi
Fujiyoshi, Hironobu
VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 5: VISAPP, 2021, : 492 - 499
[49] Learning compact graph representations via an encoder-decoder network
John Boaz Lee
Xiangnan Kong
Applied Network Science, 4
[50] Encoder-Decoder Based Attractors for End-to-End Neural Diarization
Horiguchi, Shota
Fujita, Yusuke
Watanabe, Shinji
Xue, Yawen
Garcia, Paola
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 1493 - 1507

← 1 2 3 4 5 →