Dual Encoder-Decoder Based Generative Adversarial Networks for Disentangled Facial Representation Learning

Cited by: 8
Authors
Hu, Cong [1, 2, 3]
Feng, Zhenhua [4, 5]
Wu, Xiaojun [1, 2]
Kittler, Josef [5]
Affiliations
[1] Jiangnan Univ, Sch Artificial Intelligence & Comp Sci, Wuxi 214122, Jiangsu, Peoples R China
[2] Jiangnan Univ, Jiangsu Prov Engn Lab Pattern Recognit & Computat, Wuxi 214122, Jiangsu, Peoples R China
[3] Minjiang Univ, Fujian Prov Key Lab Informat Proc & Intelligent C, Fuzhou 350121, Peoples R China
[4] Univ Surrey, Dept Comp Sci, Guildford GU2 7XH, Surrey, England
[5] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford GU2 7XH, Surrey, England
Funding
Engineering and Physical Sciences Research Council (UK); National Natural Science Foundation of China
Keywords
Face; Gallium nitride; Generative adversarial networks; Training; Generators; Face recognition; Task analysis; Disentangled representation learning; encoder-decoder; face synthesis; pose invariant face recognition
DOI
10.1109/ACCESS.2020.3009512
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
To learn disentangled representations of facial images, we present a Dual Encoder-Decoder based Generative Adversarial Network (DED-GAN). In the proposed method, both the generator and the discriminator use deep encoder-decoder architectures as their backbones. Specifically, the encoder-decoder generator learns a pose-disentangled face representation, while the encoder-decoder discriminator is tasked with classifying images as real or fake, reconstructing faces, determining identity, and estimating face pose. We further improve the proposed network architecture by minimizing an additional pixel-wise loss, defined by the Wasserstein distance at the output of the discriminator, so that the adversarial framework can be trained more effectively. Additionally, we treat face pose variation as continuous, rather than discrete as in the existing literature, to inject richer pose information into our model. The pose estimation task is formulated as a regression problem, which helps to disentangle identity information from pose variations. The proposed network is evaluated on the tasks of pose-invariant face recognition (PIFR) and face synthesis across poses. An extensive quantitative and qualitative evaluation on several controlled and in-the-wild benchmark datasets demonstrates the superiority of the proposed DED-GAN method over state-of-the-art approaches.
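As a rough illustration of the multi-task discriminator described in the abstract, the sketch below combines four terms (a Wasserstein-style critic term, a pixel-wise reconstruction term, an identity term, and a continuous pose regression term) into one scalar. This is only a minimal sketch under stated assumptions: the function names, the L1 and cross-entropy forms, the squared-error pose loss, and the equal default weights are all hypothetical and are not taken from the paper.

```python
import math


def l1_pixel_loss(image, reconstruction):
    """Mean absolute per-pixel difference between two flat pixel lists."""
    return sum(abs(a - b) for a, b in zip(image, reconstruction)) / len(image)


def identity_cross_entropy(logits, true_id):
    """Softmax cross-entropy for a hypothetical identity head."""
    m = max(logits)  # subtract the max for numerical stability
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[true_id]


def pose_regression_loss(predicted_yaw, true_yaw):
    """Squared error on a continuous pose angle (regression, not a class label)."""
    return (predicted_yaw - true_yaw) ** 2


def discriminator_objective(critic_real, critic_fake, image, reconstruction,
                            id_logits, true_id, predicted_yaw, true_yaw,
                            weights=(1.0, 1.0, 1.0, 1.0)):
    """Weighted sum of the four task losses; the weights are assumed values."""
    w_adv, w_rec, w_id, w_pose = weights
    # Wasserstein-style critic term: score real samples high, fake samples low.
    adversarial = critic_fake - critic_real
    return (w_adv * adversarial
            + w_rec * l1_pixel_loss(image, reconstruction)
            + w_id * identity_cross_entropy(id_logits, true_id)
            + w_pose * pose_regression_loss(predicted_yaw, true_yaw))


if __name__ == "__main__":
    total = discriminator_objective(
        critic_real=1.0, critic_fake=0.0,
        image=[0.0, 1.0], reconstruction=[0.5, 0.5],
        id_logits=[0.0, 0.0], true_id=0,
        predicted_yaw=30.0, true_yaw=25.0,
    )
    print(total)
```

Treating pose as a real-valued target, as in `pose_regression_loss`, is what distinguishes the regression formulation from a discrete pose-class label; in practice the relative weights would need tuning.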
Pages: 130159-130171
Number of pages: 13