Biphasic Face Photo-Sketch Synthesis via Semantic-Driven Generative Adversarial Network With Graph Representation Learning

被引:5
|
作者
Qi, Xingqun [1 ,2 ,3 ]
Sun, Muyi [4 ,5 ]
Wang, Zijian [6 ]
Liu, Jiaming [7 ]
Li, Qi [4 ]
Zhao, Fang [8 ]
Zhang, Shanghang [7 ]
Shan, Caifeng [8 ,9 ]
机构
[1] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
[2] Peking Univ, Sch Comp Sci, Beijing 100871, Peoples R China
[3] Hong Kong Univ Sci & Technol, Acad Interdisciplinary Studies, Hong Kong, Peoples R China
[4] Chinese Acad Sci, Inst Automat, NLPR, CRIPAC, Beijing 100190, Peoples R China
[5] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
[6] Univ Sydney, Sch Comp Sci, Sydney, NSW 2008, Australia
[7] Peking Univ, Sch Comp Sci, Natl Key Lab Multimedia Informat Proc, Beijing 100871, Peoples R China
[8] Nanjing Univ, Sch Intelligence Sci & Technol, Nanjing 210023, Peoples R China
[9] Shandong Univ Sci & Technol, Coll Elect Engn & Automat, Qingdao 266590, Peoples R China
关键词
Face photo-sketch synthesis; generative adversarial network; graph representation learning; intraclass and interclass; iterative cycle training (ICT);
D O I
10.1109/TNNLS.2023.3341246
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Biphasic face photo-sketch synthesis has significant practical value in wide-ranging fields such as digital entertainment and law enforcement. Previous approaches directly generate the photo-sketch in a global view, they always suffer from the low quality of sketches and complex photograph variations, leading to unnatural and low-fidelity results. In this article, we propose a novel semantic-driven generative adversarial network to address the above issues, cooperating with graph representation learning. Considering that human faces have distinct spatial structures, we first inject class-wise semantic layouts into the generator to provide style-based spatial information for synthesized face photographs and sketches. In addition, to enhance the authenticity of details in generated faces, we construct two types of representational graphs via semantic parsing maps upon input faces, dubbed the intraclass semantic graph (IASG) and the interclass structure graph (IRSG). Specifically, the IASG effectively models the intraclass semantic correlations of each facial semantic component, thus producing realistic facial details. To preserve the generated faces being more structure-coordinated, the IRSG models interclass structural relations among every facial component by graph representation learning. To further enhance the perceptual quality of synthesized images, we present a biphasic interactive cycle training strategy by fully taking advantage of the multilevel feature consistency between the photograph and sketch. Extensive experiments demonstrate that our method outperforms the state-of-the-art competitors on the CUHK Face Sketch (CUFS) and CUHK Face Sketch FERET (CUFSF) datasets.
引用
收藏
页码:1 / 14
页数:14
相关论文
共 42 条
  • [31] Face Sketch-Photo Synthesis Method Based on Multi-residual Dynamic Fusion Generative Adversarial Networks
    Sun R.
    Sun Q.
    Shan X.
    Zhang X.
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2022, 35 (03): : 207 - 222
  • [32] Cross-domain representation learning by domain-migration generative adversarial network for sketch based image retrieval
    Bai, Cong
    Chen, Jian
    Ma, Qing
    Hao, Pengyi
    Chen, Shengyong
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2020, 71 (71)
  • [33] AEGAN: Generating imperceptible face synthesis via autoencoder-based generative adversarial network
    Che, Aolin
    Yang, Jing-Hua
    Guo, Cai
    Dai, Hong-Ning
    Xie, Haoran
    Li, Ping
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2023, 34 (3-4)
  • [34] Text to photo-realistic image synthesis via chained deep recurrent generative adversarial network
    Wang, Min
    Lang, Congyan
    Feng, Songhe
    Wang, Tao
    Jin, Yi
    Li, Yidong
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 74
  • [35] Cascade heterogeneous face sketch-photo synthesis via dual-scale Markov Network
    Yao, Saisai
    Chen, Zhenxue
    Jia, Yunyi
    Liu, Chengyun
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2018, 30 (02) : 217 - 233
  • [36] TAXOGAN: Hierarchical Network Representation Learning via Taxonomy Guided Generative Adversarial Networks (Extended Abstract)
    Yang, Carl
    Zhang, Jieyu
    Han, Jiawei
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 4859 - 4863
  • [37] De-confounding representation learning for counterfactual inference on continuous treatment via generative adversarial network
    Zhao, Yonghe
    Huang, Qiang
    Zeng, Haolong
    Peng, Yun
    Sun, Huiyan
    DATA MINING AND KNOWLEDGE DISCOVERY, 2024, 38 (06) : 3783 - 3804
  • [38] BSD-GAN: Branched Generative Adversarial Network for Scale-Disentangled Representation Learning and Image Synthesis
    Yi, Zili
    Chen, Zhiqin
    Cai, Hao
    Mao, Wendong
    Gong, Minglun
    Zhang, Hao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (29) : 9073 - 9083
  • [39] Emotion-Preserving Representation Learning via Generative Adversarial Network for Multi-view Facial Expression Recognition
    Lai, Ying-Hsiu
    Lai, Shang-Hong
    PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, : 263 - 270
  • [40] Correlation via Synthesis: End-to-end Image Generation and Radiogenomic Learning Based on Generative Adversarial Network
    Xu, Ziyue
    Wang, Xiaosong
    Shin, Hoo-Chang
    Yang, Dong
    Roth, Holger
    Milletari, Fausto
    Zhang, Ling
    Xu, Daguang
    MEDICAL IMAGING WITH DEEP LEARNING, VOL 121, 2020, 121 : 857 - 866