Biphasic Face Photo-Sketch Synthesis via Semantic-Driven Generative Adversarial Network With Graph Representation Learning

被引：5

作者：

Qi, Xingqun ^{[1
,2
,3
]}

Sun, Muyi ^{[4
,5
]}

Wang, Zijian ^{[6
]}

Liu, Jiaming ^{[7
]}

Li, Qi ^{[4
]}

Zhao, Fang ^{[8
]}

Zhang, Shanghang ^{[7
]}

Shan, Caifeng ^{[8
,9
]}

机构：

[1] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China

[2] Peking Univ, Sch Comp Sci, Beijing 100871, Peoples R China

[3] Hong Kong Univ Sci & Technol, Acad Interdisciplinary Studies, Hong Kong, Peoples R China

[4] Chinese Acad Sci, Inst Automat, NLPR, CRIPAC, Beijing 100190, Peoples R China

[5] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China

[6] Univ Sydney, Sch Comp Sci, Sydney, NSW 2008, Australia

[7] Peking Univ, Sch Comp Sci, Natl Key Lab Multimedia Informat Proc, Beijing 100871, Peoples R China

[8] Nanjing Univ, Sch Intelligence Sci & Technol, Nanjing 210023, Peoples R China

[9] Shandong Univ Sci & Technol, Coll Elect Engn & Automat, Qingdao 266590, Peoples R China

来源：

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2023年

关键词：

Face photo-sketch synthesis; generative adversarial network; graph representation learning; intraclass and interclass; iterative cycle training (ICT);

D O I：

10.1109/TNNLS.2023.3341246

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Biphasic face photo-sketch synthesis has significant practical value in wide-ranging fields such as digital entertainment and law enforcement. Previous approaches directly generate the photo-sketch in a global view, they always suffer from the low quality of sketches and complex photograph variations, leading to unnatural and low-fidelity results. In this article, we propose a novel semantic-driven generative adversarial network to address the above issues, cooperating with graph representation learning. Considering that human faces have distinct spatial structures, we first inject class-wise semantic layouts into the generator to provide style-based spatial information for synthesized face photographs and sketches. In addition, to enhance the authenticity of details in generated faces, we construct two types of representational graphs via semantic parsing maps upon input faces, dubbed the intraclass semantic graph (IASG) and the interclass structure graph (IRSG). Specifically, the IASG effectively models the intraclass semantic correlations of each facial semantic component, thus producing realistic facial details. To preserve the generated faces being more structure-coordinated, the IRSG models interclass structural relations among every facial component by graph representation learning. To further enhance the perceptual quality of synthesized images, we present a biphasic interactive cycle training strategy by fully taking advantage of the multilevel feature consistency between the photograph and sketch. Extensive experiments demonstrate that our method outperforms the state-of-the-art competitors on the CUHK Face Sketch (CUFS) and CUHK Face Sketch FERET (CUFSF) datasets.

引用

页码：1 / 14

页数：14

共 42 条

[31] Face Sketch-Photo Synthesis Method Based on Multi-residual Dynamic Fusion Generative Adversarial Networks
Sun R.
Sun Q.
Shan X.
Zhang X.
Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2022, 35 (03): : 207 - 222
[32] Cross-domain representation learning by domain-migration generative adversarial network for sketch based image retrieval
Bai, Cong
Chen, Jian
Ma, Qing
Hao, Pengyi
Chen, Shengyong
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2020, 71 (71)
[33] AEGAN: Generating imperceptible face synthesis via autoencoder-based generative adversarial network
Che, Aolin
Yang, Jing-Hua
Guo, Cai
Dai, Hong-Ning
Xie, Haoran
Li, Ping
COMPUTER ANIMATION AND VIRTUAL WORLDS, 2023, 34 (3-4)
[34] Text to photo-realistic image synthesis via chained deep recurrent generative adversarial network
Wang, Min
Lang, Congyan
Feng, Songhe
Wang, Tao
Jin, Yi
Li, Yidong
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 74
[35] Cascade heterogeneous face sketch-photo synthesis via dual-scale Markov Network
Yao, Saisai
Chen, Zhenxue
Jia, Yunyi
Liu, Chengyun
JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2018, 30 (02) : 217 - 233
[36] TAXOGAN: Hierarchical Network Representation Learning via Taxonomy Guided Generative Adversarial Networks (Extended Abstract)
Yang, Carl
Zhang, Jieyu
Han, Jiawei
PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 4859 - 4863
[37] De-confounding representation learning for counterfactual inference on continuous treatment via generative adversarial network
Zhao, Yonghe
Huang, Qiang
Zeng, Haolong
Peng, Yun
Sun, Huiyan
DATA MINING AND KNOWLEDGE DISCOVERY, 2024, 38 (06) : 3783 - 3804
[38] BSD-GAN: Branched Generative Adversarial Network for Scale-Disentangled Representation Learning and Image Synthesis
Yi, Zili
Chen, Zhiqin
Cai, Hao
Mao, Wendong
Gong, Minglun
Zhang, Hao
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (29) : 9073 - 9083
[39] Emotion-Preserving Representation Learning via Generative Adversarial Network for Multi-view Facial Expression Recognition
Lai, Ying-Hsiu
Lai, Shang-Hong
PROCEEDINGS 2018 13TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2018), 2018, : 263 - 270
[40] Correlation via Synthesis: End-to-end Image Generation and Radiogenomic Learning Based on Generative Adversarial Network
Xu, Ziyue
Wang, Xiaosong
Shin, Hoo-Chang
Yang, Dong
Roth, Holger
Milletari, Fausto
Zhang, Ling
Xu, Daguang
MEDICAL IMAGING WITH DEEP LEARNING, VOL 121, 2020, 121 : 857 - 866

← 1 2 3 4 5 →