Biphasic Face Photo-Sketch Synthesis via Semantic-Driven Generative Adversarial Network With Graph Representation Learning

被引:5
|
作者
Qi, Xingqun [1 ,2 ,3 ]
Sun, Muyi [4 ,5 ]
Wang, Zijian [6 ]
Liu, Jiaming [7 ]
Li, Qi [4 ]
Zhao, Fang [8 ]
Zhang, Shanghang [7 ]
Shan, Caifeng [8 ,9 ]
机构
[1] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
[2] Peking Univ, Sch Comp Sci, Beijing 100871, Peoples R China
[3] Hong Kong Univ Sci & Technol, Acad Interdisciplinary Studies, Hong Kong, Peoples R China
[4] Chinese Acad Sci, Inst Automat, NLPR, CRIPAC, Beijing 100190, Peoples R China
[5] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
[6] Univ Sydney, Sch Comp Sci, Sydney, NSW 2008, Australia
[7] Peking Univ, Sch Comp Sci, Natl Key Lab Multimedia Informat Proc, Beijing 100871, Peoples R China
[8] Nanjing Univ, Sch Intelligence Sci & Technol, Nanjing 210023, Peoples R China
[9] Shandong Univ Sci & Technol, Coll Elect Engn & Automat, Qingdao 266590, Peoples R China
关键词
Face photo-sketch synthesis; generative adversarial network; graph representation learning; intraclass and interclass; iterative cycle training (ICT);
D O I
10.1109/TNNLS.2023.3341246
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Biphasic face photo-sketch synthesis has significant practical value in wide-ranging fields such as digital entertainment and law enforcement. Previous approaches directly generate the photo-sketch in a global view, they always suffer from the low quality of sketches and complex photograph variations, leading to unnatural and low-fidelity results. In this article, we propose a novel semantic-driven generative adversarial network to address the above issues, cooperating with graph representation learning. Considering that human faces have distinct spatial structures, we first inject class-wise semantic layouts into the generator to provide style-based spatial information for synthesized face photographs and sketches. In addition, to enhance the authenticity of details in generated faces, we construct two types of representational graphs via semantic parsing maps upon input faces, dubbed the intraclass semantic graph (IASG) and the interclass structure graph (IRSG). Specifically, the IASG effectively models the intraclass semantic correlations of each facial semantic component, thus producing realistic facial details. To preserve the generated faces being more structure-coordinated, the IRSG models interclass structural relations among every facial component by graph representation learning. To further enhance the perceptual quality of synthesized images, we present a biphasic interactive cycle training strategy by fully taking advantage of the multilevel feature consistency between the photograph and sketch. Extensive experiments demonstrate that our method outperforms the state-of-the-art competitors on the CUHK Face Sketch (CUFS) and CUHK Face Sketch FERET (CUFSF) datasets.
引用
收藏
页码:1 / 14
页数:14
相关论文
共 42 条
  • [1] Face Sketch Synthesis via Semantic-Driven Generative Adversarial Network
    Qi, Xingqun
    Sun, Muyi
    Wang, Weining
    Dong, Xiaoxiao
    Li, Qi
    Shan, Caifeng
    2021 INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS (IJCB 2021), 2021,
  • [2] Feature Encoder Guided Generative Adversarial Network for Face Photo-Sketch Synthesis
    Zheng, Jieying
    Song, Wanru
    Wu, Yahong
    Xu, Ran
    Liu, Feng
    IEEE ACCESS, 2019, 7 : 154971 - 154985
  • [3] IsGAN: Identity-sensitive generative adversarial network for face photo-sketch synthesis
    Yan, Lan
    Zheng, Wenbo
    Gou, Chao
    Wang, Fei-Yue
    PATTERN RECOGNITION, 2021, 119
  • [4] A Sketch-Transformer Network for Face Photo-Sketch Synthesis
    Zhu, Mingrui
    Liang, Changcheng
    Wang, Nannan
    Wang, Xiaoyu
    Li, Zhifeng
    Gao, Xinbo
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 1352 - 1358
  • [5] FACE PHOTO SYNTHESIS VIA INTERMEDIATE SEMANTIC ENHANCEMENT GENERATIVE ADVERSARIAL NETWORK
    Li, Haoxian
    Zheng, Jieying
    Liu, Feng
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 96 - 100
  • [6] Face Photo-Sketch Synthesis via Knowledge Transfer
    Zhu, Mingrui
    Wang, Nannan
    Gao, Xinbo
    Li, Jie
    Li, Zhifeng
    PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 1048 - 1054
  • [7] Dual Conditional Normalization Pyramid Network for Face Photo-Sketch Synthesis
    Zhu, Mingrui
    Wu, Zicheng
    Wang, Nannan
    Yang, Heng
    Gao, Xinbo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (09) : 5200 - 5211
  • [8] Face Photo-Sketch Recognition Using Bidirectional Collaborative Synthesis Network
    Bae, Seho
    Din, Nizam Ud
    Park, Hyunkyu
    Yi, Juneho
    PROCEEDINGS OF THE 2022 16TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM 2022), 2022,
  • [9] Face photo-sketch synthesis via intra-domain enhancement
    Peng, Chunlei
    Zhang, Congyu
    Liu, Decheng
    Wang, Nannan
    Gao, Xinbo
    KNOWLEDGE-BASED SYSTEMS, 2023, 259
  • [10] Smoothness-Constrained Face Photo-Sketch Synthesis Using Sparse Representation
    Chang, Liang
    Deng, Xiaoming
    Zhou, Mingquan
    Duan, Fuqing
    Wu, Zhongke
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 3025 - 3029