Few-Shot Face Sketch-to-Photo Synthesis via Global-Local Asymmetric Image-to-Image Translation

被引:1
|
作者
Li, Yongkang [1 ,2 ]
Liang, Qifan [1 ,2 ]
Han, Zhen [1 ,2 ]
Mai, Wenjun [1 ,2 ]
Wang, Zhongyuan [1 ,2 ]
机构
[1] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Wuhan, Peoples R China
[2] Wuhan Univ, Sch Comp Sci, Hubei Key Lab Multimedia & Network Commun Engn, Wuhan, Peoples R China
基金
中国国家自然科学基金;
关键词
Face sketch-to-photo synthesis; image-to-image translation; global-local face fusion; MODEL;
D O I
10.1145/3672400
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Face sketch-to-photo synthesis is widely used in law enforcement and digital entertainment, which can be achieved by Image-to-Image (I2I) translation. Traditional I2I translation algorithms usually regard the bidirectional translation of two image domains as two symmetric processes, so the two translation networks adopt the same structure. However, due to the scarcity of face sketches and the abundance of face photos, the sketch-to-photo and photo-to-sketch processes are asymmetric. Considering this issue, we propose a few-shot face sketch-to-photo synthesis model based on asymmetric I2I translation, where the sketch-to-photo process uses a feature-embedded generating network, while the photo-to-sketch process uses a style transfer network. On this basis, a three-stage asymmetric training strategy with style transfer as the trigger is proposed to optimize the proposed model by utilizing the advantage that the style transfer network only needs few-shot face sketches for training. Additionally, we discover that stylistic differences between the global and local sketch faces lead to inconsistencies between the global and local sketch-to-photo processes. Thus, a dual branch of the global face and local face is adopted in the sketch-to-photo synthesis model to learn the specific transformation processes for global structure and local details. Finally, the high-quality synthetic face photo can be generated through the global-local face fusion sub-network. Extensive experimental results demonstrate that the proposed Global-Local Asymmetric (GLAS) I2I translation algorithm compared to SOTA methods, at least improves FSIM by 0.0126, and reduces LPIPS (alex), LPIPS (squeeze), and LPIPS (vgg) by 0.0610, 0.0883, and 0.0719, respectively.
引用
收藏
页数:24
相关论文
共 50 条
  • [41] Few-shot Image Generation via Cross-domain Correspondence
    Ojha, Utkarsh
    Li, Yijun
    Lu, Jingwan
    Efros, Alexei A.
    Lee, Yong Jae
    Shechtman, Eli
    Zhang, Richard
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 10738 - 10747
  • [42] WeditGAN: Few-Shot Image Generation via Latent Space Relocation
    Duan, Yuxuan
    Niu, Li
    Hong, Yan
    Zhang, Liqing
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 2, 2024, : 1653 - 1661
  • [43] Few-Shot Image Generation via Style Adaptation and Content Preservation
    He, Xiaosheng
    Yang, Fan
    Liu, Fayao
    Lin, Guosheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
  • [44] Few-Shot Object Detection of Remote Sensing Image via Calibration
    Li, Ruolei
    Zeng, Yilong
    Wu, Jianfeng
    Wang, Yongli
    Zhang, Xiaoli
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [45] Global- and local-aware feature augmentation with semantic orthogonality for few-shot image classification
    Shi, Boyao
    Li, Wenbin
    Huo, Jing
    Zhu, Pengfei
    Wang, Lei
    Gao, Yang
    PATTERN RECOGNITION, 2023, 142
  • [46] Global-Local Query-Support Cross-Attention for Few-Shot Semantic Segmentation
    Xie, Fengxi
    Liang, Guozhen
    Chien, Ying-Ren
    MATHEMATICS, 2024, 12 (18)
  • [47] GLGAT-CFSL: Global-Local Graph Attention Network-Based Cross-Domain Few-Shot Learning for Hyperspectral Image Classification
    Ding, Chen
    Deng, Zhicong
    Xu, Yaoyang
    Zheng, Mengmeng
    Zhang, Lei
    Cao, Yu
    Wei, Wei
    Zhang, Yanning
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 1
  • [48] Task-Aware Few-Shot Image Generation via Dynamic Local Distribution Estimation and Sampling
    Gu, Zheng
    Li, Wenbin
    Ding, Tianyu
    Wang, Zhengli
    Huo, Jing
    Huang, Kuihua
    Gao, Yang
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT II, 2025, 15032 : 462 - 476
  • [49] KLSANet: Key local semantic alignment Network for few-shot image classification
    Sun, Zhe
    Zheng, Wang
    Guo, Pengfei
    NEURAL NETWORKS, 2024, 178
  • [50] Advanced Global Prototypical Segmentation Framework for Few-Shot Hyperspectral Image Classification
    Xia, Kunming
    Yuan, Guowu
    Xia, Mengen
    Li, Xiaosen
    Gui, Jinkang
    Zhou, Hao
    SENSORS, 2024, 24 (16)