Few-Shot Face Sketch-to-Photo Synthesis via Global-Local Asymmetric Image-to-Image Translation

Cited by: 1

Authors:

Li, Yongkang [1,2]

Liang, Qifan [1,2]

Han, Zhen [1,2]

Mai, Wenjun [1,2]

Wang, Zhongyuan [1,2]

Affiliations:

[1] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Wuhan, Peoples R China

[2] Wuhan Univ, Sch Comp Sci, Hubei Key Lab Multimedia & Network Commun Engn, Wuhan, Peoples R China

Source:

ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS | 2024, Vol. 20, No. 10

Funding:

National Natural Science Foundation of China

Keywords:

Face sketch-to-photo synthesis; image-to-image translation; global-local face fusion; MODEL;

DOI:

10.1145/3672400

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Face sketch-to-photo synthesis, widely used in law enforcement and digital entertainment, can be achieved by Image-to-Image (I2I) translation. Traditional I2I translation algorithms usually treat the bidirectional translation between two image domains as two symmetric processes, so the two translation networks adopt the same structure. However, because face sketches are scarce and face photos are abundant, the sketch-to-photo and photo-to-sketch processes are asymmetric. To address this issue, we propose a few-shot face sketch-to-photo synthesis model based on asymmetric I2I translation, in which the sketch-to-photo process uses a feature-embedded generating network while the photo-to-sketch process uses a style transfer network. On this basis, we propose a three-stage asymmetric training strategy, triggered by style transfer, that optimizes the model by exploiting the fact that the style transfer network needs only a few face sketches for training. Additionally, we find that stylistic differences between global and local sketch faces lead to inconsistencies between the global and local sketch-to-photo processes. The sketch-to-photo synthesis model therefore adopts two branches, one for the global face and one for the local face, to learn the specific transformations for global structure and local details. Finally, a global-local face fusion sub-network generates the high-quality synthetic face photo. Extensive experimental results demonstrate that, compared to SOTA methods, the proposed Global-Local Asymmetric (GLAS) I2I translation algorithm achieves, at a minimum, an FSIM improvement of 0.0126 and reductions in LPIPS (alex), LPIPS (squeeze), and LPIPS (vgg) of 0.0610, 0.0883, and 0.0719, respectively.
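The abstract does not describe the internals of the global-local face fusion sub-network. As a rough illustration of the general idea of merging a local-branch output into a global-branch output, the sketch below uses a fixed feathered alpha blend. This is a simple stand-in chosen for illustration, not the learned fusion the authors propose; the function name `blend_local_into_global`, its parameters, and the use of 2-D lists of grayscale floats are all assumptions.

```python
def blend_local_into_global(global_img, local_img, top, left, feather=1):
    """Alpha-blend a local patch into a copy of a global image.

    Hypothetical helper: images are 2-D lists of floats (grayscale).
    The blend weight ramps up from the patch border toward its interior,
    so the seam between the two outputs is softened.
    """
    h, w = len(local_img), len(local_img[0])
    fused = [row[:] for row in global_img]  # leave the input untouched
    for i in range(h):
        for j in range(w):
            # Distance (in pixels) from this pixel to the nearest patch border.
            d = min(i, j, h - 1 - i, w - 1 - j)
            # Weight of the local patch: below 1 near the border, 1 inside.
            alpha = min(1.0, (d + 1) / (feather + 1))
            g = fused[top + i][left + j]
            fused[top + i][left + j] = alpha * local_img[i][j] + (1 - alpha) * g
    return fused


# Toy usage: a 4x4 "local face" patch merged into a 6x6 "global face" image.
global_out = [[0.0] * 6 for _ in range(6)]
local_out = [[1.0] * 4 for _ in range(4)]
fused = blend_local_into_global(global_out, local_out, top=1, left=1)
```

In this toy case, pixels outside the patch keep the global value, border pixels of the patch are an even mix of both outputs, and interior pixels take the local value. A learned fusion network would replace the fixed `alpha` with weights predicted from the two inputs.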


Pages: 24
