Unbalanced Feature Transport for Exemplar-based Image Translation

被引:40
作者
Zhan, Fangneng [1 ,2 ]
Yu, Yingchen [1 ,2 ]
Cui, Kaiwen [1 ]
Zhang, Gongjie [1 ]
Lu, Shijian [1 ]
Pan, Jianxiong [2 ]
Zhang, Changgong [2 ]
Ma, Feiying [2 ]
Xie, Xuansong [2 ]
Miao, Chunyan [1 ]
机构
[1] Nanyang Technol Univ, Singapore, Singapore
[2] Alibaba Grp, DAMO Acad, Hangzhou, Peoples R China
来源
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年
关键词
D O I
10.1109/CVPR46437.2021.01478
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite the great success of GANs in images translation with different conditioned inputs such as semantic segmentation and edge maps, generating high-fidelity realistic images with reference styles remains a grand challenge in conditional image-to-image translation. This paper presents a general image translation framework that incorporates optimal transport for feature alignment between conditional inputs and style exemplars in image translation. The introduction of optimal transport mitigates the constraint of many-to-one feature matching significantly while building up accurate semantic correspondences between conditional inputs and exemplars. We design a novel unbalanced optimal transport to address the transport between features with deviational distributions which exists widely between conditional inputs and exemplars. In addition, we design a semantic-activation normalization scheme that injects style features of exemplars into the image translation process successfully. Extensive experiments over multiple image translation tasks show that our method achieves superior image translation qualitatively and quantitatively as compared with the state-of-the-art.
引用
收藏
页码:15023 / 15033
页数:11
相关论文
共 64 条
[1]  
[Anonymous], 2018, ARXIV180511145
[2]  
[Anonymous], 2019, PMLR
[3]  
[Anonymous], 2017, P IEEE C COMP VIS PA
[4]  
[Anonymous], 2017, ADV NEURAL INFORM PR, DOI DOI 10.1109/ICSGEA.2017.74
[5]   COCO-Stuff: Thing and Stuff Classes in Context [J].
Caesar, Holger ;
Uijlings, Jasper ;
Ferrari, Vittorio .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :1209-1218
[6]  
Chizat Lenaic, 2016, MATH COMPUTATION, V87
[7]   StarGAN v2: Diverse Image Synthesis for Multiple Domains [J].
Choi, Yunjey ;
Uh, Youngjung ;
Yoo, Jaejun ;
Ha, Jung-Woo .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :8185-8194
[8]   Optimal Transport for Domain Adaptation [J].
Courty, Nicolas ;
Flamary, Remi ;
Tuia, Devis ;
Rakotomamonjy, Alain .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (09) :1853-1865
[9]  
Cuturi M., 2013, ADV NEURAL INF PROCE, V26
[10]   Defect-GAN: High-Fidelity Defect Synthesis for Automated Defect Inspection [J].
Zhang, Gongjie ;
Cui, Kaiwen ;
Hung, Tzu-Yi ;
Lu, Shijian .
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, :2523-2533