iFlowGAN: An Invertible Flow-Based Generative Adversarial Network for Unsupervised Image-to-Image Translation

被引：17

作者：

Dai, Longquan ^{[1
]}

Tang, Jinhui ^{[1
]}

机构：

[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Jiangsu, Peoples R China

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2022年 / 44卷 / 08期

基金：

中国国家自然科学基金;

关键词：

Flow; bijection; unsupervised image-to-image translation; banach fixed point theorem;

D O I：

10.1109/TPAMI.2021.3062849

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose iFlowGAN that learns an invertible flow (a sequence of invertible mappings) via adversarial learning and exploit it to transform a source distribution into a target distribution for unsupervised image-to-image translation. Existing GAN-based generative model such as CycleGAN [1], StarGAN [2], AGGAN [3] and CyCADA [4] needs to learn a highly under-constraint forward mapping F : X -> Y from a source domain X to a target domain Y. Researchers do this by assuming there is a backward mapping B : Y -> X such that x and y are fixed points of the composite functions B omicron F and F omicron B. Inspired by zero-order reverse filtering [5], we (1) understand F via contraction mappings on a metric space; (2) provide a simple yet effective algorithm to present B via the parameters of F in light of Banach fixed point theorem; (3) provide a Lipschitz-regularized network which indicates a general approach to compose the inverse for arbitrary Lipschitz-regularized networks via Banach fixed point theorem. This network is useful for image-to-image translation tasks because it could save the memory for the weights of B. Although memory can also be saved by directly coupling the weights of the forward and backward mappings, the performance of the image-to-image translation network degrades significantly. This explains why current GAN-based generative models including CycleGAN must take different parameters to compose the forward and backward mappings instead of employing the same weights to build both mappings. Taking advantage of the Lipschitz-regularized network, we not only build iFlowGAN to solve the redundancy shortcoming of CycleGAN but also assemble the corresponding iFlowGAN versions of StarGAN, AGGAN and CyCADA without breaking their network architectures. Extensive experiments show that the iFlowGAN version could produce comparable results of the original implementation while saving half parameters.

引用

页码：4151 / 4162

页数：12

共 58 条

[41]

Perarnau G., 2016, ARXIV 161106355

[42]

Rezende DJ, 2015, PR MACH LEARN RES, V37, P1530

[43]

Rumelhart G. E., 1986, Parallel distributed processing: Explorations in the microstructure of cognition, V1, P318

[44] Fully Convolutional Networks for Semantic Segmentation [J].

Shelhamer, Evan ;

Long, Jonathan ;

Darrell, Trevor .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (04) :640-651

[45] Zero-order Reverse Filtering [J].

Tao, Xin ;

Zhou, Chao ;

Shen, Xiaoyong ;

Wang, Jue ;

Jia, Jiaya .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :222-230

[46]

Tsuzuku Y., 2018, ARXIV 180204034

[47]

Tylecek R, 2013, LECT NOTES COMPUT SC, V8142, P364, DOI 10.1007/978-3-642-40602-7_39

[48]

van den Berg R., 2018, ARXIV 180305649

[49] Reversible GANs for Memory-efficient Image-to-Image Translation [J].

van der Ouderaa, Tycho F. A. ;

Worrall, Daniel E. .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4715-4723

[50]

Vondrick C, 2016, 30 C NEURAL INFORM P, V29

← 1 2 3 4 5 6 →