Adversarial Generation of Continuous Images

Cited by: 72

Authors:

Skorokhodov, Ivan [1]

Ignatyev, Savva [2]

Elhoseiny, Mohamed [1]

Affiliations:

[1] King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia

[2] Skolkovo Institute of Science and Technology, Moscow, Russia

Source:

2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2021 | 2021

Keywords:

DOI:

10.1109/CVPR46437.2021.01061

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In most existing learning systems, images are typically viewed as 2D pixel arrays. However, in another paradigm gaining popularity, a 2D image is represented as an implicit neural representation (INR): an MLP that predicts an RGB pixel value given its (x, y) coordinate. In this paper, we propose two novel architectural techniques for building INR-based image decoders, factorized multiplicative modulation and multi-scale INRs, and use them to build a state-of-the-art continuous image GAN. Previous attempts to adapt INRs for image generation were limited to MNIST-like datasets and did not scale to complex real-world data. Our proposed INR-GAN architecture improves the performance of continuous image generators by several times, greatly reducing the gap between continuous image GANs and pixel-based ones. Apart from that, we explore several exciting properties of INR-based decoders, such as out-of-the-box super-resolution, meaningful image-space interpolation, accelerated inference of low-resolution images, an ability to extrapolate outside of image boundaries, and a strong geometric prior. The project page is located at https://universome.github.io/inr-gan.
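
To make the INR idea in the abstract concrete, the following is a minimal illustrative sketch, not the INR-GAN architecture itself. It assumes NumPy, an untrained MLP with random weights, ReLU hidden layers, a sigmoid output, and coordinates normalized to the unit square; the helper names (`make_grid`, `init_inr`, `render`) and the layer sizes are hypothetical. It only shows that one set of weights can be sampled at any resolution, or at coordinates outside the original image boundaries.

```python
import numpy as np

def make_grid(height, width, lo=0.0, hi=1.0):
    """Return a (height*width, 2) array of (x, y) coordinates in [lo, hi]."""
    ys = np.linspace(lo, hi, height)
    xs = np.linspace(lo, hi, width)
    xx, yy = np.meshgrid(xs, ys)  # each has shape (height, width)
    return np.stack([xx.ravel(), yy.ravel()], axis=1)

def init_inr(rng, hidden=64):
    """Random weights for a 2 -> hidden -> hidden -> 3 MLP (assumed sizes)."""
    sizes = [(2, hidden), (hidden, hidden), (hidden, 3)]
    return [
        (rng.normal(0.0, 1.0 / np.sqrt(n_in), (n_in, n_out)), np.zeros(n_out))
        for n_in, n_out in sizes
    ]

def render(params, height, width, lo=0.0, hi=1.0):
    """Evaluate the MLP at every grid coordinate; returns an (H, W, 3) image."""
    h = make_grid(height, width, lo, hi)
    for w, b in params[:-1]:
        h = np.maximum(h @ w + b, 0.0)        # ReLU hidden layers (assumption)
    w, b = params[-1]
    rgb = 1.0 / (1.0 + np.exp(-(h @ w + b)))  # sigmoid maps outputs to RGB range
    return rgb.reshape(height, width, 3)

rng = np.random.default_rng(0)
params = init_inr(rng)

low = render(params, 32, 32)                    # coarse sampling
high = render(params, 128, 128)                 # same weights, denser sampling
wide = render(params, 32, 32, lo=-0.5, hi=1.5)  # coordinates beyond the unit square
```

Because the image is a function of continuous coordinates, `low` and `high` are two samplings of the same underlying signal, and `wide` queries that function outside the region the grid normally covers.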

Citation

Pages: 10748-10759

Page count: 12

93 references in total

