Semantic Image Synthesis with Spatially-Adaptive Normalization

被引:1693
作者
Park, Taesung [1 ]
Liu, Ming-Yu [2 ]
Wang, Ting-Chun [2 ]
Zhu, Jun-Yan [2 ,3 ]
机构
[1] Univ Calif Berkeley, Berkeley, CA 94720 USA
[2] NVIDIA, Santa Clara, CA USA
[3] MIT CSAIL, Cambridge, MA USA
来源
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年
关键词
D O I
10.1109/CVPR.2019.00244
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose spatially-adaptive normalization, a simple but effective layer for synthesizing photorealistic images given an input semantic layout. Previous methods directly feed the semantic layout as input to the deep network, which is then processed through stacks of convolution, normalization, and nonlinearity layers. We show that this is suboptimal as the normalization layers tend to "wash away" semantic information. To address the issue, we propose using the input layout. for modulating the activations in normalization layers through a spatially-adaptive,learned transformation. Experiments on several challenging datasets demonstrate the advantage of the proposed method over existing approaches, regarding both visual fidelity and align-ment with input layouts. Finally, our model allows user control over both semantic and style as synthesizing images.
引用
收藏
页码:2332 / 2341
页数:10
相关论文
共 56 条
  • [1] [Anonymous], INT C MACH LEARN ICM
  • [2] [Anonymous], 2009, ACM SIGGRAPH
  • [3] [Anonymous], 2015, PROC INT C MACH LEAR
  • [4] [Anonymous], 2017, IEEE INT C COMP VIS
  • [5] [Anonymous], 2018, IEEE C COMP VIS PATT
  • [6] [Anonymous], INT C MACH LEARN ICM
  • [7] [Anonymous], 2019, P INT C LEARN REPR I
  • [8] [Anonymous], 2018, INT C MACH LEARN ICM
  • [9] [Anonymous], ACM SIGGRAPH
  • [10] [Anonymous], 2017, INT C MACH LEARN ICM