Image generation with self pixel-wise normalization

被引:3
作者
Yeo, Yoon-Jae [1 ]
Sagong, Min-Cheol [1 ]
Park, Seung [2 ]
Ko, Sung-Jea [1 ]
Shin, Yong-Goo [3 ]
机构
[1] Korea Univ, Sch Elect Engn Dept, Seoul 136713, South Korea
[2] Chungbuk Natl Univ Hosp, Biomed Engn, 776 Seowon Gu, Cheongju, Chungcheongbuk, South Korea
[3] Hannam Univ, Dept Artificial Intelligence, Daejeon 34430, South Korea
基金
新加坡国家研究基金会;
关键词
Generative adversarial networks; Image generation; Normalization; Region-adaptive normalization;
D O I
10.1007/s10489-022-04007-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Region-adaptive normalization (RAN) methods have been widely used in the generative adversarial network (GAN)-based image-to-image translation technique. However, since these approaches need a mask image to infer pixel-wise affine transformation parameters, they are not applicable to general image generation models having no paired mask images. To resolve this problem, this paper presents a novel normalization method, called self pixel-wise normalization (SPN), which effectively boosts the generative performance by carrying out the pixel-adaptive affine transformation without an external guidance map. In our method, the transforming parameters are derived from a self-latent mask that divides the feature map into foreground and background regions. The visualization of the self-latent masks shows that SPN effectively captures a single object to be generated as the foreground. Since the proposed method produces the self-latent mask without external data, it is easily adaptable to existing generative models. Extensive experiments on various datasets reveal that our SPN significantly improves the performance of image generation technique in terms of Frechet inception distance (FID) and Inception score (IS).
引用
收藏
页码:9409 / 9423
页数:15
相关论文
共 49 条
  • [1] Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
  • [2] Arjovsky M, 2017, PR MACH LEARN RES, V70
  • [3] Ba J, 2016, ARXIV160706450
  • [4] Brock Andrew, 2018, P INT C LEARN REPR
  • [5] StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation
    Choi, Yunjey
    Choi, Minje
    Kim, Munyoung
    Ha, Jung-Woo
    Kim, Sunghun
    Choo, Jaegul
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 8789 - 8797
  • [6] Chollet F, 2015, KERAS
  • [7] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
  • [8] Dumoulin V., 2017, INT C LEARN REPR
  • [9] Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672
  • [10] Hensel M, 2017, ADV NEUR IN, V30