SA-SinGAN: self-attention for single-image generation adversarial networks

被引:0
作者
Xi Chen
Hongdong Zhao
Dongxu Yang
Yueyuan Li
Qing Kang
Haiyan Lu
机构
[1] Hebei University of Technology,School of Electronic and Information Engineering
[2] Northwest Normal University,School of Physics and Electronic Engineering
来源
Machine Vision and Applications | 2021年 / 32卷
关键词
Single image; GAN; Self-attention; Spectral normalization; Application;
D O I
暂无
中图分类号
学科分类号
摘要
Single-image training is a research hotspot task of generating adversarial networks, especially in tasks such as image editing and image coordination. However, the existing network has a series of problems such as a long training time, poor image quality, and an unstable training model. Based on the research hot issues, we propose a single-image generation adversarial network of the self-attention mechanism and discuss the changes of the model when the self-attention mechanism is placed in different positions of the generator. We introduced the spectral normalization in the generator and discriminator networks to stabilize the training process and compared the influence of the learning rate on the network. We used artificial vision and model evaluation methods to test the performance of the model on three representative datasets and compared with the current more advanced models. Experiments show that our proposed model has better performance than single-sample generative adversarial networks, reducing Single Image Fréchet Inception Distance (SIFID) from 4.80 to 2.057 on the challenging Generation datasets, reducing SIFID from 0.06 to 0.02 on the Places datasets, and reducing SIFID from 0.23 to 0.04 on the LSUN datasets. The training time of our model is one-ninth of the single-sample generation adversarial network, which can obtain the overall structure of the single training sample, which has great research significance.
引用
收藏
相关论文
共 107 条
  • [1] Goodfellow I(2020)Generative adversarial networks Commun. ACM. 63 139-144
  • [2] Pouget-Abadie J(2021)Fuzzy fault detection for Markov jump systems with partly accessible hidden information: an event-triggered approach IEEE Trans. Cybernet. 103 1733-1755
  • [3] Mirza M(2021)Input-to-state stability of impulsive reaction–diffusion neural networks with infinite distributed delays Nonlinear Dyn. 32 671-692
  • [4] Xu B(2021)Robust PD-type iterative learning control for discrete systems with multiple time-delays subjected to polytopic uncertainty and restricted frequency-domain Multidim. Syst. Sign Process. 32 17587-17600
  • [5] Warde-Farley D(2019)InGAN: capturing and retargeting the “DNA” of a natural image IEEE Comput. Soc. 31 2126-2140
  • [6] Ozair S(2017)Photo-realistic single image super-resolution using a generative adversarial network IEEE Comput. Soc. 31 22-1962
  • [7] Courville A(2017)Image-to-image translation with conditional adversarial networks IEEE Comput. Soc. 41 1947-173498
  • [8] Bengio Y(2020)TileGAN: category-oriented attention-based high-quality tiled clothes generation from dressed person Neural Comput. Appl. 7 173485-49:13
  • [9] Cheng P(2020)Single image deraining via deep shared pyramid network Vis. Comput. 37 49:1-14622
  • [10] He S(2021)Adaptive optimization algorithm for nonlinear Markov jump systems with partial unknown dynamics Int. J. Robust Nonlinear Control 32 14613-106