Spatial Transformer Generative Adversarial Network for Robust Image Super-Resolution

被引:20
作者
Kasem, Hossam M. [1 ,2 ,3 ]
Hung, Kwok-Wai [1 ,2 ]
Jiang, Jianmin [1 ,2 ]
机构
[1] Shenzhen Univ, Coll Comp Sci & Software Engn, Res Inst Future Media Comp, Shenzhen 518060, Peoples R China
[2] Shenzhen Univ, Guangdong Lab Artificial Intelligence & Digital E, Shenzhen 518060, Peoples R China
[3] Tanta Univ, Fac Engn, Tanta 31512, Egypt
关键词
Super-resolution; generative adversarial networks; spatial transformer network; robust image super-resolution; robust generative adversarial network; RESOLUTION;
D O I
10.1109/ACCESS.2019.2959940
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, there have been signifi cant advances in image super-resolution based on generative adversarial networks (GANs) to achieve breakthroughs in generating more images with high subjective quality. However, there are remaining challenges needs to be met, such as simultaneously recovering the finer texture details for large upscaling factors and mitigating the geometric transformation effects. In this paper, we propose a novel robust super-resolution GAN (i.e. namely RSR-GAN) which can simultaneously perform both the geometric transformation and recovering the finer texture details. Specifically, since the performance of the generator depends on the discreminator, we propose a novel discriminator design by incorporating the spatial transformer module with residual learning to improve the discrimination of fake and true images through removing the geometric noise, in order to enhance the super-resolution of geometric corrected images. Finally, to further improve the perceptual quality, we introduce an additional DCT loss term into the existing loss function. Extensive experiments, measured by both PSNR and SSIM measurements, show that our proposed method achieves a high level of robustness against a number of geometric transformations, including rotation, translation, a combination of rotation and scaling effects, and a cobmination of rotaion, transalation and scaling effects. Benchmarked by the existing state-of-the-arts SR methods, our proposed delivers superior performances on a wide range of datasets which are publicly available and widely adopted across research communities.
引用
收藏
页码:182993 / 183009
页数:17
相关论文
共 55 条
[1]  
[Anonymous], 2016, P IEEE C COMPUTER VI
[2]  
[Anonymous], 2015, arXiv
[3]  
[Anonymous], 2017, P IEEE C COMP VIS PA
[4]  
[Anonymous], NEURAL NETW
[5]  
[Anonymous], IEEE ACCESS
[6]  
[Anonymous], IEEE ACCESS
[7]  
[Anonymous], 2015, PROC 28 INT C NEURAL
[8]   Contour Detection and Hierarchical Image Segmentation [J].
Arbelaez, Pablo ;
Maire, Michael ;
Fowlkes, Charless ;
Malik, Jitendra .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (05) :898-916
[9]   Deep neural network for traffic sign recognition systems: An analysis of spatial transformers and stochastic optimisation methods [J].
Arcos-Garcia, Alvaro ;
Alvarez-Garcia, Juan A. ;
Soria-Morillo, Luis M. .
NEURAL NETWORKS, 2018, 99 :158-165
[10]   Low-Complexity Single-Image Super-Resolution based on Nonnegative Neighbor Embedding [J].
Bevilacqua, Marco ;
Roumy, Aline ;
Guillemot, Christine ;
Morel, Marie-Line Alberi .
PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,