ESTUGAN: Enhanced Swin Transformer with U-Net Discriminator for Remote Sensing Image Super-Resolution

被引:5
作者
Yu, Chunhe [1 ]
Hong, Lingyue [1 ]
Pan, Tianpeng [1 ]
Li, Yufeng [1 ]
Li, Tingting [1 ]
机构
[1] Shenyang Aerosp Univ, Sch Elect & Informat Engn, Shenyang 110136, Peoples R China
关键词
Swin Transformer; U-Net discriminator; remote sensing image; super-resolution; generative adversarial network; QUALITY ASSESSMENT;
D O I
10.3390/electronics12204235
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Remote sensing image super-resolution (SR) is a practical research topic with broad applications. However, the mainstream algorithms for this task suffer from limitations. CNN-based algorithms face difficulties in modeling long-term dependencies, while generative adversarial networks (GANs) are prone to producing artifacts, making it difficult to reconstruct high-quality, detailed images. To address these challenges, we propose ESTUGAN for remote sensing image SR. On the one hand, ESTUGAN adopts the Swin Transformer as the network backbone and upgrades it to fully mobilize input information for global interaction, achieving impressive performance with fewer parameters. On the other hand, we employ a U-Net discriminator with the region-aware learning strategy for assisted supervision. The U-shaped design enables us to obtain structural information at each hierarchy and provides dense pixel-by-pixel feedback on the predicted images. Combined with the region-aware learning strategy, our U-Net discriminator can perform adversarial learning only for texture-rich regions, effectively suppressing artifacts. To achieve flexible supervision for the estimation, we employ the Best-buddy loss. And we also add the Back-projection loss as a constraint for the faithful reconstruction of the high-resolution image distribution. Extensive experiments demonstrate the superior perceptual quality and reliability of our proposed ESTUGAN in reconstructing remote sensing images.
引用
收藏
页数:20
相关论文
共 64 条
  • [1] Brock A, 2019, Arxiv, DOI [arXiv:1809.11096, DOI 10.48550/ARXIV.1809.11096]
  • [2] Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
  • [3] End-to-End Learnt Image Compression via Non-Local Attention Optimization and Improved Context Modeling
    Chen, Tong
    Liu, Haojie
    Ma, Zhan
    Shen, Qiu
    Cao, Xun
    Wang, Yao
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 3179 - 3191
  • [4] Activating More Pixels in Image Super-Resolution Transformer
    Chen, Xiangyu
    Wang, Xintao
    Zhou, Jiantao
    Qiao, Yu
    Dong, Chao
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 22367 - 22377
  • [5] Remote Sensing Image Scene Classification: Benchmark and State of the Art
    Cheng, Gong
    Han, Junwei
    Lu, Xiaoqiang
    [J]. PROCEEDINGS OF THE IEEE, 2017, 105 (10) : 1865 - 1883
  • [6] N-Gram in Swin Transformers for Efficient Lightweight Image Super-Resolution
    Choi, Haram
    Lee, Jeongmin
    Yang, Jihoon
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 2071 - 2081
  • [7] Learning a Deep Convolutional Network for Image Super-Resolution
    Dong, Chao
    Loy, Chen Change
    He, Kaiming
    Tang, Xiaoou
    [J]. COMPUTER VISION - ECCV 2014, PT IV, 2014, 8692 : 184 - 199
  • [8] Fourier Space Losses for Efficient Perceptual Image Super-Resolution
    Fuoli, Dario
    Van Gool, Luc
    Timofte, Radu
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 2340 - 2349
  • [9] Glasner D, 2009, IEEE I CONF COMP VIS, P349, DOI 10.1109/ICCV.2009.5459271
  • [10] Generative Adversarial Networks
    Goodfellow, Ian
    Pouget-Abadie, Jean
    Mirza, Mehdi
    Xu, Bing
    Warde-Farley, David
    Ozair, Sherjil
    Courville, Aaron
    Bengio, Yoshua
    [J]. COMMUNICATIONS OF THE ACM, 2020, 63 (11) : 139 - 144