ESTUGAN: Enhanced Swin Transformer with U-Net Discriminator for Remote Sensing Image Super-Resolution

被引：5

作者：

Yu, Chunhe ^{[1
]}

Hong, Lingyue ^{[1
]}

Pan, Tianpeng ^{[1
]}

Li, Yufeng ^{[1
]}

Li, Tingting ^{[1
]}

机构：

[1] Shenyang Aerosp Univ, Sch Elect & Informat Engn, Shenyang 110136, Peoples R China

来源：

ELECTRONICS | 2023年 / 12卷 / 20期

关键词：

Swin Transformer; U-Net discriminator; remote sensing image; super-resolution; generative adversarial network; QUALITY ASSESSMENT;

D O I：

10.3390/electronics12204235

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Remote sensing image super-resolution (SR) is a practical research topic with broad applications. However, the mainstream algorithms for this task suffer from limitations. CNN-based algorithms face difficulties in modeling long-term dependencies, while generative adversarial networks (GANs) are prone to producing artifacts, making it difficult to reconstruct high-quality, detailed images. To address these challenges, we propose ESTUGAN for remote sensing image SR. On the one hand, ESTUGAN adopts the Swin Transformer as the network backbone and upgrades it to fully mobilize input information for global interaction, achieving impressive performance with fewer parameters. On the other hand, we employ a U-Net discriminator with the region-aware learning strategy for assisted supervision. The U-shaped design enables us to obtain structural information at each hierarchy and provides dense pixel-by-pixel feedback on the predicted images. Combined with the region-aware learning strategy, our U-Net discriminator can perform adversarial learning only for texture-rich regions, effectively suppressing artifacts. To achieve flexible supervision for the estimation, we employ the Best-buddy loss. And we also add the Back-projection loss as a constraint for the faithful reconstruction of the high-resolution image distribution. Extensive experiments demonstrate the superior perceptual quality and reliability of our proposed ESTUGAN in reconstructing remote sensing images.

引用

页数：20

共 64 条

[1] Brock A, 2019, Arxiv, DOI [arXiv:1809.11096, DOI 10.48550/ARXIV.1809.11096]
[2] Carion Nicolas, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12346), P213, DOI 10.1007/978-3-030-58452-8_13
[3] End-to-End Learnt Image Compression via Non-Local Attention Optimization and Improved Context Modeling
Chen, Tong
Liu, Haojie
Ma, Zhan
Shen, Qiu
Cao, Xun
Wang, Yao
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 3179 - 3191
[4] Activating More Pixels in Image Super-Resolution Transformer
Chen, Xiangyu
Wang, Xintao
Zhou, Jiantao
Qiao, Yu
Dong, Chao
[J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 22367 - 22377
[5] Remote Sensing Image Scene Classification: Benchmark and State of the Art
Cheng, Gong
Han, Junwei
Lu, Xiaoqiang
[J]. PROCEEDINGS OF THE IEEE, 2017, 105 (10) : 1865 - 1883
[6] N-Gram in Swin Transformers for Efficient Lightweight Image Super-Resolution
Choi, Haram
Lee, Jeongmin
Yang, Jihoon
[J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 2071 - 2081
[7] Learning a Deep Convolutional Network for Image Super-Resolution
Dong, Chao
Loy, Chen Change
He, Kaiming
Tang, Xiaoou
[J]. COMPUTER VISION - ECCV 2014, PT IV, 2014, 8692 : 184 - 199
[8] Fourier Space Losses for Efficient Perceptual Image Super-Resolution
Fuoli, Dario
Van Gool, Luc
Timofte, Radu
[J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 2340 - 2349
[9] Glasner D, 2009, IEEE I CONF COMP VIS, P349, DOI 10.1109/ICCV.2009.5459271
[10] Generative Adversarial Networks
Goodfellow, Ian
Pouget-Abadie, Jean
Mirza, Mehdi
Xu, Bing
Warde-Farley, David
Ozair, Sherjil
Courville, Aaron
Bengio, Yoshua
[J]. COMMUNICATIONS OF THE ACM, 2020, 63 (11) : 139 - 144

← 1 2 3 4 5 6 7 →