Pan-sharpening by deep recursive residual network

被引:0
|
作者
Wang F. [1 ,2 ]
Guo Q. [1 ]
Ge X. [1 ]
机构
[1] Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing
[2] School of Electronic, Electrical and Communication Engineering, University of Chinese Academy of Sciences, Beijing
基金
中国国家自然科学基金;
关键词
Convolutional neural network; Deep learning; Recursive network; Remote sensing image fusion; Residual network; Space spectrum fusion;
D O I
10.11834/jrs.20219250
中图分类号
学科分类号
摘要
Pan-sharpening is a task in the field of remote sensing data fusion, in which multispectral (MS) images with rich spectral information but low spatial resolution and panchromatic (PAN) images with rich spatial details but only grey information are fused to yield images with high spatial and spectral resolution. Traditional Component Substitution (CS) methods replace a particular component of the MS image transformation with a PAN image, and then inversely transforms it to obtain the final fused image. The traditional MultiResolution Analysis (MRA) methods first extract spatial structures from the PAN image by using MRA transforms, and then the extracted spatial structure information is injected into the up-sampled MS images to obtain the fused image. The whole fusing process of the CS and MRA methods can be described as linear functions. However, the performance of such linear models are limited by their linearity, which often has spectral distortion. In recent years, many advanced nonlinear deep learning models have been proposed. However, those existing deep learning fusion models are relatively simple and pose difficultly in learn in-depth features. To overcome the shortcomings of the current models, we propose a deep recursive residual network that is specifically designed for the pan-sharpening task.Considering that the low-resolution input image and the high-resolution output image have high similarity, learning the relationship between input and output is highly redundant and difficult. If the sparse residual features between input and output are learned directly, then the network convergence can be significantly improved. Thus, the residual learning introduces the network structure, in which the introduced residuals include global residuals and local residuals. Such a structure is conducive to learning and not prone to overfitting. Moreover, the residual network can solve the problem of deep network gradient disappearance and gradient explosion well. Recursive network improves accuracy by increasing the number of network layers without increasing weight parameters. Specifically, as we use the residual network globally, recursive learning is introduced into residual learning by constructing recursive blocks structure, whereas multiple local residual units are stacked together in the recursive block. Through such an end-to-end network design, a better image fusion effect is obtained.Given that no ideal fusion result has been used as a label, we made a data set according to Wald's protocol using the original MS as the ideal fused image, downsampling and then upsampling the MS as the MS of the network input, and the downsampled PAN as the PAN of the network input. To comprehensively analyze our experimental results, we performed a large number of simulation experiments and real experiments on the 4-band GaoFen-1 data and 8-band WorldView-2 data with abundant feature types. We then generalized them to 4-band GeoEye data and 8-band WorldView-3 data. Experimental results are compared with traditional methods and the existing deep learning methods. The subjective visual analysis and objective evaluation indicators show that the proposed method reduces the spectral distortion phenomenon of traditional methods and preserves the spectrum of an image better than the existing deep learning method does.The deep network designed in this paper has learned more in-depth and more luxurious image features and has achieved better fusion effects than existing methods. It uses a residual network to solve profound network gradient disappearance, gradient explosion, and degradation problems. In addition, the weight parameters are reduced by the design of the recurrent recursive block, and the network speed is improved. The generalization experiment shows that our network has a good generalization ability. © 2021 SinoMaps Press. All right reserved.
引用
收藏
页码:1244 / 1256
页数:12
相关论文
共 23 条
  • [1] Azarang A, Ghassemian H., A new pansharpening method using multi resolution analysis framework and deep neural networks, Proceedings of the 2017 3rd International Conference on Pattern Recognition and Image Analysis (IPRIA), pp. 1-6, (2017)
  • [2] Chavez P S, Kwarteng A Y., Extracting spectral contrast in landsat thematic mapper image data using selective principal component analysis, Photogrammetric Engineering and Remote Sensing, 55, 3, pp. 339-348, (1988)
  • [3] Choi J, Yu K, Kim Y., A new adaptive component-substitution-based satellite image fusion by using partial replacement, IEEE Transactions on Geoscience and Remote Sensing, 49, 1, pp. 295-309, (2011)
  • [4] Easley G, Labate D, Lim W Q., Sparse directional image representations using the discrete shearlet transform, Applied and Computational Harmonic Analysis, 25, 1, pp. 25-46, (2008)
  • [5] Ehlers M., Multisensor image fusion techniques in remote sensing, ISPRS Journal of Photogrammetry and Remote Sensing, 46, 1, pp. 19-30, (1991)
  • [6] Koutsias N, Karteris M, Chuvieco E., The use of intensity-hue-saturation transformation of landsat-5 thematic mapper data for burned land mapping, Photogrammetric Engineering and Remote Sensing, 66, 7, pp. 829-839, (2000)
  • [7] Laben C A, Brower B V., Process for enhancing the spatial resolution of multispectral imagery using pan-sharpening, (2000)
  • [8] Le Pennec E, Mallat S., Image compression with geometrical wavelets, Proceedings 2000 International Conference on Image Processing, pp. 661-664, (2000)
  • [9] Liang M, Hu X L., Recurrent convolutional neural network for object recognition, Proceedings of 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3367-3375, (2015)
  • [10] Mallat S G., Multifrequency channel decompositions of images and wavelet models, IEEE Transactions on Acoustics, Speech, and Signal Processing, 37, 12, pp. 2091-2110, (1989)