Spatiotemporal Satellite Image Fusion Using Deep Convolutional Neural Networks

被引:270
作者
Song, Huihui [1 ]
Liu, Qingshan [1 ]
Wang, Guojie [2 ]
Hang, Renlong [1 ]
Huang, Bo [3 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Jiangsu Key Lab Big Data Anal Technol, Jiangsu Collaborat Innovat Ctr Atmospher Environm, Nanjing 210044, Jiangsu, Peoples R China
[2] Nanjing Univ Informat Sci & Technol, Collaborat Innovat Ctr Forecast & Evaluat Meteoro, Nanjing 210044, Jiangsu, Peoples R China
[3] Chinese Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China
关键词
Convolutional neural network (CNN); nonlinear mapping (NLM); spatial resolution; temporal resolution; REFLECTANCE FUSION; LANDSAT DATA; TIME-SERIES; MODIS; RESOLUTION; MODEL;
D O I
10.1109/JSTARS.2018.2797894
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We propose a novel spatiotemporal fusion method based on deep convolutional neural networks (CNNs) under the application background of massive remote sensing data. In the training stage, we build two five-layer CNNs to deal with the problems of complicated correspondence and large spatial resolution gaps between MODIS and Landsat images. Specifically, we first learn a nonlinear mapping CNN between MODIS and low-spatial-resolution (LSR) Landsat images and then learn a super-resolution CNN between LSR Landsat and original Landsat images. In the prediction stage, instead of directly taking the outputs of CNNs as the fusion result, we design a fusion model consisting of high-pass modulation and a weighting strategy to make full use of the information in prior images. Specifically, we firstmap the input MODIS images to transitional images via the learned nonlinear mapping CNN and further improve the transitional images to LSR Landsat images via the fusion model; then, via the learned SR CNN, the LSR Landsat images are supersolved to transitional images, which are further improved to Landsat images via the fusion model. Compared with the previous learning-based fusion methods, mainly referring to the sparse-representation-based methods, our CNNs-based spatiotemporalmethod has the following advantages: 1) automatically extracting effective image features; 2) learning an end-to-end mapping between MODIS and LSR Landsat images; and 3) generating more favorable fusion results. To examine the performance of the proposed fusion method, we conduct experiments on two representative Landsat-MODIS datasets by comparing with the sparse-representation-based spatiotemporal fusion model. The quantitative evaluations on all possibleprediction dates and the comparison of fusion results on one key date in both visual effect and quantitative evaluationsdemonstrate that the proposed method can generate more accurate fusionresults.
引用
收藏
页码:821 / 829
页数:9
相关论文
共 35 条
  • [11] A new data fusion model for high spatial- and temporal-resolution mapping of forest disturbance based on Landsat and MODIS
    Hilker, Thomas
    Wulder, Michael A.
    Coops, Nicholas C.
    Linke, Julia
    McDermid, Greg
    Masek, Jeffrey G.
    Gao, Feng
    White, Joanne C.
    [J]. REMOTE SENSING OF ENVIRONMENT, 2009, 113 (08) : 1613 - 1627
  • [12] Spatiotemporal Reflectance Fusion via Sparse Representation
    Huang, Bo
    Song, Huihui
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2012, 50 (10): : 3707 - 3716
  • [13] Ioffe S, 2015, PR MACH LEARN RES, V37, P448
  • [14] Two-Step Sparse Coding for the Pan-Sharpening of Remote Sensing Images
    Jiang, Cheng
    Zhang, Hongyan
    Shen, Huanfeng
    Zhang, Liangpei
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2014, 7 (05) : 1792 - 1805
  • [15] Pansharpening Quality Assessment Using the Modulation Transfer Functions of Instruments
    Khan, Muhammad Murtaza
    Alparone, Luciano
    Chanussot, Jocelyn
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2009, 47 (11): : 3880 - 3891
  • [16] ImageNet Classification with Deep Convolutional Neural Networks
    Krizhevsky, Alex
    Sutskever, Ilya
    Hinton, Geoffrey E.
    [J]. COMMUNICATIONS OF THE ACM, 2017, 60 (06) : 84 - 90
  • [17] Gradient-based learning applied to document recognition
    Lecun, Y
    Bottou, L
    Bengio, Y
    Haffner, P
    [J]. PROCEEDINGS OF THE IEEE, 1998, 86 (11) : 2278 - 2324
  • [18] An Evaluation of the Use of Atmospheric and BRDF Correction to Standardize Landsat Data
    Li, Fuqin
    Jupp, David L. B.
    Reddy, Shanti
    Lymburner, Leo
    Mueller, Norman
    Tan, Peter
    Islam, Anisul
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2010, 3 (03) : 257 - 270
  • [19] North American forest disturbance mapped from a decadal Landsat record
    Masek, Jeffrey G.
    Huang, Chengquan
    Wolfe, Robert
    Cohen, Warren
    Hall, Forrest
    Kutler, Jonathan
    Nelson, Peder
    [J]. REMOTE SENSING OF ENVIRONMENT, 2008, 112 (06) : 2914 - 2926
  • [20] Nair V., 2010, P 27 INT C MACH LEAR, P807