Spatiotemporal Satellite Image Fusion Using Deep Convolutional Neural Networks

被引:270
作者
Song, Huihui [1 ]
Liu, Qingshan [1 ]
Wang, Guojie [2 ]
Hang, Renlong [1 ]
Huang, Bo [3 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Jiangsu Key Lab Big Data Anal Technol, Jiangsu Collaborat Innovat Ctr Atmospher Environm, Nanjing 210044, Jiangsu, Peoples R China
[2] Nanjing Univ Informat Sci & Technol, Collaborat Innovat Ctr Forecast & Evaluat Meteoro, Nanjing 210044, Jiangsu, Peoples R China
[3] Chinese Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China
关键词
Convolutional neural network (CNN); nonlinear mapping (NLM); spatial resolution; temporal resolution; REFLECTANCE FUSION; LANDSAT DATA; TIME-SERIES; MODIS; RESOLUTION; MODEL;
D O I
10.1109/JSTARS.2018.2797894
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We propose a novel spatiotemporal fusion method based on deep convolutional neural networks (CNNs) under the application background of massive remote sensing data. In the training stage, we build two five-layer CNNs to deal with the problems of complicated correspondence and large spatial resolution gaps between MODIS and Landsat images. Specifically, we first learn a nonlinear mapping CNN between MODIS and low-spatial-resolution (LSR) Landsat images and then learn a super-resolution CNN between LSR Landsat and original Landsat images. In the prediction stage, instead of directly taking the outputs of CNNs as the fusion result, we design a fusion model consisting of high-pass modulation and a weighting strategy to make full use of the information in prior images. Specifically, we firstmap the input MODIS images to transitional images via the learned nonlinear mapping CNN and further improve the transitional images to LSR Landsat images via the fusion model; then, via the learned SR CNN, the LSR Landsat images are supersolved to transitional images, which are further improved to Landsat images via the fusion model. Compared with the previous learning-based fusion methods, mainly referring to the sparse-representation-based methods, our CNNs-based spatiotemporalmethod has the following advantages: 1) automatically extracting effective image features; 2) learning an end-to-end mapping between MODIS and LSR Landsat images; and 3) generating more favorable fusion results. To examine the performance of the proposed fusion method, we conduct experiments on two representative Landsat-MODIS datasets by comparing with the sparse-representation-based spatiotemporal fusion model. The quantitative evaluations on all possibleprediction dates and the comparison of fusion results on one key date in both visual effect and quantitative evaluationsdemonstrate that the proposed method can generate more accurate fusionresults.
引用
收藏
页码:821 / 829
页数:9
相关论文
共 35 条
  • [1] [Anonymous], 2008, NIPS
  • [2] Rich feature hierarchies for accurate object detection and semantic segmentation
    Girshick, Ross
    Donahue, Jeff
    Darrell, Trevor
    Malik, Jitendra
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 580 - 587
  • [3] MODTRAN4 radiative transfer modeling for atmospheric correction
    Berk, A
    Anderson, GP
    Bernstein, LS
    Acharya, PK
    Dothe, H
    Matthew, MW
    Adler-Golden, SM
    Chetwynd, JH
    Richtsmeier, SC
    Pukall, B
    Allred, CL
    Jeong, LS
    Hoke, ML
    [J]. OPTICAL SPECTROSCOPIC TECHNIQUES AND INSTRUMENTATION FOR ATMOSPHERIC AND SPACE RESEARCH III, 1999, 3756 : 348 - 353
  • [4] A Spatial and Temporal Nonlocal Filter-Based Data Fusion Method
    Cheng, Qing
    Liu, Huiqing
    Shen, Huanfeng
    Wu, Penghai
    Zhang, Liangpei
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (08): : 4476 - 4488
  • [5] Image Super-Resolution Using Deep Convolutional Networks
    Dong, Chao
    Loy, Chen Change
    He, Kaiming
    Tang, Xiaoou
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (02) : 295 - 307
  • [6] Assessing the accuracy of blending Landsat-MODIS surface reflectances in two landscapes with contrasting spatial and temporal dynamics: A framework for algorithm selection
    Emelyanova, Irina V.
    McVicar, Tim R.
    Van Niel, Thomas G.
    Li, Ling Tao
    van Dijk, Albert I. J. M.
    [J]. REMOTE SENSING OF ENVIRONMENT, 2013, 133 : 193 - 209
  • [7] On the blending of the Landsat and MODIS surface reflectance: Predicting daily Landsat surface reflectance
    Gao, Feng
    Masek, Jeff
    Schwaller, Matt
    Hall, Forrest
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2006, 44 (08): : 2207 - 2218
  • [8] An Online Coupled Dictionary Learning Approach for Remote Sensing Image Fusion
    Guo, Min
    Zhang, Hongyan
    Li, Jiayi
    Zhang, Liangpei
    Shen, Huanfeng
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2014, 7 (04) : 1284 - 1294
  • [9] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [10] Generation of dense time series synthetic Landsat data through data blending with MODIS using a spatial and temporal adaptive reflectance fusion model
    Hilker, Thomas
    Wulder, Michael A.
    Coops, Nicholas C.
    Seitz, Nicole
    White, Joanne C.
    Gao, Feng
    Masek, Jeffrey G.
    Stenhouse, Gordon
    [J]. REMOTE SENSING OF ENVIRONMENT, 2009, 113 (09) : 1988 - 1999