Spatiotemporal Satellite Image Fusion Using Deep Convolutional Neural Networks

Cited by: 286

Authors:

Song, Huihui [1]

Liu, Qingshan [1]

Wang, Guojie [2]

Hang, Renlong [1]

Huang, Bo [3]

Affiliations:

[1] Nanjing Univ Informat Sci & Technol, Jiangsu Key Lab Big Data Anal Technol, Jiangsu Collaborat Innovat Ctr Atmospher Environm, Nanjing 210044, Jiangsu, Peoples R China

[2] Nanjing Univ Informat Sci & Technol, Collaborat Innovat Ctr Forecast & Evaluat Meteoro, Nanjing 210044, Jiangsu, Peoples R China

[3] Chinese Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China

Source:

IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING | 2018, Vol. 11, No. 3

Keywords:

Convolutional neural network (CNN); nonlinear mapping (NLM); spatial resolution; temporal resolution; REFLECTANCE FUSION; LANDSAT DATA; TIME-SERIES; MODIS; RESOLUTION; MODEL;

DOI:

10.1109/JSTARS.2018.2797894

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronic technology, communication technology];

Discipline classification code:

0808 ; 0809 ;

Abstract:

We propose a novel spatiotemporal fusion method based on deep convolutional neural networks (CNNs) under the application background of massive remote sensing data. In the training stage, we build two five-layer CNNs to deal with the problems of complicated correspondence and large spatial resolution gaps between MODIS and Landsat images. Specifically, we first learn a nonlinear mapping CNN between MODIS and low-spatial-resolution (LSR) Landsat images and then learn a super-resolution (SR) CNN between LSR Landsat and original Landsat images. In the prediction stage, instead of directly taking the outputs of the CNNs as the fusion result, we design a fusion model consisting of high-pass modulation and a weighting strategy to make full use of the information in prior images. Specifically, we first map the input MODIS images to transitional images via the learned nonlinear mapping CNN and further improve the transitional images to LSR Landsat images via the fusion model; then, via the learned SR CNN, the LSR Landsat images are super-resolved to transitional images, which are further improved to Landsat images via the fusion model. Compared with previous learning-based fusion methods, mainly the sparse-representation-based methods, our CNN-based spatiotemporal method has the following advantages: 1) automatically extracting effective image features; 2) learning an end-to-end mapping between MODIS and LSR Landsat images; and 3) generating more favorable fusion results. To examine the performance of the proposed fusion method, we conduct experiments on two representative Landsat-MODIS datasets, comparing against the sparse-representation-based spatiotemporal fusion model. The quantitative evaluations on all possible prediction dates, together with the comparison of fusion results on one key date in both visual effect and quantitative evaluation, demonstrate that the proposed method can generate more accurate fusion results.
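The two-stage prediction flow described in the abstract can be sketched as below. This is only an illustration of the data flow, not the authors' implementation: the trained CNNs and the fusion model are passed in as callables, and every name here (`predict`, `identity_model`, `upsample2x`, `average_fuse`) is hypothetical. The stand-ins used for the demo are deliberately trivial; in particular, `average_fuse` is a placeholder and does not implement the high-pass modulation or weighting strategy, whose formulas the abstract does not give. Pairing stage 1 with a prior LSR Landsat image and stage 2 with a prior Landsat image is likewise an assumption.

```python
from typing import Callable

import numpy as np

Image = np.ndarray
Model = Callable[[Image], Image]        # stand-in for a trained CNN
Fuse = Callable[[Image, Image], Image]  # (transitional image, prior image) -> improved image


def predict(modis: Image, prior_lsr: Image, prior_landsat: Image,
            nlm_cnn: Model, sr_cnn: Model, fuse: Fuse) -> Image:
    """Two-stage prediction flow following the abstract's description."""
    # Stage 1: MODIS -> transitional image -> LSR Landsat (via the fusion model).
    transitional_lsr = nlm_cnn(modis)
    lsr_landsat = fuse(transitional_lsr, prior_lsr)
    # Stage 2: LSR Landsat -> transitional image -> Landsat (via the fusion model).
    transitional_hr = sr_cnn(lsr_landsat)
    return fuse(transitional_hr, prior_landsat)


# --- Hypothetical stand-ins, for illustration only -------------------------

def identity_model(img: Image) -> Image:
    """Placeholder for the nonlinear mapping CNN (returns a copy of its input)."""
    return img.copy()


def upsample2x(img: Image) -> Image:
    """Placeholder for the SR CNN: nearest-neighbour 2x upsampling of a 2-D image."""
    return np.kron(img, np.ones((2, 2)))


def average_fuse(transitional: Image, prior: Image) -> Image:
    """Placeholder fusion: plain average, NOT the paper's fusion model."""
    return 0.5 * (transitional + prior)
```

A call such as `predict(modis, prior_lsr, prior_landsat, identity_model, upsample2x, average_fuse)` runs both stages in order; swapping in real trained models and a real fusion function would not change the control flow.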

Pages: 821-829

Page count: 9

