Remote sensing image cloud detection using a shallow convolutional neural network

被引:10
作者
Chai, Dengfeng [1 ]
Huang, Jingfeng [2 ,3 ]
Wu, Minghui [4 ]
Yang, Xiaoping [1 ]
Wang, Ruisheng [5 ]
机构
[1] Zhejiang Univ, Sch Earth Sci, Key Lab Geosci Big Data & Deep Resource Zhejiang P, Hangzhou 310027, Peoples R China
[2] Zhejiang Univ, Inst Appl Remote Sensing & Informat Technol, Key Lab Agr Remote Sensing & Informat Syst, Hangzhou 310058, Peoples R China
[3] Zhejiang Univ, Key Lab Agr Remote Sensing & Informat Syst, Hangzhou 310058, Peoples R China
[4] Zhejiang Univ, Sch Comp & Comp Sci, City Coll, Hangzhou 310015, Peoples R China
[5] Univ Calgary, Dept Geomat Engn, Calgary, AB, Canada
基金
中国国家自然科学基金;
关键词
Cloud detection; Semantic segmentation; Shallow convolutional neural network (SCNN); Landsat; Gaofen; SHADOW DETECTION; AUTOMATED CLOUD; DETECTION ALGORITHM; LANDSAT IMAGERY; COVER ASSESSMENT; CLASSIFICATION; SEGMENTATION; VALIDATION;
D O I
10.1016/j.isprsjprs.2024.01.026
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
The state-of-the-art methods for cloud detection are dominated by deep convolutional neural networks (DCNNs). However, it is very expensive to train DCNNs for cloud detection and the trained DCNNs do not always perform well as expected. This paper proposes a shallow CNN (SCNN) by removing pooling/unpooling layers and normalization layers in DCNNs, retaining only three convolutional layers, and equipping them with 3 filters of 1 x 1,1 x 1, 3 x 3 in spatial dimensions. It demonstrates that the three convolutional layers are sufficient for cloud detection. Since the label output by the SCNN for a pixel depends on a 3 x 3 patch around this pixel, the SCNN can be trained using some thousands 3 x 3 patches together with ground truth of their center pixels. It is very cheap to train a SCNN using some thousands 3 x 3 patches and to provide ground truth of their center pixels. Despite of its low cost, SCNN training is stabler than DCNN training, and the trained SCNN outperforms the representative state-of-the-art DCNNs for cloud detection. The same resolution of original image, feature maps and final label map assures that details are not lost as by pooling/unpooling in DCNNs. The border artifacts suffering from deep convolutional and pooling/unpooling layers are minimized by 3 convolutional layers with 1x1, 1x1, 3x3 filters. Incoherent patches suffering from patch -by -patch segmentation and batch normalization are eliminated by SCNN without normalization layers. Extensive experiments based on the L7 Irish, L8 Biome and GF1 WHU datasets are carried out to evaluate the proposed method and compare with state-of-the-art methods. The proposed SCNN promises to deal with images from any other sensors. The code and data will be open for further research. 1
引用
收藏
页码:66 / 84
页数:19
相关论文
共 64 条
[1]   Statistical cloud detection from SEVIRI multispectral images [J].
Amato, U. ;
Antomadis, A. ;
Cuomo, V. ;
Cutillo, L. ;
Franzese, M. ;
Murino, L. ;
Serio, C. .
REMOTE SENSING OF ENVIRONMENT, 2008, 112 (03) :750-766
[2]  
[Anonymous], 1990, Wavelets
[3]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[4]   Aerial image semantic segmentation using DCNN predicted distance maps [J].
Chai, Dengfeng ;
Newsam, Shawn ;
Huang, Jingfeng .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 161 :309-322
[5]   Cloud and cloud shadow detection in Landsat imagery based on deep convolutional neural networks [J].
Chai, Dengfeng ;
Newsam, Shawn ;
Zhang, Hankui K. ;
Qiu, Yifan ;
Huang, Jingfeng .
REMOTE SENSING OF ENVIRONMENT, 2019, 225 :307-316
[6]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[7]   The PASCAL Visual Object Classes Challenge: A Retrospective [J].
Everingham, Mark ;
Eslami, S. M. Ali ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (01) :98-136
[8]   Cloud detection algorithm comparison and validation for operational Landsat data products [J].
Foga, Steve ;
Scaramuzza, Pat L. ;
Guo, Song ;
Zhu, Zhe ;
Dilley, Ronald D., Jr. ;
Beckmann, Tim ;
Schmidt, Gail L. ;
Dwyer, John L. ;
Hughes, M. Joseph ;
Laue, Brady .
REMOTE SENSING OF ENVIRONMENT, 2017, 194 :379-390
[9]   Enhancing the Detectability of Clouds and Their Shadows in Multitemporal Dryland Landsat Imagery: Extending Fmask [J].
Frantz, David ;
Roeder, Achim ;
Udelhoven, Thomas ;
Schmidt, Michael .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2015, 12 (06) :1242-1246
[10]  
Fulkerson B, 2009, IEEE I CONF COMP VIS, P670