CSDFormer: A cloud and shadow detection method for landsat images based on transformer

被引:13
作者
Li, Jiayi [1 ]
Wang, Qunming [1 ,2 ]
机构
[1] Tongji Univ, Coll Surveying & Geoinformat, 1239 Siping Rd, Shanghai 200092, Peoples R China
[2] Minzu Univ China, Key Lab Ethn Language Intelligent Anal & Secur Gov, Beijing 100081, Peoples R China
基金
中国国家自然科学基金;
关键词
Cloud and shadow (CS) detection; Deep learning; Transformer; DETECTION ALGORITHM; AUTOMATED CLOUD; SNOW DETECTION; CLASSIFICATION;
D O I
10.1016/j.jag.2024.103799
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Cloud and shadow (CS) detection is crucial prerequisite for application of remote sensing images. Current deep learning-based detection algorithms mainly employ Convolutional Neural Networks (CNNs). However, the local receptive field in CNNs cannot effectively capture global contextual information, which hinders accurate characterization of the dependency between clouds and shadows. In vision Transformers, self-attention mechanisms can effectively capture the long-distance dependencies between different regions in an image. Inspired by this, this paper proposed a new CS Detection algorithm based on a Transformer, called CSDFormer. Specifically, we exclusively employed a hierarchical Transformer structure in the encoder stage to extract features of CS. Each Transformer layer contains several multi-head self-attention mechanisms for calculating pixel-wise long-distance connectivity. The designed structure enables the Transformer to better extract global context information, which helps to strengthen the comprehension of the semantic relationships between clouds and shadows. Benefiting from the global feature extraction capability of the encoder stage, we employed several simple multilayer perceptron layers for multi-scale feature map fusion and pixel classification in the decoder stage. The proposed CSDFormer was validated using 898 Landsat 8 Biome images with 512 x 512 pixels, producing an overall accuracy of 95.28 % and a mean intersection over union of 84.08 %, outperforming three state-of-the-art CNNbased algorithms. CSDFormer is consistently more accurate in detection of both clouds and shadows. Owing to the parallel computing capability of the self-attention mechanism, CSDFormer is computationally more efficient than the three CNN-based benchmark methods. For the input spectral bands, the performance of CSDFormer produced can be further enhanced with additional thermal infrared bands.
引用
收藏
页数:12
相关论文
共 53 条
[1]   Optimizing WorldView-2,-3 cloud masking using machine learning approaches [J].
Caraballo-Vega, J. A. ;
Carroll, M. L. ;
Neigh, C. S. R. ;
Wooten, M. ;
Lee, B. ;
Weis, A. ;
Aronne, M. ;
Alemu, W. G. ;
Williams, Z. .
REMOTE SENSING OF ENVIRONMENT, 2023, 284
[2]   Cloud detection from a sequence of SST images [J].
Cayula, JF ;
Cornillon, P .
REMOTE SENSING OF ENVIRONMENT, 1996, 55 (01) :80-88
[3]   Cloud and cloud shadow detection in Landsat imagery based on deep convolutional neural networks [J].
Chai, Dengfeng ;
Newsam, Shawn ;
Zhang, Hankui K. ;
Qiu, Yifan ;
Huang, Jingfeng .
REMOTE SENSING OF ENVIRONMENT, 2019, 225 :307-316
[4]   Pre-Trained Image Processing Transformer [J].
Chen, Hanting ;
Wang, Yunhe ;
Guo, Tianyu ;
Xu, Chang ;
Deng, Yiping ;
Liu, Zhenhua ;
Ma, Siwei ;
Xu, Chunjing ;
Xu, Chao ;
Gao, Wen .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :12294-12305
[5]  
Chen LC, 2018, Arxiv, DOI [arXiv:1802.02611, 10.48550/arXiv.1802.02611]
[6]   An Iterative Haze Optimized Transformation for Automatic Cloud/Haze Detection of Landsat Imagery [J].
Chen, Shuli ;
Chen, Xuehong ;
Chen, Jin ;
Jia, Pengfei ;
Cao, Xin ;
Liu, Canyou .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (05) :2682-2694
[7]   Cloud detection in Landsat imagery of ice sheets using shadow matching technique and automatic normalized difference snow index threshold value decision [J].
Choi, H ;
Bindschadler, R .
REMOTE SENSING OF ENVIRONMENT, 2004, 91 (02) :237-242
[8]   An ensemble prediction of flood susceptibility using multivariate discriminant analysis, classification and regression trees, and support vector machines [J].
Choubin, Bahram ;
Moradi, Ehsan ;
Golshan, Mohammad ;
Adamowski, Jan ;
Sajedi-Hosseini, Farzaneh ;
Mosavi, Amir .
SCIENCE OF THE TOTAL ENVIRONMENT, 2019, 651 :2087-2096
[9]  
Chu XX, 2021, ADV NEUR IN
[10]  
Dai Z, 2021, ADV NEUR IN, V34