CloudViT: A Lightweight Vision Transformer Network for Remote Sensing Cloud Detection

被引:21
作者
Zhang, Bin [1 ]
Zhang, Yongjun [1 ]
Li, Yansheng [1 ]
Wan, Yi [1 ]
Yao, Yongxiang [1 ]
机构
[1] Wuhan Univ, Sch Remote Sensing & Informat Engn, Dept Photogrammetry, Wuhan 430079, Peoples R China
基金
中国国家自然科学基金;
关键词
Cloud computing; Remote sensing; Feature extraction; Earth; Artificial satellites; Satellites; Transformers; Attention mechanism; cloud detection; deep learning; remote sensing image; vision transformer (ViT); FEATURES; IMAGES;
D O I
10.1109/LGRS.2022.3233122
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Clouds inevitably exist in satellite images, which limit the processing and application of satellite images to a certain extent. Therefore, cloud detection is a preprocessing task in satellite image extraction and analysis processing. However, the existing methods are difficult to mine robust features, and the number of parameters and computation are large, which is not conducive to the deployment of the model. In this letter, cloud vision transformer (CloudViT), a lightweight vision transformer network for cloud detection from satellite imagery, is proposed. In detail, to utilize dark channel priors in multispectral imagery to guide the network to learn features, a multiscale dark channel extractor is used to first predict dark channels, and then, the dark channel features and image features are input to the attention mechanism-based dark channel-guided context aggregation module to enhance image features, which in turn makes cloud detection results more accurate. At the same time, to enhance the transfer ability of the network between different satellite sensors, a plug-and-play channel adaptive module is proposed to deal with the inconsistency of the number of different satellite sensor bands. The experimental results on the Landsat7 dataset show that our network CloudViT outperforms the state-of-the-art methods while keeping the number of parameters and computation small. At the same time, the experimental results on transfer to three other datasets show that using the channel adaptation module can greatly improve the transfer ability of the model.
引用
收藏
页数:5
相关论文
共 18 条
[1]  
Chen LB, 2017, IEEE INT SYMP NANO, P1, DOI 10.1109/NANOARCH.2017.8053709
[2]  
Dosovitskiy Alexey., 2021, PROC INT C LEARN REP, P2021
[3]   Dual Attention Network for Scene Segmentation [J].
Fu, Jun ;
Liu, Jing ;
Tian, Haijie ;
Li, Yong ;
Bao, Yongjun ;
Fang, Zhiwei ;
Lu, Hanqing .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3141-3149
[4]  
Guo J., 2020, IEEE T GEOSCI ELECT, V59
[5]   Single Image Haze Removal Using Dark Channel Prior [J].
He, Kaiming ;
Sun, Jian ;
Tang, Xiaoou .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (12) :2341-2353
[6]   DABNet: Deformable Contextual and Boundary-Weighted Network for Cloud Detection in Remote Sensing Images [J].
He, Qibin ;
Sun, Xian ;
Yan, Zhiyuan ;
Fu, Kun .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[7]  
Hu J, 2018, PROC CVPR IEEE, P7132, DOI [10.1109/TPAMI.2019.2913372, 10.1109/CVPR.2018.00745]
[8]   A Lightweight Deep Learning-Based Cloud Detection Method for Sentinel-2A Imagery Fusing Multiscale Spectral and Spatial Features [J].
Li, Jun ;
Wu, Zhaocong ;
Hu, Zhongwen ;
Jian, Canliang ;
Luo, Shaojie ;
Mou, Lichao ;
Zhu, Xiao Xiang ;
Molinier, Matthieu .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[9]   Selective Kernel Networks [J].
Li, Xiang ;
Wang, Wenhai ;
Hu, Xiaolin ;
Yang, Jian .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :510-519
[10]   Cloud and cloud shadow detection for optical satellite imagery: Features, algorithms, validation, and prospects [J].
Li, Zhiwei ;
Shen, Huanfeng ;
Weng, Qihao ;
Zhang, Yuzhuo ;
Dou, Peng ;
Zhang, Liangpei .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2022, 188 :89-108