Generative Self-Supervised Learning With Spectral-Spatial Masking for Hyperspectral Target Detection

被引:7
作者
Chen, Xi [1 ]
Zhang, Yuxiang [1 ]
Dong, Yanni [2 ]
Du, Bo [3 ]
机构
[1] China Univ Geosci, Sch Geophys & Geomatics, Wuhan 430074, Peoples R China
[2] Wuhan Univ, Sch Resource & Environm Sci, Wuhan 430079, Peoples R China
[3] Wuhan Univ, Sch Comp, Wuhan 430079, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
基金
中国国家自然科学基金;
关键词
Generative learning; hyperspectral target detection (HTD); self-supervised learning (SSL); vision transformer (ViT); IMAGE CLASSIFICATION; MATCHED-FILTER; REPRESENTATION; MODEL;
D O I
10.1109/TGRS.2024.3423781
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Deep learning (DL) has made significant progress in hyperspectral target detection (HTD) in recent years. However, the existing DL-based HTD methods generally generate numerous labeled samples for network training, which may be impure or too similar to each other. Moreover, most methods with enormous parameters are trained and tested on the same dataset, resulting in single scenario applicability and significant computational consumption issues. To solve these issues, we propose a generative self-supervised learning (GSSL) pretraining model with spectral-spatial masking (S2M). The lightweight vision transformer (ViT) is utilized as the backbone to learn the universal feature representation of images without labeled samples. Subsequently, the pretrained model is transferred to various HTD tasks. The transfer learning model is constructed via the lightweight ViT and a fully connected (FC) layer and fine-tuned via a weighted binary cross entropy (WBCE) loss function and a small number of selected samples. We evaluate its effectiveness on four challenging hyperspectral datasets in terms of the GSSL pretraining model, the S2M strategy, and the WBCE loss function. Our methods achieve improvements in comparison to different pretraining models, masking strategies and loss functions. And our detection results also outperform other state-of-the-art approaches.
引用
收藏
页数:13
相关论文
共 66 条
[61]   ViT-YOLO:Transformer-Based YOLO for Object Detection [J].
Zhang, Zixiao ;
Lu, Xiaoqiang ;
Cao, Guojin ;
Yang, Yuting ;
Jiao, Licheng ;
Liu, Fang .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, :2799-2808
[62]   Ensemble-Based Cascaded Constrained Energy Minimization for Hyperspectral Target Detection [J].
Zhao, Rui ;
Shi, Zhenwei ;
Zou, Zhengxia ;
Zhang, Zhou .
REMOTE SENSING, 2019, 11 (11)
[63]   Learning Single Spectral Abundance for Hyperspectral Subpixel Target Detection [J].
Zhu, Dehui ;
Du, Bo ;
Zhang, Liangpei .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (07) :10134-10144
[64]   Two-Stream Convolutional Networks for Hyperspectral Target Detection [J].
Zhu, Dehui ;
Du, Bo ;
Zhang, Liangpei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (08) :6907-6921
[65]   Target Dictionary Construction-Based Sparse Representation Hyperspectral Target Detection Methods [J].
Zhu, Dehui ;
Du, Bo ;
Zhang, Liangpei .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2019, 12 (04) :1254-1264
[66]   Hierarchical Suppression Method for Hyperspectral Target Detection [J].
Zou, Zhengxia ;
Shi, Zhenwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (01) :330-342