Cross-Modal Contrastive Learning for Remote Sensing Image Classification

被引:29
作者
Feng, Zhixi [1 ]
Song, Liangliang [1 ]
Yang, Shuyuan [1 ]
Zhang, Xinyu [1 ]
Jiao, Licheng [1 ]
机构
[1] Xidian Univ, Sch Artificial Intelligence, Xian 710071, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2023年 / 61卷
基金
中国国家自然科学基金;
关键词
Cross-modal contrastive learning (CMCL); multimodal remote sensing image (MRSI) classification; self-supervised; LIDAR DATA; FUSION;
D O I
10.1109/TGRS.2023.3296703
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Recently, multimodal remote sensing image (MRSI) classification has attracted increasing attention from researchers. However, the classification of MRSI with limited labeled instances is still a challenging task. In this article, a novel self-supervised cross-modal contrastive learning (CMCL) method is proposed for MRSI classification. Joint intramodal contrastive learning (IMCL) and CMCL are used to better mine multimodal feature representations during pretraining, and the IMCL and CMCL objectives are jointly optimized, whereby it encourages the learned representation to be semantically consistent within and between modalities simultaneously. Moreover, a simple but effective hybrid cross-modal fusion module (HCFM) is designed in the fine-tuning stage, which could better compactly integrate complementary information across these modalities for more accurate classification. Extensive experiments are taken on four benchmark datasets (i.e., Houston 2013, Augsburg, Germany; Trento, Italy; and Berlin, Germany), and the results show that the proposed method outperforms state-of-the-art methods.
引用
收藏
页数:13
相关论文
共 48 条
[31]   Self-Supervised Assisted Semi-Supervised Residual Network for Hyperspectral Image Classification [J].
Song, Liangliang ;
Feng, Zhixi ;
Yang, Shuyuan ;
Zhang, Xinyu ;
Jiao, Licheng .
REMOTE SENSING, 2022, 14 (13)
[32]   Ensemble Learning for Hyperspectral Image Classification Using Tangent Collaborative Representation [J].
Su, Hongjun ;
Yu, Yao ;
Du, Qian ;
Du, Peijun .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (06) :3778-3790
[33]   SpectralSpatial Feature Tokenization Transformer for Hyperspectral Image Classification [J].
Sun, Le ;
Zhao, Guangrui ;
Zheng, Yuhui ;
Wu, Zebin .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[34]  
Vaswani A, 2017, ADV NEUR IN, V30
[35]   Hyperspectral and SAR Image Classification via Multiscale Interactive Fusion Network [J].
Wang, Junjie ;
Li, Wei ;
Gao, Yunhao ;
Zhang, Mengmeng ;
Tao, Ran ;
Du, Qian .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (12) :10823-10837
[36]   Convolutional Neural Networks for Multimodal Remote Sensing Data Classification [J].
Wu, Xin ;
Hong, Danfeng ;
Chanussot, Jocelyn .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[37]   Hyperspectral and LiDAR Classification With Semisupervised Graph Fusion [J].
Xia, Junshi ;
Liao, Wenzhi ;
Du, Peijun .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (04) :666-670
[38]   Fusion of Hyperspectral and LiDAR Data With a Novel Ensemble Classifier [J].
Xia, Junshi ;
Yokoya, Naoto ;
Iwasaki, Akira .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2018, 15 (06) :957-961
[39]   Unsupervised Spectral-Spatial Semantic Feature Learning for Hyperspectral Image Classification [J].
Xu, Huilin ;
He, Wei ;
Zhang, Liangpei ;
Zhang, Hongyan .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[40]   Multisource Remote Sensing Data Classification Based on Convolutional Neural Network [J].
Xu, Xiaodong ;
Li, Wei ;
Ran, Qiong ;
Du, Qian ;
Gao, Lianru ;
Zhang, Bing .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (02) :937-949