Cross-Modal Contrastive Learning for Remote Sensing Image Classification

被引:29
作者
Feng, Zhixi [1 ]
Song, Liangliang [1 ]
Yang, Shuyuan [1 ]
Zhang, Xinyu [1 ]
Jiao, Licheng [1 ]
机构
[1] Xidian Univ, Sch Artificial Intelligence, Xian 710071, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2023年 / 61卷
基金
中国国家自然科学基金;
关键词
Cross-modal contrastive learning (CMCL); multimodal remote sensing image (MRSI) classification; self-supervised; LIDAR DATA; FUSION;
D O I
10.1109/TGRS.2023.3296703
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Recently, multimodal remote sensing image (MRSI) classification has attracted increasing attention from researchers. However, the classification of MRSI with limited labeled instances is still a challenging task. In this article, a novel self-supervised cross-modal contrastive learning (CMCL) method is proposed for MRSI classification. Joint intramodal contrastive learning (IMCL) and CMCL are used to better mine multimodal feature representations during pretraining, and the IMCL and CMCL objectives are jointly optimized, whereby it encourages the learned representation to be semantically consistent within and between modalities simultaneously. Moreover, a simple but effective hybrid cross-modal fusion module (HCFM) is designed in the fine-tuning stage, which could better compactly integrate complementary information across these modalities for more accurate classification. Extensive experiments are taken on four benchmark datasets (i.e., Houston 2013, Augsburg, Germany; Trento, Italy; and Berlin, Germany), and the results show that the proposed method outperforms state-of-the-art methods.
引用
收藏
页数:13
相关论文
共 48 条
[1]  
Chen T, 2020, PR MACH LEARN RES, V119
[2]   Deep Fusion of Remote Sensing Data for Accurate Classification [J].
Chen, Yushi ;
Li, Chunyang ;
Ghamisi, Pedram ;
Jia, Xiuping ;
Gu, Yanfeng .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2017, 14 (08) :1253-1257
[3]   Global-Local Transformer Network for HSI and LiDAR Data Joint Classification [J].
Ding, Kexing ;
Lu, Ting ;
Fu, Wei ;
Li, Shutao ;
Ma, Fuyan .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[4]   Multibranch Feature Fusion Network With Self- and Cross-Guided Attention for Hyperspectral and LiDAR Classification [J].
Dong, Wenqian ;
Zhang, Tian ;
Qu, Jiahui ;
Xiao, Song ;
Zhang, Tongzhen ;
Li, Yunsong .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[5]  
Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
[6]   Self-Supervised Divide-and-Conquer Generative Adversarial Network for Classification of Hyperspectral Images [J].
Feng, Jie ;
Zhao, Ning ;
Shang, Ronghua ;
Zhang, Xiangrong ;
Jiao, Licheng .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[7]   Multimodal Classification of Remote Sensing Images: A Review and Future Directions [J].
Gomez-Chova, Luis ;
Tuia, Devis ;
Moser, Gabriele ;
Camps-Valls, Gustau .
PROCEEDINGS OF THE IEEE, 2015, 103 (09) :1560-1584
[8]  
Grill J.-B., 2020, P INT C MACH LEARN
[9]   Classification of Hyperspectral and LiDAR Data Using Coupled CNNs [J].
Hang, Renlong ;
Li, Zhu ;
Ghamisi, Pedram ;
Hong, Danfeng ;
Xia, Guiyu ;
Liu, Qingshan .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (07) :4939-4950
[10]   Urban structure type characterization using hyperspectral remote sensing and height information [J].
Heiden, Uta ;
Heldens, Wieke ;
Roessner, Sigrid ;
Segl, Karl ;
Esch, Thomas ;
Mueller, Andreas .
LANDSCAPE AND URBAN PLANNING, 2012, 105 (04) :361-375