A Deep Cross-Modality Hashing Network for SAR and Optical Remote Sensing Images Retrieval

被引:49
作者
Xiong, Wei [1 ]
Xiong, Zhenyu [1 ]
Zhang, Yang [1 ]
Cui, Yaqi [1 ]
Gu, Xiangqi [1 ]
机构
[1] Naval Aviat Univ, Res Inst Informat Fus, Yantai 264001, Peoples R China
基金
中国国家自然科学基金;
关键词
Remote sensing; Synthetic aperture radar; Feature extraction; Optical sensors; Optical imaging; Task analysis; Adaptive optics; Cross-modality content-based remote sensing image retrieval (CM-CBRSIR); deep cross-modality hashing network (DCMHN); modality discrepancy; synthetic aperture radar (SAR)-optical dual-modality remote sensing image dataset (SODMRSID); REPRESENTATION; FEATURES;
D O I
10.1109/JSTARS.2020.3021390
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The content-based remote sensing image retrieval (CBRSIR) has recently become a hot topic due to its wide applications in analysis of remote sensing data. However, since conventional CBRSIR is unsuitable in harsh environments, this article focuses on the cross-modality CBRSIR (CM-CBRSIR) between synthetic aperture radar (SAR) and optical images. Besides the large interclass and small intraclass in CBRSIR, CM-CBRSIR is limited by prominent modality discrepancy caused by different imaging mechanisms. To address this limitation, this study proposes a deep cross-modality hashing network. First, we transform optical images with three channels into four different types of single-channel images to increase diversity of the training modalities. This helps the network to mainly focus on extracting the contour and texture shared features and makes it less sensitive to color information for images across modalities. Second, we combine any type of randomly selected transformed images and its corresponding SAR or optical images to form image pairs that are fed into the networks. The training strategy, with paired image data, eliminates the large cross-modality variations caused by different modalities. Finally, the triplet loss, in combination with the hash function, helps the modal to extract the discriminative features of images and upgrade the retrieval efficiency. To further evaluate the proposed modality, we construct a SAR-optical dual-modality remote sensing image dataset containing 12 categories. Experimental results demonstrate the superiority of the proposed method with regards to efficiency and generality.
引用
收藏
页码:5284 / 5296
页数:13
相关论文
共 69 条
[51]   Cross-Modality Person Re-Identification Based on Dual-Path Multi-Branch Network [J].
Xiang, Xuezhi ;
Lv, Ning ;
Yu, Zeting ;
Zhai, Mingliang ;
El Saddik, Abdulmotaleb .
IEEE SENSORS JOURNAL, 2019, 19 (23) :11706-11713
[52]  
Xie J, 2015, PROC CVPR IEEE, P1275, DOI 10.1109/CVPR.2015.7298732
[53]   Learning to Translate for Cross-Source Remote Sensing Image Retrieval [J].
Xiong, Wei ;
Lv, Yafei ;
Zhang, Xiaohan ;
Cui, Yaqi .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (07) :4860-4874
[54]   A Discriminative Distillation Network for Cross-Source Remote Sensing Image Retrieval [J].
Xiong, Wei ;
Xiong, Zhenyu ;
Cui, Yaqi ;
Lv, Yafei .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 :1234-1247
[55]   A Discriminative Feature Learning Approach for Remote Sensing Image Retrieval [J].
Xiong, Wei ;
Lv, Yafei ;
Cui, Yaqi ;
Zhang, Xiaohan ;
Gu, Xiangqi .
REMOTE SENSING, 2019, 11 (03)
[56]   Learning Discriminative Binary Codes for Large-scale Cross-modal Retrieval [J].
Xu, Xing ;
Shen, Fumin ;
Yang, Yang ;
Shen, Heng Tao ;
Li, Xuelong .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (05) :2494-2507
[57]   Supervised Learning of Semantics-Preserving Hash via Deep Convolutional Neural Networks [J].
Yang, Huei-Fang ;
Lin, Kevin ;
Chen, Chu-Song .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (02) :437-451
[58]   An improved Bag-of-Words framework for remote sensing image retrieval in large-scale image databases [J].
Yang, Jin ;
Liu, Jianbo ;
Dai, Qin .
INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2015, 8 (04) :273-292
[59]   Dynamic Match Kernel with Deep Convolutional Features for Image Retrieval [J].
Yang, Jufeng ;
Liang, Jie ;
Shen, Hui ;
Wang, Kai ;
Rosin, Paul L. ;
Yang, Ming-Hsuan .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (11) :5288-5302
[60]   Multitask Learning for Cross-Domain Image Captioning [J].
Yang, Min ;
Zhao, Wei ;
Xu, Wei ;
Feng, Yabing ;
Zhao, Zhou ;
Chen, Xiaojun ;
Lei, Kai .
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (04) :1047-1061