Deep Metric Learning Based on Scalable Neighborhood Components for Remote Sensing Scene Characterization

被引:60
作者
Kang, Jian [1 ]
Fernandez-Beltran, Ruben [2 ]
Ye, Zhen [3 ]
Tong, Xiaohua [3 ]
Ghamisi, Pedram [4 ]
Plaza, Antonio [5 ]
机构
[1] Tech Univ Berlin, Fac Elect Engn & Comp Sci, D-10587 Berlin, Germany
[2] Univ Jaume 1, Inst New Imaging Technol, Castellon De La Plana 12071, Spain
[3] Tongji Univ, Coll Surveying & Geoinformat, Shanghai 200092, Peoples R China
[4] Helmholtz Zentrum Dresden Rossendorf, Explorat Div, Machine Learning Grp, Helmholtz Inst Freiberg Resource Technol, D-09599 Freiberg, Germany
[5] Univ Extremadura, Dept Technol Comp & Commun, Hyperspectral Comp Lab, Escuela Politecn, Caceres 10003, Spain
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2020年 / 58卷 / 12期
基金
中国国家自然科学基金;
关键词
Measurement; Semantics; Remote sensing; Feature extraction; Training; Complexity theory; Encoding; Deep learning; dimensionality reduction; metric learning; remote sensing (RS) scene characterization; CLASSIFICATION; CHALLENGES;
D O I
10.1109/TGRS.2020.2991657
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
With the development of convolutional neural networks (CNNs), the semantic understanding of remote sensing (RS) scenes has been significantly improved based on their prominent feature encoding capabilities. While many existing deep-learning models focus on designing different architectures, only a few works in the RS field have focused on investigating the performance of the learned feature embeddings and the associated metric space. In particular, two main loss functions have been exploited: the contrastive and the triplet loss. However, the straightforward application of these techniques to RS images may not be optimal in order to capture their neighborhood structures in the metric space due to the insufficient sampling of image pairs or triplets during the training stage and to the inherent semantic complexity of remotely sensed data. To solve these problems, we propose a new deep metric learning approach, which overcomes the limitation on the class discrimination by means of two different components: 1) scalable neighborhood component analysis (SNCA) that aims at discovering the neighborhood structure in the metric space and 2) the cross-entropy loss that aims at preserving the class discrimination capability based on the learned class prototypes. Moreover, in order to preserve feature consistency among all the minibatches during training, a novel optimization mechanism based on momentum update is introduced for minimizing the proposed loss. An extensive experimental comparison (using several state-of-the-art models and two different benchmark data sets) has been conducted to validate the effectiveness of the proposed method from different perspectives, including: 1) classification; 2) clustering; and 3) image retrieval. The related codes of this article will be made publicly available for reproducible research by the community.
引用
收藏
页码:8905 / 8918
页数:14
相关论文
共 62 条
[1]  
[Anonymous], 2018, Urban remote sensing
[2]  
[Anonymous], 2015, Distilling the knowledge in a neural network
[3]  
[Anonymous], 2008, Introduction to information retrieval
[4]  
[Anonymous], 2006 IEEE COMP VIS P, DOI 10.1109/CVPR.2006.100
[5]  
[Anonymous], 2017, ARXIV171201511
[6]   Remote Sensing Image Retrieval With Global Morphological Texture Descriptors [J].
Aptoula, Erchan .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2014, 52 (05) :3023-3034
[7]   Very High-Resolution Remote Sensing: Challenges and Opportunities [J].
Benediktsson, Jon Atli ;
Chanussot, Jocelyn ;
Moon, Wooil M. .
PROCEEDINGS OF THE IEEE, 2012, 100 (06) :1907-1910
[8]   Bridging the Semantic Gap for Satellite Image Annotation and Automatic Mapping Applications [J].
Bratasanu, Dragos ;
Nedelcu, Ion ;
Datcu, Mihai .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2011, 4 (01) :193-204
[9]  
Cao R., 2019, ARXIV190205818
[10]   Deep Feature Fusion for VHR Remote Sensing Scene Classification [J].
Chaib, Souleyman ;
Liu, Huan ;
Gu, Yanfeng ;
Yao, Hongxun .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (08) :4775-4784