Siamese-GAN: Learning Invariant Representations for Aerial Vehicle Image Categorization

被引:49
作者
Bashmal, Laila [1 ]
Bazi, Yakoub [1 ]
AlHichri, Haikel [1 ]
AlRahhal, Mohamad M. [2 ]
Ammour, Nassim [1 ]
Alajlan, Naif [1 ]
机构
[1] King Saud Univ, Dept Comp Engn, Coll Comp & Informat Sci, Riyadh 11543, Saudi Arabia
[2] King Saud Univ, Dept Informat Sci, Coll Appl Comp Sci, Riyadh 11543, Saudi Arabia
关键词
manned/unmanned aerial vehicles (MAV/UAV); extremely high resolution (EHR) images; distribution mismatch; generative adversarial networks (GANs); Siamese encoder-decoder; SCENE CLASSIFICATION; HIERARCHICAL FEATURES; NETWORKS; DOMAIN;
D O I
10.3390/rs10020351
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
In this paper, we present a new algorithm for cross-domain classification in aerial vehicle images based on generative adversarial networks (GANs). The proposed method, called Siamese-GAN, learns invariant feature representations for both labeled and unlabeled images coming from two different domains. To this end, we train in an adversarial manner a Siamese encoder-decoder architecture coupled with a discriminator network. The encoder-decoder network has the task of matching the distributions of both domains in a shared space regularized by the reconstruction ability, while the discriminator seeks to distinguish between them. After this phase, we feed the resulting encoded labeled and unlabeled features to another network composed of two fully-connected layers for training and classification, respectively. Experiments on several cross-domain datasets composed of extremely high resolution (EHR) images acquired by manned/unmanned aerial vehicles (MAV/UAV) over the cities of Vaihingen, Toronto, Potsdam, and Trento are reported and discussed.
引用
收藏
页数:19
相关论文
共 53 条
[1]  
[Anonymous], 2008, ICML 08, DOI 10.1145/1390156.1390294
[2]  
[Anonymous], Photorealistic single image superresolution using a generative adversarial network.Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition
[3]  
2017:46814690
[4]   Unsupervised Pixel-Level Domain Adaptation with Generative Adversarial Networks [J].
Bousmalis, Konstantinos ;
Silberman, Nathan ;
Dohan, David ;
Erhan, Dumitru ;
Krishnan, Dilip .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :95-104
[5]   Efficient Training of Convolutional Deep Belief Networks in the Frequency Domain for Application to High-Resolution 2D and 3D Images [J].
Brosch, Tom ;
Tam, Roger .
NEURAL COMPUTATION, 2015, 27 (01) :211-227
[6]   Deep Feature Fusion for VHR Remote Sensing Scene Classification [J].
Chaib, Souleyman ;
Liu, Huan ;
Gu, Yanfeng ;
Yao, Hongxun .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (08) :4775-4784
[7]   Pyramid of Spatial Relatons for Scene-Level Land Use Classification [J].
Chen, Shizhi ;
Tian, YingLi .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2015, 53 (04) :1947-1957
[8]   Effective and Efficient Midlevel Visual Elements-Oriented Land-Use Classification Using VHR Remote Sensing Images [J].
Cheng, Gong ;
Han, Junwei ;
Guo, Lei ;
Liu, Zhenbao ;
Bu, Shuhui ;
Ren, Jinchang .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2015, 53 (08) :4238-4249
[9]   Unsupervised Feature Learning for Aerial Scene Classification [J].
Cheriyadat, Anil M. .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2014, 52 (01) :439-451
[10]   Learning Hierarchical Features for Scene Labeling [J].
Farabet, Clement ;
Couprie, Camille ;
Najman, Laurent ;
LeCun, Yann .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1915-1929