Data-Efficient Domain Adaptation for Semantic Segmentation of Aerial Imagery Using Generative Adversarial Networks

被引:27
作者
Benjdira, Bilel [1 ,2 ]
Ammar, Adel [1 ]
Koubaa, Anis [1 ,3 ]
Ouni, Kais [2 ]
机构
[1] Prince Sultan Univ, Coll Comp & Informat Sci, Robot & Internet Things Lab, Riyadh 11586, Saudi Arabia
[2] Univ Carthage, Natl Engn Sch Carthage, Res Lab Smart Elect & ICT, SEICT,LR18ES44, Tunis 2035, Tunisia
[3] Polytech Inst Porto, ISEP, INESC TEC, CISTER, P-4200465 Porto, Portugal
来源
APPLIED SCIENCES-BASEL | 2020年 / 10卷 / 03期
关键词
deep learning; domain adaptation; semantic segmentation; generative adversarial networks; convolutional neural networks; aerial imagery; NEURAL-NETWORKS;
D O I
10.3390/app10031092
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Despite the significant advances noted in semantic segmentation of aerial imagery, a considerable limitation is blocking its adoption in real cases. If we test a segmentation model on a new area that is not included in its initial training set, accuracy will decrease remarkably. This is caused by the domain shift between the new targeted domain and the source domain used to train the model. In this paper, we addressed this challenge and proposed a new algorithm that uses Generative Adversarial Networks (GAN) architecture to minimize the domain shift and increase the ability of the model to work on new targeted domains. The proposed GAN architecture contains two GAN networks. The first GAN network converts the chosen image from the target domain into a semantic label. The second GAN network converts this generated semantic label into an image that belongs to the source domain but conserves the semantic map of the target image. This resulting image will be used by the semantic segmentation model to generate a better semantic label of the first chosen image. Our algorithm is tested on the ISPRS semantic segmentation dataset and improved the global accuracy by a margin up to 24% when passing from Potsdam domain to Vaihingen domain. This margin can be increased by addition of other labeled data from the target domain. To minimize the cost of supervision in the translation process, we proposed a methodology to use these labeled data efficiently.
引用
收藏
页数:24
相关论文
共 53 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]   Convolutional Neural Networks for Electrocardiogram Classification [J].
Al Rahhal, Mohamad M. ;
Bazi, Yakoub ;
Al Zuair, Mansour ;
Othman, Esam ;
BenJdira, Bilel .
JOURNAL OF MEDICAL AND BIOLOGICAL ENGINEERING, 2018, 38 (06) :1014-1025
[3]   Multiple Object Scene Description for the Visually Impaired Using Pre-trained Convolutional Neural Networks [J].
Alhichri, Haikel ;
Bin Jdira, Bilel ;
Bazi, Yacoub ;
Alajlan, Naif .
IMAGE ANALYSIS AND RECOGNITION (ICIAR 2016), 2016, 9730 :290-295
[4]   Deep Learning Approach for Car Detection in UAV Imagery [J].
Ammour, Nassim ;
Alhichri, Haikel ;
Bazi, Yakoub ;
Benjdira, Bilel ;
Alajlan, Naif ;
Zuair, Mansour .
REMOTE SENSING, 2017, 9 (04)
[5]  
[Anonymous], ARXIV14097495
[6]  
[Anonymous], 2017, P 2017 IEEE INT C CO
[7]  
[Anonymous], ICML
[8]  
[Anonymous], 2017, P 2017 IEEE C COMP V
[9]  
[Anonymous], ARXIV161202649
[10]  
[Anonymous], ARXIV170100160