Cross-domain road detection based on global-local adversarial learning framework from very high resolution satellite imagery

被引：37

作者：

Lu, Xiaoyan ^{[1
]}

Zhong, Yanfei ^{[1
,2
]}

Zheng, Zhuo ^{[1
]}

Wang, Junjue ^{[1
]}

机构：

[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan 430079, Peoples R China

[2] Wuhan Univ, Hubei Prov Engn Res Ctr Nat Resources Remote Sens, Wuhan 430079, Peoples R China

来源：

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING | 2021年 / 180卷

基金：

中国国家自然科学基金;

关键词：

Road detection; Remote sensing; Global-local; Adversarial learning framework; Cross-domain; DETECTION NETWORK; MULTISCALE; EXTRACTION; INFORMATION; TRACKING;

D O I：

10.1016/j.isprsjprs.2021.08.018

中图分类号：

P9 [自然地理学];

学科分类号：

0705 ; 070501 ;

摘要：

Road detection based on convolutional neural networks (CNNs) has achieved remarkable performances for very high resolution (VHR) remote sensing images. However, this approach relies on massive annotated samples, and the problem of limited generalization for unseen images still remains. The manual pixel-level labeling process is also extremely time-consuming, and the performance of CNNs degrades significantly when there is a domain gap between the training and test images. In this paper, to address this problem, a global-local adversarial learning (GOAL) framework is proposed for cross-domain road detection. On the one hand, considering the spatial information similarities between the source and target domains, feature space driven adversarial learning is applied to explore the shared features across domains. On the other hand, the complex background of VHR remote sensing images, such as the occlusions and shadows of trees and buildings, makes some roads easy to recognize, while others are much more difficult. However, the traditional global adversarial learning approach cannot guarantee local semantic consistency. Therefore, a local alignment operation is introduced, which adaptively adjusts the weight of the adversarial loss according to the road recognition difficulty. Extensive experiments were conducted on different road datasets, including two public competition road datasets-SpaceNet and DeepGlobe-and our own large-scale annotated images from four cities: Boston, Birmingham, Shanghai, and Wuhan. The experimental results show that the proposed GOAL framework can clearly improve the cross-domain road detection performance, without any annotation of the target domain images. For instance, taking SpaceNet road dataset as the source domain, compared with the no adaptation method, the IOU performance of GOAL framework is increased by 14.36%, 5.49%, 4.51%, 5.63% and 15.14% on DeepGlobe, Boston, Birmingham, Shanghai, and Wuhan images, respectively, which demonstrates its strong generalization capability.

引用

页码：296 / 312

页数：17

共 59 条

[1] RoadTracer: Automatic Extraction of Road Networks from Aerial Images [J].

Bastani, Favyen ;

He, Songtao ;

Abbar, Sofiane ;

Alizadeh, Mohammad ;

Balakrishnan, Hari ;

Chawla, Sanjay ;

Madden, Sam ;

DeWitt, David .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4720-4728

[2] Improved Road Connectivity by Joint Learning of Orientation and Segmentation [J].

Batra, Anil ;

Singh, Suriya ;

Pang, Guan ;

Basu, Saikat ;

Jawahar, C., V ;

Paluri, Manohar .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :10377-10385

[3] Unsupervised Domain Adaptation Using Generative Adversarial Networks for Semantic Segmentation of Aerial Images [J].

Benjdira, Bilel ;

Bazi, Yakoub ;

Koubaa, Anis ;

Ouni, Kais .

REMOTE SENSING, 2019, 11 (11)

[4] Large-Scale Machine Learning with Stochastic Gradient Descent [J].

Bottou, Leon .

COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, :177-186

[5] All about Structure: Adapting Structural Information across Domains for Boosting Semantic Segmentation [J].

Chang, Wei-Lun ;

Wang, Hui-Po ;

Peng, Wen-Hsiao ;

Chiu, Wei-Chen .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1900-1909

[6] Deep Cross-Modal Audio-Visual Generation [J].

Chen, Lele ;

Srivastava, Sudhanshu ;

Duan, Zhiyao ;

Xu, Chenliang .

PROCEEDINGS OF THE THEMATIC WORKSHOPS OF ACM MULTIMEDIA 2017 (THEMATIC WORKSHOPS'17), 2017, :349-357

[7] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].

Chen, Liang-Chieh ;

Papandreou, George ;

Kokkinos, Iasonas ;

Murphy, Kevin ;

Yuille, Alan L. .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848

[8] Learning Semantic Segmentation from Synthetic Data: A Geometrically Guided Input-Output Adaptation Approach [J].

Chen, Yuhua ;

Li, Wen ;

Chen, Xiaoran ;

Van Gool, Luc .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1841-1850

[9] A Survey on Deep Transfer Learning [J].

Tan, Chuanqi ;

Sun, Fuchun ;

Kong, Tao ;

Zhang, Wenchang ;

Yang, Chao ;

Liu, Chunfang .

ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT III, 2018, 11141 :270-279

[10]

Constantin A, 2018, 2018 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS (APCCAS 2018), P423, DOI 10.1109/APCCAS.2018.8605652

← 1 2 3 4 5 6 →