Context Aggregation Network for Semantic Labeling in Aerial Images

Cited by: 32
Authors
Cheng, Wensheng [1 ,2 ]
Yang, Wen [1 ,2 ]
Wang, Min [2 ]
Wang, Gang [2 ]
Chen, Jinyong [2 ]
Affiliations
[1] Wuhan Univ, Sch Elect Informat, Wuhan 430072, Hubei, Peoples R China
[2] CETC Key Lab Aerosp Informat Applicat, Shijiazhuang 050081, Hebei, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
convolutional neural networks; semantic labeling; context aggregation; channel attention; residual convolution; aerial images; REMOTE-SENSING IMAGERY; CLASSIFICATION; CNN;
DOI
10.3390/rs11101158
Chinese Library Classification
X [Environmental Science, Safety Science];
Subject classification code
08 ; 0830 ;
Abstract
Semantic labeling of high-resolution aerial images is a fundamental task in remote sensing image analysis. It is widely used in land-use surveys, change detection, and environmental protection. Recent research has shown the superiority of Convolutional Neural Networks (CNNs) for this task. However, multi-scale object recognition and accurate object localization remain two major problems for CNN-based semantic labeling methods in high-resolution aerial images. To handle these problems, we design a Context Fuse Module, composed of parallel convolutional layers with kernels of different sizes and a global pooling branch, to aggregate context information at multiple scales. We propose an Attention Mix Module, which uses a channel-wise attention mechanism to combine multi-level features for higher localization accuracy. We further employ a Residual Convolutional Module to refine features at all feature levels. Based on these modules, we construct a new end-to-end network for semantic labeling in aerial images. We evaluate the proposed network on the ISPRS Vaihingen and Potsdam datasets. Experimental results demonstrate that our network outperforms competing methods on both datasets using only raw image data.
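The two main ideas in the abstract, parallel branches with different kernel sizes plus a global pooling branch, and channel-wise attention for mixing feature levels, can be illustrated with a small NumPy sketch. This is not the authors' implementation: the abstract does not give layer widths, kernel sizes, or the exact wiring, so the function names (`conv2d_same`, `context_fuse`, `attention_mix`), the kernel sizes (1, 3, 5), the squeeze-and-excitation-style gating, and the random weights are all assumptions made for illustration.

```python
import numpy as np


def conv2d_same(x, w):
    """Naive stride-1, zero-padded convolution (cross-correlation form).

    x: (C_in, H, W) feature map; w: (C_out, C_in, k, k) with odd k.
    """
    c_out, _, k, _ = w.shape
    pad = k // 2
    _, h, wd = x.shape
    xp = np.pad(x, ((0, 0), (pad, pad), (pad, pad)))
    out = np.zeros((c_out, h, wd))
    for i in range(k):
        for j in range(k):
            patch = xp[:, i:i + h, j:j + wd]  # shifted view, (C_in, H, W)
            out += np.einsum("oc,chw->ohw", w[:, :, i, j], patch)
    return out


def context_fuse(x, weights):
    """Assumed context-fuse step: parallel convs + global average pooling.

    Each entry of `weights` is one branch with its own kernel size. The
    global branch averages over space and is broadcast back to (C, H, W).
    All branches are concatenated along the channel axis.
    """
    branches = [np.maximum(conv2d_same(x, w), 0.0) for w in weights]  # ReLU
    global_ctx = x.mean(axis=(1, 2), keepdims=True)
    branches.append(np.broadcast_to(global_ctx, x.shape))
    return np.concatenate(branches, axis=0)


def attention_mix(low, high, w1, w2):
    """Assumed channel-attention mix of low- and high-level features.

    Both inputs are (C, H, W). A per-channel gate in (0, 1) is computed
    from the globally pooled, concatenated features and used to reweight
    the low-level channels before adding the high-level features.
    """
    squeezed = np.concatenate([low, high], axis=0).mean(axis=(1, 2))  # (2C,)
    hidden = np.maximum(w1 @ squeezed, 0.0)
    gate = 1.0 / (1.0 + np.exp(-(w2 @ hidden)))  # sigmoid, shape (C,)
    return low * gate[:, None, None] + high


rng = np.random.default_rng(0)
x = rng.standard_normal((4, 8, 8))
weights = [rng.standard_normal((4, 4, k, k)) * 0.1 for k in (1, 3, 5)]
fused = context_fuse(x, weights)
print(fused.shape)  # (16, 8, 8): three conv branches of 4 channels + 4 pooled

low = rng.standard_normal((4, 8, 8))
high = rng.standard_normal((4, 8, 8))
w1 = rng.standard_normal((2, 8)) * 0.1
w2 = rng.standard_normal((4, 2)) * 0.1
mixed = attention_mix(low, high, w1, w2)
print(mixed.shape)  # (4, 8, 8)
```

In a trained network the weights would be learned and the convolutions would come from a deep learning framework; the loops here only make the data flow explicit.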
Pages: 19