Deep Attention and Multi-Scale Networks for Accurate Remote Sensing Image Segmentation

被引：39

作者：

Qi, Xingqun ^{[1
,2
]}

Li, Kaiqi ^{[1
,2
]}

Liu, Pengkun ^{[1
,2
]}

Zhou, Xiaoguang ^{[3
]}

Sun, Muyi ^{[1
,2
]}

机构：

[1] Beijing Univ Posts & Telecommun, Sch Automat, Beijing 100876, Peoples R China

[2] Minist Educ, Engn Res Ctr Informat Network, Beijing 100876, Peoples R China

[3] Minjiang Univ, Fuzhou 350108, Fujian, Peoples R China

来源：

IEEE ACCESS | 2020年 / 8卷 / 08期

关键词：

Remote sensing; Image segmentation; Feature extraction; Semantics; Image resolution; Convolution; Roads; Remote sensing image; convolutional neural network; semantic segmentation; attention; multi-scale; dense upsampling convolution; SEMANTIC SEGMENTATION; CLASSIFICATION;

D O I：

10.1109/ACCESS.2020.3015587

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Remote sensing image segmentation is a challenging task in remote sensing image analysis. Remote sensing image segmentation has great significance in urban planning, crop planting, and other fields that need plentiful information about the land. Technically, this task suffers from the ultra-high resolution, large shooting angle, and feature complexity of the remote sensing images. To address these issues, we propose a deep learning-based network called ATD-LinkNet with several customized modules. Specifically, we propose a replaceable module named AT block using multi-scale convolution and attention mechanism as the building block in ATD-LinkNet. AT block fuses different scale features and effectively utilizes the abundant spatial and semantic information in remote sensing images. To refine the nonlinear boundaries of internal objects in remote sensing images, we adopt the dense upsampling convolution in the decoder part of ATD-LinkNet. Experimentally, we enforce sufficient comparative experiments on two public remote sensing datasets (Potsdam and DeepGlobe Road Extraction). The results show our ATD-LinkNet achieves better performance against most state-of-the-art networks. We obtain 89.0% for pixel-level accuracy in the Potsdam dataset and 62.68% for mean Intersection over Union in the DeepGlobe Road Extraction dataset.

引用

页码：146627 / 146639

页数：13

共 69 条

[1]

[Anonymous], 2015, 1511 ARXIV

[2]

[Anonymous], 2017, 2017 2 INT C MECH, DOI DOI 10.1109/ICMCCE.2017.49

[3]

[Anonymous], 2015, ARXIV 1502 03167

[4]

[Anonymous], 2015, PROC CVPR IEEE

[5]

[Anonymous], 2017, ARXIV PREPRINT ARXIV

[6]

[Anonymous], 2019, INT J ADV ROBOT SYST

[7]

[Anonymous], 2015, LECT NOTES COMPUT SC, DOI DOI 10.1007/978-3-319-24574-4_28

[8] Deep Machine Learning-A New Frontier in Artificial Intelligence Research [J].

Arel, Itamar ;

Rose, Derek C. ;

Karnowski, Thomas P. .

IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2010, 5 (04) :13-18

[9] Joint Learning from Earth Observation and OpenStreetMap Data to Get Faster Better Semantic Maps [J].

Audebert, Nicolas ;

Le Saux, Bertrand ;

Lefevre, Sebastien .

2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, :1552-1560

[10] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].

Badrinarayanan, Vijay ;

Kendall, Alex ;

Cipolla, Roberto .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495

← 1 2 3 4 5 6 7 →