ABCNet: Attentive bilateral contextual network for efficient semantic segmentation of Fine-Resolution remotely sensed imagery

被引：231

作者：

Li, Rui ^{[1
]}

Zheng, Shunyi ^{[1
]}

Zhang, Ce ^{[2
,3
]}

Duan, Chenxi ^{[4
,5
]}

Wang, Libo ^{[1
]}

Atkinson, Peter M. ^{[2
,6
,7
]}

机构：

[1] Wuhan Univ, Sch Remote Sensing & Informat Engn, 129 Luoyu Rd, Wuhan 430079, Hubei, Peoples R China

[2] Univ Lancaster, Lancaster Environm Ctr, Lancaster LA1 4YQ, England

[3] UK Ctr Ecol & Hydrol, Lib Ave, Lancaster LA1 4AP, England

[4] Univ Twente, Fac Geoinformat Sci & Earth Observat ITC, Enschede, Netherlands

[5] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, 129 Luoyu Rd, Wuhan 430079, Hubei, Peoples R China

[6] Univ Southampton, Geog & Environm Sci, Southampton SO17 1BJ, Hants, England

[7] Chinese Acad Sci, Inst Geog Sci & Nat Resources Res, 11A Datun Rd, Beijing 100101, Peoples R China

来源：

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING | 2021年 / 181卷

基金：

中国国家自然科学基金;

关键词：

Semantic Segmentation; Attention Mechanism; Bilateral Architecture; Convolutional Neural Network; Deep Learning; LAND-COVER; CLASSIFICATION; ALGORITHMS; FRAMEWORK; NET;

D O I：

10.1016/j.isprsjprs.2021.09.005

中图分类号：

P9 [自然地理学];

学科分类号：

0705 ; 070501 ;

摘要：

Semantic segmentation of remotely sensed imagery plays a critical role in many real-world applications, such as environmental change monitoring, precision agriculture, environmental protection, and economic assessment. Following rapid developments in sensor technologies, vast numbers of fine-resolution satellite and airborne remote sensing images are now available, for which semantic segmentation is potentially a valuable method. However, because of the rich complexity and heterogeneity of information provided with an ever-increasing spatial resolution, state-of-the-art deep learning algorithms commonly adopt complex network structures for segmentation, which often result in significant computational demand. Particularly, the frequently-used fully convolutional network (FCN) relies heavily on fine-grained spatial detail (fine spatial resolution) and contextual information (large receptive fields), both imposing high computational costs. This impedes the practical utility of FCN for real-world applications, especially those requiring real-time data processing. In this paper, we propose a novel Attentive Bilateral Contextual Network (ABCNet), a lightweight convolutional neural network (CNN) with a spatial path and a contextual path. Extensive experiments, including a comprehensive ablation study, demonstrate that ABCNet has strong discrimination capability with competitive accuracy compared with stateof-the-art benchmark methods while achieving significantly increased computational efficiency. Specifically, the proposed ABCNet achieves a 91.3% overall accuracy (OA) on the Potsdam test dataset and outperforms all lightweight benchmark methods significantly. The code is freely available at https;//github.com./lironui/ABCNet.

引用

页码：84 / 98

页数：15

共 77 条

[1] Big earth observation time series analysis for monitoring Brazilian agriculture [J].

Araujo Picoli, Michelle Cristina ;

Camara, Gilberto ;

Sanches, Ieda ;

Simoes, Rolf ;

Carvalho, Alexandre ;

Maciel, Adeline ;

Coutinho, Alexandre ;

Esquerdo, Julio ;

Antunes, Joao ;

Begotti, Rodrigo Anzolin ;

Arvor, Damien ;

Almeida, Claudio .

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 145 :328-339

[2] Beyond RGB: Very high resolution urban remote sensing with multimodal deep networks [J].

Audebert, Nicolas ;

Le Saux, Bertrand ;

Lefevre, Sebastien .

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 140 :20-32

[3] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].

Badrinarayanan, Vijay ;

Kendall, Alex ;

Cipolla, Roberto .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495

[4] Attention Augmented Convolutional Networks [J].

Bello, Irwan ;

Zoph, Barret ;

Vaswani, Ashish ;

Shlens, Jonathon ;

Le, Quoc V. .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3285-3294

[5]

Caccetta P, 2020, ARXIV PREPRINT ARXIV

[6]

Cao Y, 2019, IEEE ICC

[7]

Chen L. C., 2014, ICLR

[8] Deep Cross-Modal Audio-Visual Generation [J].

Chen, Lele ;

Srivastava, Sudhanshu ;

Duan, Zhiyao ;

Xu, Chenliang .

PROCEEDINGS OF THE THEMATIC WORKSHOPS OF ACM MULTIMEDIA 2017 (THEMATIC WORKSHOPS'17), 2017, :349-357

[9] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].

Chen, Liang-Chieh ;

Zhu, Yukun ;

Papandreou, George ;

Schroff, Florian ;

Adam, Hartwig .

COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851

[10] SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning [J].

Chen, Long ;

Zhang, Hanwang ;

Xiao, Jun ;

Nie, Liqiang ;

Shao, Jian ;

Liu, Wei ;

Chua, Tat-Seng .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6298-6306

← 1 2 3 4 5 6 7 8 →