ABCNet: Attentive bilateral contextual network for efficient semantic segmentation of Fine-Resolution remotely sensed imagery

被引:231
作者
Li, Rui [1 ]
Zheng, Shunyi [1 ]
Zhang, Ce [2 ,3 ]
Duan, Chenxi [4 ,5 ]
Wang, Libo [1 ]
Atkinson, Peter M. [2 ,6 ,7 ]
机构
[1] Wuhan Univ, Sch Remote Sensing & Informat Engn, 129 Luoyu Rd, Wuhan 430079, Hubei, Peoples R China
[2] Univ Lancaster, Lancaster Environm Ctr, Lancaster LA1 4YQ, England
[3] UK Ctr Ecol & Hydrol, Lib Ave, Lancaster LA1 4AP, England
[4] Univ Twente, Fac Geoinformat Sci & Earth Observat ITC, Enschede, Netherlands
[5] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, 129 Luoyu Rd, Wuhan 430079, Hubei, Peoples R China
[6] Univ Southampton, Geog & Environm Sci, Southampton SO17 1BJ, Hants, England
[7] Chinese Acad Sci, Inst Geog Sci & Nat Resources Res, 11A Datun Rd, Beijing 100101, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantic Segmentation; Attention Mechanism; Bilateral Architecture; Convolutional Neural Network; Deep Learning; LAND-COVER; CLASSIFICATION; ALGORITHMS; FRAMEWORK; NET;
D O I
10.1016/j.isprsjprs.2021.09.005
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
Semantic segmentation of remotely sensed imagery plays a critical role in many real-world applications, such as environmental change monitoring, precision agriculture, environmental protection, and economic assessment. Following rapid developments in sensor technologies, vast numbers of fine-resolution satellite and airborne remote sensing images are now available, for which semantic segmentation is potentially a valuable method. However, because of the rich complexity and heterogeneity of information provided with an ever-increasing spatial resolution, state-of-the-art deep learning algorithms commonly adopt complex network structures for segmentation, which often result in significant computational demand. Particularly, the frequently-used fully convolutional network (FCN) relies heavily on fine-grained spatial detail (fine spatial resolution) and contextual information (large receptive fields), both imposing high computational costs. This impedes the practical utility of FCN for real-world applications, especially those requiring real-time data processing. In this paper, we propose a novel Attentive Bilateral Contextual Network (ABCNet), a lightweight convolutional neural network (CNN) with a spatial path and a contextual path. Extensive experiments, including a comprehensive ablation study, demonstrate that ABCNet has strong discrimination capability with competitive accuracy compared with stateof-the-art benchmark methods while achieving significantly increased computational efficiency. Specifically, the proposed ABCNet achieves a 91.3% overall accuracy (OA) on the Potsdam test dataset and outperforms all lightweight benchmark methods significantly. The code is freely available at https;//github.com./lironui/ABCNet.
引用
收藏
页码:84 / 98
页数:15
相关论文
共 77 条
[1]   Big earth observation time series analysis for monitoring Brazilian agriculture [J].
Araujo Picoli, Michelle Cristina ;
Camara, Gilberto ;
Sanches, Ieda ;
Simoes, Rolf ;
Carvalho, Alexandre ;
Maciel, Adeline ;
Coutinho, Alexandre ;
Esquerdo, Julio ;
Antunes, Joao ;
Begotti, Rodrigo Anzolin ;
Arvor, Damien ;
Almeida, Claudio .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 145 :328-339
[2]   Beyond RGB: Very high resolution urban remote sensing with multimodal deep networks [J].
Audebert, Nicolas ;
Le Saux, Bertrand ;
Lefevre, Sebastien .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 140 :20-32
[3]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[4]   Attention Augmented Convolutional Networks [J].
Bello, Irwan ;
Zoph, Barret ;
Vaswani, Ashish ;
Shlens, Jonathon ;
Le, Quoc V. .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3285-3294
[5]  
Caccetta P, 2020, ARXIV PREPRINT ARXIV
[6]  
Cao Y, 2019, IEEE ICC
[7]  
Chen L. C., 2014, ICLR
[8]   Deep Cross-Modal Audio-Visual Generation [J].
Chen, Lele ;
Srivastava, Sudhanshu ;
Duan, Zhiyao ;
Xu, Chenliang .
PROCEEDINGS OF THE THEMATIC WORKSHOPS OF ACM MULTIMEDIA 2017 (THEMATIC WORKSHOPS'17), 2017, :349-357
[9]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[10]   SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning [J].
Chen, Long ;
Zhang, Hanwang ;
Xiao, Jun ;
Nie, Liqiang ;
Shao, Jian ;
Liu, Wei ;
Chua, Tat-Seng .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6298-6306