Real-Time Semantic Segmentation of Remote Sensing Images Based on Bilateral Attention Refined Network

被引:8
作者
Cai, Jiali [1 ]
Liu, Chunjuan [1 ]
Yan, Haowen [1 ]
Wu, Xiaosuo [1 ,2 ,3 ]
Lu, Wanzhen [1 ]
Wang, Xiaoyu [1 ]
Sang, Changlin [1 ]
机构
[1] Lanzhou Jiaotong Univ, Sch Elect & Informat Engn, Lanzhou 730070, Peoples R China
[2] Gansu Acad Sci, Inst Sensor Technol, Lanzhou 730070, Peoples R China
[3] Lanzhou Jiaotong Univ, Key Lab Opt Technol & Intelligent Control, Minist Educ, Lanzhou 730070, Peoples R China
关键词
Semantics; Feature extraction; Convolution; Licenses; Convolutional codes; Task analysis; Spatial resolution; Remote sensing image; real-time semantic segmentation; local attention enhancement module; global context embedding module; multi-label category loss; PREDICTION;
D O I
10.1109/ACCESS.2021.3058571
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The trade-off between feature representation capability and spatial positioning accuracy is crucial to dense classification or semantic segmentation of remote sensing images. In order to better balance the low-level spatial details in the shallow network and the high-level abstract semantics in the deep network, the bilateral attention refinement lightweight network BARNet is introduced. In this way, we can use the fine-grained features in the shallow layer to further supplement and capture the deeper information of the high-level semantic features. The network employs an asymmetric encoder decoder architecture for the task of real-time semantic segmentation. Encoder part proposes a lightweight network residual unit with the split, concatenate and split bottleneck structure to achieve more light weighted, effificient and powerful feature extraction. In the decoding section, we propose an adaptive method to enhance feature representation in local attention enhancement module. In addition, the global context embedding module is introduced to divide the high-level features into two branches. One branch gets the weight vector to guide the low-level learning, and the other branch will get a semantic vector, which is used to calculate the multi-label category loss and further introduce into the overall loss function to regulate the training process better. The effectiveness and efficiency of the network are verified on ISPRS Potsdam data set and CCF data set, respectively. The results show that the models using these strategies outperform the baseline network on MIoU, PA and F1, which increase by 18.86%, 16.21% and 15.64% on the Potsdam dataset; 10.51%, 6.53% and 8.19% on the CCF dataset.
引用
收藏
页码:28349 / 28360
页数:12
相关论文
共 44 条
[31]  
Shen L., 2018, PROC CVPR IEEE, P7132
[32]   Lightweight Prediction and Boundary Attention-Based Semantic Segmentation for Road Scene Understanding [J].
Sun, Jee-Young ;
Jung, Seung-Won ;
Ko, Sung-Jea .
IEEE ACCESS, 2020, 8 :108449-108460
[33]  
Wang L., 2019, INT CONF WIRE COMMUN, P1
[34]  
Wang Y, 2019, IEEE IMAGE PROC, P1860, DOI [10.1109/ICIP.2019.8803154, 10.1109/icip.2019.8803154]
[35]   CBAM: Convolutional Block Attention Module [J].
Woo, Sanghyun ;
Park, Jongchan ;
Lee, Joon-Young ;
Kweon, In So .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :3-19
[36]   DANet: Divergent Activation for Weakly Supervised Object Localization [J].
Xue, Haolan ;
Liu, Chang ;
Wan, Fang ;
Jiao, Jianbin ;
Ji, Xiangyang ;
Ye, Qixiang .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6588-6597
[37]  
Yu C., 2020, ARXIV200402147
[38]   BiSeNet: Bilateral Segmentation Network for Real-Time Semantic Segmentation [J].
Yu, Changqian ;
Wang, Jingbo ;
Peng, Chao ;
Gao, Changxin ;
Yu, Gang ;
Sang, Nong .
COMPUTER VISION - ECCV 2018, PT XIII, 2018, 11217 :334-349
[39]   Learning a Discriminative Feature Network for Semantic Segmentation [J].
Yu, Changqian ;
Wang, Jingbo ;
Peng, Chao ;
Gao, Changxin ;
Yu, Gang ;
Sang, Nong .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :1857-1866
[40]   ACFNet: Attentional Class Feature Network for Semantic Segmentation [J].
Zhang, Fan ;
Chen, Yanqin ;
Li, Zhihang ;
Hong, Zhibin ;
Liu, Jingtuo ;
Ma, Feifei ;
Han, Junyu ;
Ding, Errui .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6797-6806