Multistage Attention ResU-Net for Semantic Segmentation of Fine-Resolution Remote Sensing Images

被引:188
作者
Li, Rui [1 ]
Zheng, Shunyi [1 ]
Duan, Chenxi [2 ]
Su, Jianlin [3 ]
Zhang, Ce [4 ,5 ]
机构
[1] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan 430079, Peoples R China
[2] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan 430079, Peoples R China
[3] Shenzhen Zhuiyi Technol Co Ltd, Shenzhen 518054, Peoples R China
[4] Univ Lancaster, Lancaster Environm Ctr, Lancaster LA1 4YQ, England
[5] UK Ctr Ecol & Hydrol, Lancaster LA1 4AP, England
关键词
Semantics; Complexity theory; Remote sensing; Task analysis; Image segmentation; Feature extraction; Decoding; Fine-resolution remote sensing images; linear attention mechanism (LAM); semantic segmentation;
D O I
10.1109/LGRS.2021.3063381
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
The attention mechanism can refine the extracted feature maps and boost the classification performance of the deep network, which has become an essential technique in computer vision and natural language processing. However, the memory and computational costs of the dot-product attention mechanism increase quadratically with the spatiotemporal size of the input. Such growth hinders the usage of attention mechanisms considerably in application scenarios with large-scale inputs. In this letter, we propose a linear attention mechanism (LAM) to address this issue, which is approximately equivalent to dot-product attention with computational efficiency. Such a design makes the incorporation between attention mechanisms and deep networks much more flexible and versatile. Based on the proposed LAM, we refactor the skip connections in the raw U-Net and design a multistage attention ResU-Net (MAResU-Net) for semantic segmentation from fine-resolution remote sensing images. Experiments conducted on the Vaihingen data set demonstrated the effectiveness and efficiency of our MAResU-Net. Our code is available at https://github.com/lironui/MAResU-Net.
引用
收藏
页数:5
相关论文
共 23 条
[11]   Synchrosqueezing Polynomial Chirplet Transform and Its Application in Tight Sandstone Gas Reservoir Identification [J].
Li, Rui ;
Chen, Hui ;
Fang, Yuxia ;
Hu, Ying ;
Chen, Xuping ;
Li, Jun .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[12]   Classification of Hyperspectral Image Based on Double-Branch Dual-Attention Mechanism Network [J].
Li, Rui ;
Zheng, Shunyi ;
Duan, Chenxi ;
Yang, Yang ;
Wang, Xiqi .
REMOTE SENSING, 2020, 12 (03)
[13]   U-Net: Convolutional Networks for Biomedical Image Segmentation [J].
Ronneberger, Olaf ;
Fischer, Philipp ;
Brox, Thomas .
MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, PT III, 2015, 9351 :234-241
[14]   Efficient Attention: Attention with Linear Complexities [J].
Shen Zhuoran ;
Zhang Mingyuan ;
Zhao Haiyu ;
Yi Shuai ;
Li Hongsheng .
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, :3530-3538
[15]  
Vaswani A, 2017, ADV NEUR IN, V30
[16]   Deep Parametric Continuous Convolutional Neural Networks [J].
Wang, Shenlong ;
Suo, Simon ;
Ma, Wei-Chiu ;
Pokrovsky, Andrei ;
Urtasun, Raquel .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :2589-2597
[17]  
Yu Fisher, 2015, ARXIV
[18]  
Yuan Yuhui, 2018, arXiv
[19]   Identifying and mapping individual plants in a highly diverse high-elevation ecosystem using UAV imagery and deep learning [J].
Zhang, Ce ;
Atkinson, Peter M. ;
George, Charles ;
Wen, Zhaofei ;
Diazgranados, Mauricio ;
Gerard, France .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 169 :280-291
[20]   Co-occurrent Features in Semantic Segmentation [J].
Zhang, Hang ;
Zhang, Han ;
Wang, Chenguang ;
Xie, Junyuan .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :548-557