Integrating Gate and Attention Modules for High-Resolution Image Semantic Segmentation

被引:22
作者
Zheng, Zixian [1 ]
Zhang, Xueliang [1 ]
Xiao, Pengfeng [1 ]
Li, Zhenshi [1 ]
机构
[1] Nanjing Univ, Sch Geog & Ocean Sci, Key Lab Land Satellite Remote Sensing Applicat, Minist Nat Resources,Jiangsu Prov Key Lab Geog In, Nanjing 210023, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantics; Image segmentation; Feature extraction; Decoding; Remote sensing; Spatial resolution; Logic gates; Attention module (AM); gate module (GM); high-resolution (HR) remote sensing imagery; semantic segmentation; FULLY CONVOLUTIONAL NETWORKS; AERIAL; MULTISCALE; FUSION;
D O I
10.1109/JSTARS.2021.3071353
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Semantic segmentation of high-resolution (HR) remote sensing images achieved great progress by utilizing deep convolutional neural networks (DCNNs) in recent years. However, the decrease of resolution in the feature map of DCNNs brings about the loss of spatial information and thus leads to the blurring of object boundary and misclassification of small objects. In addition, the class imbalance and the high diversity of geographic objects in HR images exacerbate the performance. To deal with the above problems, we proposed an end-to-end DCNN network named GAMNet to balance the contradiction between global semantic information and local details. An integration of attention and gate module (GAM) is specially designed to simultaneously realize multiscale feature extraction and boundary recovery. The integration module can be inserted in an encoder-decoder network with skip connection. Meanwhile, a composite loss function is designed to achieve deep supervision of GAM by adding an auxiliary loss, which can help improve the effectiveness of the integration module. The performance of GAMNet is quantitatively evaluated on the ISPRS 2-D semantic labeling datasets and achieves state-of-the-art performance in comparison with other representative methods.
引用
收藏
页码:4530 / 4546
页数:17
相关论文
共 66 条
[1]  
[Anonymous], 2015, 1511 ARXIV
[2]  
[Anonymous], 2016, RefineNet: Multi-path refinement networks for high-resolution semantic segmentation
[3]  
[Anonymous], 2015, ARXIV
[4]   Beyond RGB: Very high resolution urban remote sensing with multimodal deep networks [J].
Audebert, Nicolas ;
Le Saux, Bertrand ;
Lefevre, Sebastien .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 140 :20-32
[5]   Semantic Segmentation of Earth Observation Data Using Multimodal and Multi-scale Deep Networks [J].
Audebert, Nicolas ;
Le Saux, Bertrand ;
Lefevre, Sebastien .
COMPUTER VISION - ACCV 2016, PT I, 2017, 10111 :180-196
[6]   HOW USEFUL IS REGION-BASED CLASSIFICATION OF REMOTE SENSING IMAGES IN A DEEP LEARNING FRAMEWORK ? [J].
Audebert, Nicolas ;
Le Saux, Bertrand ;
Lefevre, Sebastien .
2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, :5091-5094
[7]   End-to-End DSM Fusion Networks for Semantic Segmentation in High-Resolution Aerial Images [J].
Cao, Zhiying ;
Fu, Kun ;
Lu, Xiaode ;
Diao, Wenhui ;
Sun, Hao ;
Yan, Menglong ;
Yu, Hongfeng ;
Sun, Xian .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2019, 16 (11) :1766-1770
[8]   Aerial image semantic segmentation using DCNN predicted distance maps [J].
Chai, Dengfeng ;
Newsam, Shawn ;
Huang, Jingfeng .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 161 :309-322
[9]   Cloud and cloud shadow detection in Landsat imagery based on deep convolutional neural networks [J].
Chai, Dengfeng ;
Newsam, Shawn ;
Zhang, Hankui K. ;
Qiu, Yifan ;
Huang, Jingfeng .
REMOTE SENSING OF ENVIRONMENT, 2019, 225 :307-316
[10]   Symmetrical Dense-Shortcut Deep Fully Convolutional Networks for Semantic Segmentation of Very-High-Resolution Remote Sensing Images [J].
Chen, Guanzhou ;
Zhang, Xiaodong ;
Wang, Qing ;
Dai, Fan ;
Gong, Yuanfu ;
Zhu, Kun .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2018, 11 (05) :1633-1644