GVANet: A Grouped Multiview Aggregation Network for Remote Sensing Image Segmentation

被引:1
作者
Yang, Yunsong [1 ]
Li, Jinjiang [1 ]
Chen, Zheng [1 ]
Ren, Lu [1 ]
机构
[1] Shandong Technol & Business Univ, Sch Comp Sci & Technol, Yantai 264005, Peoples R China
基金
中国国家自然科学基金;
关键词
Remote sensing; Feature extraction; Accuracy; Semantic segmentation; Semantics; Convolutional neural networks; Buildings; Attention mechanism; multiscale fusion; remote sensing; semantic segmentation; transformer; CONVOLUTIONAL NEURAL-NETWORKS; SEMANTIC SEGMENTATION; CLASSIFICATION;
D O I
10.1109/JSTARS.2024.3459958
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In remote sensing image segmentation tasks, various challenges arise, including difficulties in recognizing objects due to differences in perspective, difficulty in distinguishing objects with similar colors, and challenges in segmentation caused by occlusions. To address these issues, we propose a method called the grouped multiview aggregation network (GVANet), which leverages multiview information for image analysis. This approach enables global multiview expansion and fine-grained cross-layer information interaction within the network. Within this network framework, to better utilize a wider range of multiview information to tackle challenges in remote sensing segmentation, we introduce the multiview feature aggregation block for extracting multiview information. Furthermore, to overcome the limitations of same-level shortcuts when dealing with multiview problems, we propose the channel group fusion block for cross-layer feature information interaction through a grouped fusion approach. Finally, to enhance the utilization of global features during the feature reconstruction phase, we introduce the aggregation-inhibition-activation block for feature selection and focus, which captures the key features for segmentation. Comprehensive experimental results on the Vaihingen and Potsdam datasets demonstrate that GVANet outperforms current state-of-the-art methods, achieving mIoU scores of 84.5% and 87.6%, respectively.
引用
收藏
页码:16727 / 16743
页数:17
相关论文
共 59 条
[1]  
Anxun Han, 2022, 2022 IEEE International Conference on Systems, Man, and Cybernetics (SMC), P2452, DOI 10.1109/SMC53654.2022.9945314
[2]   Big earth observation time series analysis for monitoring Brazilian agriculture [J].
Araujo Picoli, Michelle Cristina ;
Camara, Gilberto ;
Sanches, Ieda ;
Simoes, Rolf ;
Carvalho, Alexandre ;
Maciel, Adeline ;
Coutinho, Alexandre ;
Esquerdo, Julio ;
Antunes, Joao ;
Begotti, Rodrigo Anzolin ;
Arvor, Damien ;
Almeida, Claudio .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 145 :328-339
[3]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[4]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[5]   Masked-attention Mask Transformer for Universal Image Segmentation [J].
Cheng, Bowen ;
Misra, Ishan ;
Schwing, Alexander G. ;
Kirillov, Alexander ;
Girdhar, Rohit .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :1280-1289
[6]   Dual Attention Network for Scene Segmentation [J].
Fu, Jun ;
Liu, Jing ;
Tian, Haijie ;
Li, Yong ;
Bao, Yongjun ;
Fang, Zhiwei ;
Lu, Hanqing .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3141-3149
[7]   Improving public data for building segmentation from Convolutional Neural Networks (CNNs) for fused airborne lidar and image data using active contours [J].
Griffiths, David ;
Boehm, Jan .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2019, 154 :70-83
[8]   Effective Sequential Classifier Training for SVM-Based Multitemporal Remote Sensing Image Classification [J].
Guo, Yiqing ;
Jia, Xiuping ;
Paull, David .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (06) :3036-3048
[9]   Hybrid first and second order attention Unet for building segmentation in remote sensing images [J].
He, Nanjun ;
Fang, Leyuan ;
Plaza, Antonio .
SCIENCE CHINA-INFORMATION SCIENCES, 2020, 63 (04)
[10]   Swin Transformer Embedding UNet for Remote Sensing Image Semantic Segmentation [J].
He, Xin ;
Zhou, Yong ;
Zhao, Jiaqi ;
Zhang, Di ;
Yao, Rui ;
Xue, Yong .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60