Semantic segmentation of remote sensing images based on dilated convolution and spatial-channel attention mechanism

被引：5

作者：

Jin, Huazhong ^{[1
]}

Bao, Zhixi ^{[1
]}

Chang, Xueli ^{[1
]}

Zhang, Tingtao ^{[2
]}

Chen, Can ^{[1
]}

机构：

[1] Hubei Univ Technol, Coll Comp, Dept Comp Sci & Technol, Wuhan, Peoples R China

[2] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing, Peoples R China

来源：

JOURNAL OF APPLIED REMOTE SENSING | 2023年 / 17卷 / 01期

关键词：

semantic segmentation; dilated convolution; spatial-channel attention; convolutional conditional random field;

D O I：

10.1117/1.JRS.17.016518

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

The rich context information and multiscale ground information in remote sensing images are crucial to improving the semantic segmentation accuracy. Therefore, we propose a remote sensing image semantic segmentation method that integrates multilevel spatial channel attention and multi-scale dilated convolution, effectively addressing the issue of poor segmentation performance of small target objects in remote sensing images. This method builds a multilevel characteristic fusion structure, combining deep-level semantic characteristics with the details of the shallow levels to generate multiscale feature diagrams. Then, we introduce the dilated convolution of the series combination in each layer of the atrous spatial pyramid pooling structure to reduce the loss of small target information. Finally, using convolutional conditional random field to describe the context information on the space and edges to improve the model's ability to extract details. We prove the effectiveness of the model on the three public datasets. From the quantitative point of view, we mainly evaluate the four indicators of the model's F1 score, overall accuracy (OA), Intersection over Union (IoU), and Mean Intersection over Union (MIoU). On GID dataset, F1 score, OA, and MIoU reach 87.27, 87.80, and 77.70, respectively, superior to most mainstream semantic segmentation networks.

引用

页数：17

共 48 条

[1] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
Badrinarayanan, Vijay
Kendall, Alex
Cipolla, Roberto
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
[2] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
Chen, Liang-Chieh
Zhu, Yukun
Papandreou, George
Schroff, Florian
Adam, Hartwig
[J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
[3] SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning
Chen, Long
Zhang, Hanwang
Xiao, Jun
Nie, Liqiang
Shao, Jian
Liu, Wei
Chua, Tat-Seng
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6298 - 6306
[4] Improved U-Net Remote Sensing Classification Algorithm Fusing Attention and Multiscale Features
Fan, Xiangsuo
Yan, Chuan
Fan, Jinlong
Wang, Nayi
[J]. REMOTE SENSING, 2022, 14 (15)
[5] Dual Attention Network for Scene Segmentation
Fu, Jun
Liu, Jing
Tian, Haijie
Li, Yong
Bao, Yongjun
Fang, Zhiwei
Lu, Hanqing
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3141 - 3149
[6] Attention mechanisms in computer vision: A survey
Guo, Meng-Hao
Xu, Tian-Xing
Liu, Jiang-Jiang
Liu, Zheng-Ning
Jiang, Peng-Tao
Mu, Tai-Jiang
Zhang, Song-Hai
Martin, Ralph R.
Cheng, Ming-Ming
Hu, Shi-Min
[J]. COMPUTATIONAL VISUAL MEDIA, 2022, 8 (03) : 331 - 368
[7] Segmentation for Object-Based Image Analysis (OBIA): A review of algorithms and challenges from remote sensing perspective
Hossain, Mohammad D.
Chen, Dongmei
[J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2019, 150 : 115 - 134
[8] Huang DM, 2017, IEEE INT C NETW SENS, P679, DOI 10.1109/ICNSC.2017.8000172
[9] ISPRS, 2014, 2D SEM LAB CONT
[10] Krahenbuhl P., 2011, P ADV NEURAL INFORM, DOI DOI 10.48550/ARXIV.1210.5644

← 1 2 3 4 5 →