Semantic segmentation of remote sensing images based on dilated convolution and spatial-channel attention mechanism

被引:5
作者
Jin, Huazhong [1 ]
Bao, Zhixi [1 ]
Chang, Xueli [1 ]
Zhang, Tingtao [2 ]
Chen, Can [1 ]
机构
[1] Hubei Univ Technol, Coll Comp, Dept Comp Sci & Technol, Wuhan, Peoples R China
[2] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing, Peoples R China
关键词
semantic segmentation; dilated convolution; spatial-channel attention; convolutional conditional random field;
D O I
10.1117/1.JRS.17.016518
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The rich context information and multiscale ground information in remote sensing images are crucial to improving the semantic segmentation accuracy. Therefore, we propose a remote sensing image semantic segmentation method that integrates multilevel spatial channel attention and multi-scale dilated convolution, effectively addressing the issue of poor segmentation performance of small target objects in remote sensing images. This method builds a multilevel characteristic fusion structure, combining deep-level semantic characteristics with the details of the shallow levels to generate multiscale feature diagrams. Then, we introduce the dilated convolution of the series combination in each layer of the atrous spatial pyramid pooling structure to reduce the loss of small target information. Finally, using convolutional conditional random field to describe the context information on the space and edges to improve the model's ability to extract details. We prove the effectiveness of the model on the three public datasets. From the quantitative point of view, we mainly evaluate the four indicators of the model's F1 score, overall accuracy (OA), Intersection over Union (IoU), and Mean Intersection over Union (MIoU). On GID dataset, F1 score, OA, and MIoU reach 87.27, 87.80, and 77.70, respectively, superior to most mainstream semantic segmentation networks.
引用
收藏
页数:17
相关论文
共 48 条
  • [1] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [2] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
    Chen, Liang-Chieh
    Zhu, Yukun
    Papandreou, George
    Schroff, Florian
    Adam, Hartwig
    [J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
  • [3] SCA-CNN: Spatial and Channel-wise Attention in Convolutional Networks for Image Captioning
    Chen, Long
    Zhang, Hanwang
    Xiao, Jun
    Nie, Liqiang
    Shao, Jian
    Liu, Wei
    Chua, Tat-Seng
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6298 - 6306
  • [4] Improved U-Net Remote Sensing Classification Algorithm Fusing Attention and Multiscale Features
    Fan, Xiangsuo
    Yan, Chuan
    Fan, Jinlong
    Wang, Nayi
    [J]. REMOTE SENSING, 2022, 14 (15)
  • [5] Dual Attention Network for Scene Segmentation
    Fu, Jun
    Liu, Jing
    Tian, Haijie
    Li, Yong
    Bao, Yongjun
    Fang, Zhiwei
    Lu, Hanqing
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3141 - 3149
  • [6] Attention mechanisms in computer vision: A survey
    Guo, Meng-Hao
    Xu, Tian-Xing
    Liu, Jiang-Jiang
    Liu, Zheng-Ning
    Jiang, Peng-Tao
    Mu, Tai-Jiang
    Zhang, Song-Hai
    Martin, Ralph R.
    Cheng, Ming-Ming
    Hu, Shi-Min
    [J]. COMPUTATIONAL VISUAL MEDIA, 2022, 8 (03) : 331 - 368
  • [7] Segmentation for Object-Based Image Analysis (OBIA): A review of algorithms and challenges from remote sensing perspective
    Hossain, Mohammad D.
    Chen, Dongmei
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2019, 150 : 115 - 134
  • [8] Huang DM, 2017, IEEE INT C NETW SENS, P679, DOI 10.1109/ICNSC.2017.8000172
  • [9] ISPRS, 2014, 2D SEM LAB CONT
  • [10] Krahenbuhl P., 2011, P ADV NEURAL INFORM, DOI DOI 10.48550/ARXIV.1210.5644