Pixel Representation Augmented through Cross-Attention for High-Resolution Remote Sensing Imagery Segmentation

被引:3
|
作者
Luo, Yiyun [1 ,2 ]
Wang, Jinnian [1 ,2 ]
Yang, Xiankun [1 ,2 ]
Yu, Zhenyu [1 ,2 ]
Tan, Zixuan [1 ,2 ]
机构
[1] Guangzhou Univ, Sch Geog & Remote Sensing, Guangzhou 510006, Peoples R China
[2] Guangzhou Univ, Ctr Remote Sensing Big Data Intelligence Applicat, Guangzhou 510006, Peoples R China
基金
国家重点研发计划;
关键词
land cover classification; transformer; cross-attention; object embedding queries; LAND-COVER CLASSIFICATION; SEMANTIC SEGMENTATION; NETWORK;
D O I
10.3390/rs14215415
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Natural imagery segmentation has been transferred to land cover classification in remote sensing imagery with excellent performance. However, two key issues have been overlooked in the transfer process: (1) some objects were easily overwhelmed by the complex backgrounds; (2) interclass information for indistinguishable classes was not fully utilized. The attention mechanism in the transformer is capable of modeling long-range dependencies on each sample for per-pixel context extraction. Notably, per-pixel context from the attention mechanism can aggregate category information. Therefore, we proposed a semantic segmentation method based on pixel representation augmentation. In our method, a simplified feature pyramid was designed to decode the hierarchical pixel features from the backbone, and then decode the category representations into learnable category object embedding queries by cross-attention in the transformer decoder. Finally, pixel representation is augmented by an additional cross-attention in the transformer encoder under the supervision of auxiliary segmentation heads. The results of extensive experiments on the aerial image dataset Potsdam and satellite image dataset Gaofen Image Dataset with 15 categories (GID-15) demonstrate that the cross-attention is effective, and our method achieved the mean intersection over union (mIoU) of 86.2% and 62.5% on the Potsdam test set and GID-15 validation set, respectively. Additionally, we achieved an inference speed of 76 frames per second (FPS) on the Potsdam test dataset, higher than all the state-of-the-art models we tested on the same device.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] Automated object recognition in high-resolution optical remote sensing imagery
    Yazhou Yao
    Tao Chen
    Hanbo Bi
    Xinhao Cai
    Gensheng Pei
    Guoye Yang
    Zhiyuan Yan
    Xian Sun
    Xing Xu
    Hai Zhang
    National Science Review, 2023, 10 (06) : 38 - 41
  • [42] STUDY OF VARIOUS RESAMPLING TECHNIQUES FOR HIGH-RESOLUTION REMOTE SENSING IMAGERY
    Gurjar, S. B.
    Padmanabhan, N.
    PHOTONIRVACHAK-JOURNAL OF THE INDIAN SOCIETY OF REMOTE SENSING, 2005, 33 (01): : 113 - 120
  • [43] Identification of shelterbelt width from high-resolution remote sensing imagery
    Rongxin Deng
    Gao Yang
    Ying Li
    Zhengran Xu
    Xing Zhang
    Lu Zhang
    Chunjing Li
    Agroforestry Systems, 2022, 96 : 1091 - 1101
  • [44] Normalized cut segmentation with edge constraint for high resolution remote sensing imagery
    Gao, Rongrong
    Zhong, Yanfei
    Zhao, Bei
    Zhang, Liangpei
    PROCEEDINGS OF THE 2015 7TH IEEE INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS (CIS) AND ROBOTICS, AUTOMATION AND MECHATRONICS (RAM), 2015, : 36 - 40
  • [45] ADVANCES IN TEXTURE-BASED SEGMENTATION OF HIGH RESOLUTION REMOTE SENSING IMAGERY
    Gaetano, Raffaele
    Scarpa, Giuseppe
    Poggi, Giovanni
    2009 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, VOLS 1-5, 2009, : 2485 - 2488
  • [46] SMAF-Net: Sharing Multiscale Adversarial Feature for High-Resolution Remote Sensing Imagery Semantic Segmentation
    Chen, Jie
    Zhu, Jingru
    Sun, Geng
    Li, Jianhui
    Deng, Min
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2021, 18 (11) : 1921 - 1925
  • [47] Annotated Dataset for Training Cloud Segmentation Neural Networks Using High-Resolution Satellite Remote Sensing Imagery
    He, Mingyuan
    Zhang, Jie
    He, Yang
    Zuo, Xinjie
    Gao, Zebin
    REMOTE SENSING, 2024, 16 (19)
  • [48] Extraction of water bodies from high-resolution remote sensing imagery based on a deep semantic segmentation network
    Sun, Dechao
    Gao, Guang
    Huang, Lijun
    Liu, Yunpeng
    Liu, Dongquan
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [49] Segmentation for High-Resolution Optical Remote Sensing Imagery Using Improved Quadtree and Region Adjacency Graph Technique
    Fu, Gang
    Zhao, Hongrui
    Li, Cong
    Shi, Limei
    REMOTE SENSING, 2013, 5 (07) : 3259 - 3279
  • [50] A Multi-Scale Spatial Attention Region Proposal Network for High-Resolution Optical Remote Sensing Imagery
    Dong, Ruchan
    Jiao, Licheng
    Zhang, Yan
    Zhao, Jin
    Shen, Weiyan
    REMOTE SENSING, 2021, 13 (17)