RSPS-SAM: A Remote Sensing Image Panoptic Segmentation Method Based on SAM

被引:3
作者
Liu, Zhuoran [1 ]
Li, Zizhen [2 ]
Liang, Ying [2 ]
Persello, Claudio [3 ]
Sun, Bo [4 ]
He, Guangjun [2 ]
Ma, Lei [5 ]
机构
[1] Beijing Univ Civil Engn & Architecture, Sch Geodesy & Geomat, Beijing 102627, Peoples R China
[2] Beijing Inst Satellite Informat Engn, State Key Lab Space Earth Integrated Informat Tech, Beijing 100095, Peoples R China
[3] Univ Twente, Fac ITC, Dept Earth Observat Sci, NL-7500 AE Enschede, Netherlands
[4] Harbin Engn Univ, Coll Informat & Commun Engn, Harbin 150001, Peoples R China
[5] Nanjing Univ, Sch Geog & Ocean Sci, Nanjing 210023, Peoples R China
基金
中国国家自然科学基金;
关键词
panoptic segmentation; segment anything model; remote sensing; deep learning;
D O I
10.3390/rs16214002
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Satellite remote sensing images contain complex and diverse ground object information and the images exhibit spatial multi-scale characteristics, making the panoptic segmentation of satellite remote sensing images a highly challenging task. Due to the lack of large-scale annotated datasets for panoramic segmentation, existing methods still suffer from weak model generalization capabilities. To mitigate this issue, this paper leverages the advantages of the Segment Anything Model (SAM), which can segment any object in remote sensing images without requiring any annotations and proposes a high-resolution remote sensing image panoptic segmentation method called Remote Sensing Panoptic Segmentation SAM (RSPS-SAM). Firstly, to address the problem of global information loss caused by cropping large remote sensing images for training, a Batch Attention Pyramid was designed to extract multi-scale features from remote sensing images and capture long-range contextual information between cropped patches, thereby enhancing the semantic understanding of remote sensing images. Secondly, we constructed a Mask Decoder to address the limitation of SAM requiring manual input prompts and its inability to output category information. This decoder utilized mask-based attention for mask segmentation, enabling automatic prompt generation and category prediction of segmented objects. Finally, the effectiveness of the proposed method was validated on the high-resolution remote sensing image airport scene dataset RSAPS-ASD. The results demonstrate that the proposed method achieves segmentation and recognition of foreground instances and background regions in high-resolution remote sensing images without the need for prompt input, while providing smooth segmentation boundaries with a panoptic segmentation quality (PQ) of 57.2, outperforming current mainstream methods.
引用
收藏
页数:15
相关论文
共 38 条
  • [21] Panoptic SegFormer: Delving Deeper into Panoptic Segmentation with Transformers
    Li, Zhiqi
    Wang, Wenhai
    Xie, Enze
    Yu, Zhiding
    Anandkumar, Anima
    Alvarez, Jose M.
    Luo, Ping
    Lu, Tong
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 1270 - 1279
  • [22] Swin Transformer: Hierarchical Vision Transformer using Shifted Windows
    Liu, Ze
    Lin, Yutong
    Cao, Yue
    Hu, Han
    Wei, Yixuan
    Zhang, Zheng
    Lin, Stephen
    Guo, Baining
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 9992 - 10002
  • [23] Nguyen KD, 2023, Arxiv, DOI [arXiv:2309.13578, 10.48550/arXiv.2309.13578]
  • [24] The Segment Anything Model (SAM) for remote sensing applications: From zero to one shot
    Osco, Lucas Prado
    Wu, Qiusheng
    de Lemos, Eduardo Lopes
    Gonsalves, Wesley Nunes
    Ramos, Ana Paula Marques
    Li, Jonathan
    Marcato, Jose
    [J]. INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2023, 124
  • [25] SAM-Induced Pseudo Fully Supervised Learning for Weakly Supervised Object Detection in Remote Sensing Images
    Qian, Xiaoliang
    Lin, Chenyang
    Chen, Zhiwu
    Wang, Wei
    [J]. REMOTE SENSING, 2024, 16 (09)
  • [26] Radford A, 2021, PR MACH LEARN RES, V139
  • [27] Segment Anything Model (SAM) Assisted Remote Sensing Supervision for Mariculture-Using Liaoning Province, China as an Example
    Ren, Yougui
    Yang, Xiaomei
    Wang, Zhihua
    Yu, Ge
    Liu, Yueming
    Liu, Xiaoliang
    Meng, Dan
    Zhang, Qingyang
    Yu, Guo
    [J]. REMOTE SENSING, 2023, 15 (24)
  • [28] Sakaino Hidetomo, 2023, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), P3591, DOI 10.1109/CVPRW59228.2023.00367
  • [29] Panoptic SwiftNet: Pyramidal Fusion for Real-Time Panoptic Segmentation
    Saric, Josip
    Orsic, Marin
    Segvic, Sinisa
    [J]. REMOTE SENSING, 2023, 15 (08)
  • [30] Wang D., 2023, Advances in Neural Information Processing Systems, V36, P8815