A Creative Weak Supervised Semantic Segmentation for Remote Sensing Images

被引:0
|
作者
Wang, Zhibao [1 ,2 ]
Chang, Huan [1 ,2 ]
Bai, Lu [3 ]
Chen, Liangfu [4 ]
Bi, Xiuli [5 ]
机构
[1] Northeast Petr Univ, Dept Bohai Rim Energy Res Inst, Qinhuangdao 066004, Peoples R China
[2] Northeast Petr Univ, Sch Comp & Informat Technol, Daqing 163318, Peoples R China
[3] Queens Univ Belfast, Sch Elect Elect Engn & Comp Sci, Belfast BT9 5BN, North Ireland
[4] Chinese Acad Sci, Aerosp Informat Res Inst, Dept State Key Lab Remote Sensing Sci, Beijing 100094, Peoples R China
[5] Chongqing Univ Posts & Telecommun, Dept Comp Sci & Technol, Chongqing 400065, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
基金
中国国家自然科学基金;
关键词
Remote sensing; Semantic segmentation; Cams; Training; Feature extraction; Sensors; Semantics; Petroleum; Location awareness; Decoding; Fine-tuning; remote sensing image; text prompts; weakly supervised semantic segmentation (WSSS);
D O I
10.1109/TGRS.2024.3477749
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
In weakly supervised semantic segmentation (WSSS) tasks on remote sensing images, it is a common practice to train a classification network from scratch using a large batch of images with a limited number of classes. Subsequently, class activation maps are extracted from the model based on predefined class indices, and these maps are then optimized to obtain pseudolabels. To make this strategy effective when introducing a new class, a substantial amount of data needs to be provided to the model. In this article, we present an innovative framework, RS-TextWS-Seg, designed to efficiently generate high-quality segmentation results for a wide range of remote sensing objects using concise descriptions. Our proposed framework comprises three sequential stages: initially, we undertake parameter fine-tuning of the contrastive language-image pretraining (CLIP) model to swiftly strengthen its capacity for zero-shot detection of a limited number of remote sensing features. Subsequently, we introduce a text-driven background suppression mechanism aimed at deriving class activation maps from the refined CLIP model based on textual cues, while concurrently mitigating background noises. Finally, we use the segment anything model (SAM) to refine the edges of the extracted class activation map. We widely researched the leading-edge methodologies in WSSS and conducted a range of comparative experiments and ablation studies to prove the efficacy of our proposed framework. The research findings underscore that RS-TextWS-Seg outperforms other state-of-the-art methods on renowned datasets such as DLRSD and Potsdam, as well as on bespoke datasets specifically curated for overground petroleum pipelines and oil well fields.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Avoiding Negative Transfer for Semantic Segmentation of Remote Sensing Images
    Wang, Hao
    Tao, Chao
    Qi, Ji
    Xiao, Rong
    Li, Haifeng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [22] Semantic Segmentation for Remote Sensing Images Based on Adaptive Feature Selection Network
    Xiang, Shao
    Xie, Quangqi
    Wang, Mi
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [23] LANet: Local Attention Embedding to Improve the Semantic Segmentation of Remote Sensing Images
    Ding, Lei
    Tang, Hao
    Bruzzone, Lorenzo
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (01): : 426 - 435
  • [24] Contrastive Tokens and Label Activation for Remote Sensing Weakly Supervised Semantic Segmentation
    Hu, Zaiyi
    Gao, Junyu
    Yuan, Yuan
    Li, Xuelong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 11
  • [25] Frequency-Driven Edge Guidance Network for Semantic Segmentation of Remote Sensing Images
    Li, Jinsong
    Zhang, Shujun
    Sun, Yukang
    Han, Qi
    Sun, Yuanyuan
    Wang, Yimin
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 9677 - 9693
  • [26] MDANet: Unsupervised, Mixed-Domain Adaptation for Semantic Segmentation of Remote Sensing Images
    Cui, Hao
    Zhang, Guo
    Qi, Ji
    Li, Haifeng
    Tao, Chao
    Li, Xue
    Hou, Shasha
    Li, Deren
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [27] Dual-Path Sparse Hierarchical Network for Semantic Segmentation of Remote Sensing Images
    Wang, Yupei
    Shi, Hao
    Dong, Shan
    Zhuang, Yin
    Chen, Liang
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [28] Progressive Guidance Edge Perception Network for Semantic Segmentation of Remote-Sensing Images
    Pan, Shaoming
    Tao, Yulong
    Chen, Xiaoshu
    Chong, Yanwen
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [29] RSImageNet: A Universal Deep Semantic Segmentation Lifecycle for Remote Sensing Images
    Qin, Yiqing
    Chi, Mingmin
    IEEE ACCESS, 2020, 8 : 68254 - 68267
  • [30] Hidden Path Selection Network for Semantic Segmentation of Remote Sensing Images
    Yang, Kunping
    Tong, Xin-Yi
    Xia, Gui-Song
    Shen, Weiming
    Zhang, Liangpei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60