Adaptive similarity-guided self-merging network for few-shot semantic segmentation

被引:1
作者
Liu, Yu [1 ]
Guo, Yingchun [2 ]
Zhu, Ye [2 ]
Yu, Ming [1 ]
机构
[1] Hebei Univ Technol, Sch Elect & Informat Engn, Tianjin 300401, Peoples R China
[2] Hebei Univ Technol, Sch Artificial Intelligence, Tianjin 300401, Peoples R China
基金
中国国家自然科学基金;
关键词
Few-shot semantic segmentation; Style differences; Adaptive weight; Bi-aggregation; Prototype merging;
D O I
10.1016/j.compeleceng.2024.109527
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Few-shot Semantic Segmentation (FSS) attempts to segment the new category with only a few labeled samples, presenting a significant challenge. Existing approaches primarily focus on leveraging category information from the support set to identify objects of the new category in the query image. However, these models often struggle when confronted with substantial differences between paired images. To address issues stemming from scenario differences and intra-class diversity, this paper proposes an adaptive similarity-guided self-merging network. Firstly, style differences of multi-level features are introduced to alleviate the network's sensitivity to scenario variations and learn an adaptive weight for the K-shot scheme. Secondly, a feature-mask bi-aggregation module is designed to learn an enhanced feature and an initial mask for the query image. Within this module, dynamic correlations cover all the spatial locations, providing global information crucial for feature and mask aggregation. Subsequently, a self-merging module is proposed to alleviate prototype bias. It merges a self-prototype derived from the initial mask with an adaptive weighted support prototype obtained from K support images. Finally, the target object is segmented using the enhanced feature and merging prototype, and segmentation results are further refined by predictions of base categories and an adjustment factor derived from multilevel style differences. The proposed method achieves 69.1% (1-shot) and 72.3% (5-shot) mIoU on the PASCAL-5i dataset, and 47.4% (1-shot) and 52.1% (5-shot) mIoU on the COCO-20i dataset. These results demonstrate state-of-the-art segmentation performance compared to mainstream methods. (c) 2017 Elsevier Inc. All rights reserved.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] Learning prototypes from background and latent objects for few-shot semantic segmentation
    Wang, Yicong
    Huang, Rong
    Zhou, Shubo
    Jiang, Xueqin
    Fang, Zhijun
    KNOWLEDGE-BASED SYSTEMS, 2025, 314
  • [42] Efficient Sampling-based Gaussian Processes for few-shot semantic segmentation
    Zhang, Xin-Yi
    Lu, Xian-Kai
    Yin, Yi-Long
    Ye, Han-Jia
    Zhan, De-Chuan
    PATTERN RECOGNITION, 2025, 164
  • [43] Simple yet effective joint guidance learning for few-shot semantic segmentation
    Chang, Zhaobin
    Lu, Yonggang
    Ran, Xingcheng
    Gao, Xiong
    Zhao, Hong
    APPLIED INTELLIGENCE, 2023, 53 (22) : 26603 - 26621
  • [44] Cross-domain few-shot semantic segmentation for the astronaut work environment
    Sun, Qingwei
    Chao, Jiangang
    Lin, Wanhong
    ADVANCES IN SPACE RESEARCH, 2024, 74 (11) : 5934 - 5949
  • [45] MCEENet: Multi-Scale Context Enhancement and Edge-Assisted Network for Few-Shot Semantic Segmentation
    Zhou, Hongjie
    Zhang, Rufei
    He, Xiaoyu
    Li, Nannan
    Wang, Yong
    Shen, Sheng
    SENSORS, 2023, 23 (06)
  • [46] Holistic Prototype Attention Network for Few-Shot Video Object Segmentation
    Tang, Yin
    Chen, Tao
    Jiang, Xiruo
    Yao, Yazhou
    Xie, Guo-Sen
    Shen, Heng-Tao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (08) : 6699 - 6709
  • [47] CAMSNet: Few-Shot Semantic Segmentation via Class Activation Map and Self-Cross Attention Block
    Yan, Jingjing
    Zhuang, Xuyang
    Zhao, Xuezhuan
    Shao, Xiaoyan
    Han, Jiaqi
    CMC-COMPUTERS MATERIALS & CONTINUA, 2025, 82 (03): : 5363 - 5386
  • [48] LTAF-NET: LEARNING TASK-AWARE ADAPTIVE FEATURES AND REFINING MASK FOR FEW-SHOT SEMANTIC SEGMENTATION
    Mao, Binjie
    Wang, Lingfeng
    Xiang, Shiming
    Pan, Chunhong
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2320 - 2324
  • [49] Scale-Aware Detailed Matching for Few-Shot Aerial Image Semantic Segmentation
    Yao, Xiwen
    Cao, Qinglong
    Feng, Xiaoxu
    Cheng, Gong
    Han, Junwei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [50] BMDENet: Bi-Directional Modality Difference Elimination Network for Few-Shot RGB-T Semantic Segmentation
    Zhao, Ying
    Song, Kechen
    Zhang, Yiming
    Yan, Yunhui
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2023, 70 (11) : 4266 - 4270