Adaptive similarity-guided self-merging network for few-shot semantic segmentation

被引:1
|
作者
Liu, Yu [1 ]
Guo, Yingchun [2 ]
Zhu, Ye [2 ]
Yu, Ming [1 ]
机构
[1] Hebei Univ Technol, Sch Elect & Informat Engn, Tianjin 300401, Peoples R China
[2] Hebei Univ Technol, Sch Artificial Intelligence, Tianjin 300401, Peoples R China
基金
中国国家自然科学基金;
关键词
Few-shot semantic segmentation; Style differences; Adaptive weight; Bi-aggregation; Prototype merging;
D O I
10.1016/j.compeleceng.2024.109527
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Few-shot Semantic Segmentation (FSS) attempts to segment the new category with only a few labeled samples, presenting a significant challenge. Existing approaches primarily focus on leveraging category information from the support set to identify objects of the new category in the query image. However, these models often struggle when confronted with substantial differences between paired images. To address issues stemming from scenario differences and intra-class diversity, this paper proposes an adaptive similarity-guided self-merging network. Firstly, style differences of multi-level features are introduced to alleviate the network's sensitivity to scenario variations and learn an adaptive weight for the K-shot scheme. Secondly, a feature-mask bi-aggregation module is designed to learn an enhanced feature and an initial mask for the query image. Within this module, dynamic correlations cover all the spatial locations, providing global information crucial for feature and mask aggregation. Subsequently, a self-merging module is proposed to alleviate prototype bias. It merges a self-prototype derived from the initial mask with an adaptive weighted support prototype obtained from K support images. Finally, the target object is segmented using the enhanced feature and merging prototype, and segmentation results are further refined by predictions of base categories and an adjustment factor derived from multilevel style differences. The proposed method achieves 69.1% (1-shot) and 72.3% (5-shot) mIoU on the PASCAL-5i dataset, and 47.4% (1-shot) and 52.1% (5-shot) mIoU on the COCO-20i dataset. These results demonstrate state-of-the-art segmentation performance compared to mainstream methods. (c) 2017 Elsevier Inc. All rights reserved.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] Self-Enhanced Mixed Attention Network for Three-Modal Images Few-Shot Semantic Segmentation
    Song, Kechen
    Zhang, Yiming
    Bao, Yanqi
    Zhao, Ying
    Yan, Yunhui
    SENSORS, 2023, 23 (14)
  • [22] CSAANet: An Attention-Based Mechanism for Aligned Few-Shot Semantic Segmentation Network
    Wei, Guangpeng
    Qian, Pengjiang
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT V, 2023, 14090 : 760 - 772
  • [23] Self-support matching networks with multiscale attention for few-shot semantic segmentation
    Yang, Yafeng
    Gao, Yufei
    Wei, Lin
    He, Mengyang
    Shi, Yucheng
    Wang, Hailing
    Li, Qing
    Zhu, Zhiyuan
    NEUROCOMPUTING, 2024, 594
  • [24] Dual cross-attention Transformer network for few-shot image semantic segmentation
    Liu, Yu
    Guo, Yingchun
    Zhu, Ye
    Yu, Ming
    CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2024, 39 (11) : 1494 - 1505
  • [25] CFENet: Boosting Few-Shot Semantic Segmentation With Complementary Feature-Enhanced Network
    Wang, Yun
    Zhu, Lu
    Liu, Yuanyuan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 5630 - 5640
  • [26] Multi-prototype collaborative perception enhancement network for few-shot semantic segmentation
    Chang, Zhaobin
    Gao, Xiong
    Kong, Dongyi
    Li, Na
    Lu, Yonggang
    VISUAL COMPUTER, 2024,
  • [27] Few-shot Semantic Segmentation by Exploiting Dynamic and Regional Contexts
    Gu, Hongyu
    Zhuge, Yunzhi
    Zhang, Lu
    Qi, Jinqing
    Lu, Huchuan
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 834 - 839
  • [28] Prototype-Guided Graph Reasoning Network for Few-Shot Medical Image Segmentation
    Huang, Wendong
    Hu, Jinwu
    Xiao, Junhao
    Wei, Yang
    Bi, Xiuli
    Xiao, Bin
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2025, 44 (02) : 761 - 773
  • [29] Multi-scale Discriminative Location-aware Network for Few-Shot Semantic Segmentation
    Dong, Zihao
    Zhang, Ruixun
    Shao, Xiuli
    Zhou, Hongyu
    2019 IEEE 43RD ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), VOL 2, 2019, : 42 - 47
  • [30] Generalized Few-Shot Semantic Segmentation in Remote Sensing: Challenge and Benchmark
    Broni-Bediako, Clifford
    Xia, Junshi
    Song, Jian
    Chen, Hongruixuan
    Siam, Mennatullah
    Yokoya, Naoto
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21