Adaptive similarity-guided self-merging network for few-shot semantic segmentation

被引:1
作者
Liu, Yu [1 ]
Guo, Yingchun [2 ]
Zhu, Ye [2 ]
Yu, Ming [1 ]
机构
[1] Hebei Univ Technol, Sch Elect & Informat Engn, Tianjin 300401, Peoples R China
[2] Hebei Univ Technol, Sch Artificial Intelligence, Tianjin 300401, Peoples R China
基金
中国国家自然科学基金;
关键词
Few-shot semantic segmentation; Style differences; Adaptive weight; Bi-aggregation; Prototype merging;
D O I
10.1016/j.compeleceng.2024.109527
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Few-shot Semantic Segmentation (FSS) attempts to segment the new category with only a few labeled samples, presenting a significant challenge. Existing approaches primarily focus on leveraging category information from the support set to identify objects of the new category in the query image. However, these models often struggle when confronted with substantial differences between paired images. To address issues stemming from scenario differences and intra-class diversity, this paper proposes an adaptive similarity-guided self-merging network. Firstly, style differences of multi-level features are introduced to alleviate the network's sensitivity to scenario variations and learn an adaptive weight for the K-shot scheme. Secondly, a feature-mask bi-aggregation module is designed to learn an enhanced feature and an initial mask for the query image. Within this module, dynamic correlations cover all the spatial locations, providing global information crucial for feature and mask aggregation. Subsequently, a self-merging module is proposed to alleviate prototype bias. It merges a self-prototype derived from the initial mask with an adaptive weighted support prototype obtained from K support images. Finally, the target object is segmented using the enhanced feature and merging prototype, and segmentation results are further refined by predictions of base categories and an adjustment factor derived from multilevel style differences. The proposed method achieves 69.1% (1-shot) and 72.3% (5-shot) mIoU on the PASCAL-5i dataset, and 47.4% (1-shot) and 52.1% (5-shot) mIoU on the COCO-20i dataset. These results demonstrate state-of-the-art segmentation performance compared to mainstream methods. (c) 2017 Elsevier Inc. All rights reserved.
引用
收藏
页数:19
相关论文
共 50 条
  • [31] Query-support semantic correlation mining for few-shot segmentation
    Shao, Ji
    Gong, Bo
    Dai, Kanyuan
    Li, Daoliang
    Jing, Ling
    Chen, Yingyi
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
  • [32] Visible and thermal images fusion architecture for few-shot semantic segmentation
    Bao, Yanqi
    Song, Kechen
    Wang, Jie
    Huang, Liming
    Dong, Hongwen
    Yan, Yunhui
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 80
  • [33] Class-Aware Self- and Cross-Attention Network for Few-Shot Semantic Segmentation of Remote Sensing Images
    Liang, Guozhen
    Xie, Fengxi
    Chien, Ying-Ren
    MATHEMATICS, 2024, 12 (17)
  • [34] Hierarchical bidirectional aggregation with prior guided transformer for few-shot segmentation
    Qiuyu Kong
    Jie Jiang
    Junyan Yang
    Qi Wang
    International Journal of Multimedia Information Retrieval, 2023, 12
  • [35] Hierarchical bidirectional aggregation with prior guided transformer for few-shot segmentation
    Kong, Qiuyu
    Jiang, Jie
    Yang, Junyan
    Wang, Qi
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2023, 12 (02)
  • [36] Multi-Content Interaction Network for Few-Shot Segmentation
    Chen, Hao
    Yu, Yunlong
    Dong, Yonghan
    Lu, Zheming
    Li, Yingming
    Zhang, Zhongfei
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (06)
  • [37] SELF-REINFORCING FOR FEW-SHOT MEDICAL IMAGE SEGMENTATION
    Huang, Yao
    Liu, Jianming
    Chen, Hua
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 655 - 659
  • [38] FECANet: Boosting Few-Shot Semantic Segmentation With Feature-Enhanced Context-Aware Network
    Liu, Huafeng
    Peng, Pai
    Chen, Tao
    Wang, Qiong
    Yao, Yazhou
    Hua, Xian-Sheng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 8580 - 8592
  • [39] FPIseg: Iterative segmentation network based on feature pyramid for few-shot segmentation
    Wang, Ronggui
    Yang, Cong
    Yang, Juan
    Xue, Lixia
    IET IMAGE PROCESSING, 2023, 17 (13) : 3801 - 3814
  • [40] Simple yet effective joint guidance learning for few-shot semantic segmentation
    Zhaobin Chang
    Yonggang Lu
    Xingcheng Ran
    Xiong Gao
    Hong Zhao
    Applied Intelligence, 2023, 53 : 26603 - 26621