Word vector embedding and self-supplementing network for Generalized Few-shot Semantic Segmentation

被引:0
|
作者
Wang, Xiaowei [1 ]
Chen, Qiong [1 ]
Yang, Yong [1 ]
机构
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
基金
中国国家自然科学基金;
关键词
Generalized Few-shot Semantic Segmentation; Inter-class interference; Semantic word embedding; Self-supplementing;
D O I
10.1016/j.neucom.2024.128737
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Under the condition of sufficient base class samples and a few novel class samples, Generalized Few- shot Semantic Segmentation (GFSS) classifies each pixel in the query image as base class, novel class, or background. A standard GFSS approach involves two training stages: base class learning and novel class updating. However, inter-class interference and information loss which contribute to the poor performance of GFSS, have not been synthetical considered. To address the problem, we propose an Embedded-Self- Supplementing Network (ESSNet), i.e., semantic word embedding and query set self-supplementing information to enhance segmentation accuracy. Specifically, the semantic word embedding module employs distance information between word vectors to assist the model in learning the distance between class prototypes. In order to transform the semantic word vector prototypes from the semantic space to the visual embedding space, we designed a triplet loss function to supervise the word vector embedding module, where the word vector prototype serves as an anchor and positive-negative samples are collected among the general features of the support image. To compensate for the information loss caused by using prototypes to represent classes, we propose a self-supplementing module to mine the information contained in the query image. Specifically, this module first makes a preliminary prediction on the query image, then selects high-confidence area to form pseudo labels, and finally uses pseudo labels to extract query prototypes to supplement the missing information. Extensive experiments on PASCAL-5(i) and COCO-20(i) show that ESSNet has superior performance and outperforms state-of-the-art methods in all settings.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Generalized Few-shot Semantic Segmentation
    Tian, Zhuotao
    Lai, Xin
    Jiang, Li
    Liu, Shu
    Shu, Michelle
    Zhao, Hengshuang
    Jia, Jiaya
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11553 - 11562
  • [2] Embedding Generalized Semantic Knowledge Into Few-Shot Remote Sensing Segmentation
    Wang, Qi
    Jia, Yuyu
    Huang, Wei
    Gao, Junyu
    Li, Qiang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [3] Prediction Calibration for Generalized Few-Shot Semantic Segmentation
    Lu, Zhihe
    He, Sen
    Li, Da
    Song, Yi-Zhe
    Xiang, Tao
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 3311 - 3323
  • [4] Self-regularized prototypical network for few-shot semantic segmentation
    Ding, Henghui
    Zhang, Hui
    Jiang, Xudong
    PATTERN RECOGNITION, 2023, 133
  • [5] A Strong Baseline for Generalized Few-Shot Semantic Segmentation
    Hajimiri, Sina
    Boudiaf, Malik
    Ben Ayed, Ismail
    Dolz, Jose
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11269 - 11278
  • [6] PRIOR SEMANTIC HARMONIZATION NETWORK FOR FEW-SHOT SEMANTIC SEGMENTATION
    Yang, Xinhao
    Ma, Liyan
    Zhou, Yang
    Peng, Yan
    Xie, Shaorong
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1126 - 1130
  • [7] Few-Shot Semantic Segmentation with Cyclic Memory Network
    Xie, Guo-Sen
    Xiong, Huan
    Liu, Jie
    Yao, Yazhou
    Shao, Ling
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 7273 - 7282
  • [8] Deep Reasoning Network for Few-shot Semantic Segmentation
    Zhuge, Yunzhi
    Shen, Chunhua
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 5344 - 5352
  • [9] Self-support Few-Shot Semantic Segmentation
    Fan, Qi
    Pei, Wenjie
    Tai, Yu-Wing
    Tang, Chi-Keung
    COMPUTER VISION, ECCV 2022, PT XIX, 2022, 13679 : 701 - 719
  • [10] Learning Orthogonal Prototypes for Generalized Few-shot Semantic Segmentation
    Liu, Sun-Ao
    Zhang, Yiheng
    Qiu, Zhaofan
    Xie, Hongtao
    Zhang, Yongdong
    Yao, Ting
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11319 - 11328