Prediction Calibration for Generalized Few-Shot Semantic Segmentation

被引:7
|
作者
Lu, Zhihe [1 ,2 ]
He, Sen [3 ]
Li, Da [4 ]
Song, Yi-Zhe [1 ,2 ]
Xiang, Tao [1 ,2 ]
机构
[1] Univ Surrey, Ctr Vis Speech & Signal Proc CVSSP, Guildford GU2 7XH, England
[2] Univ Surrey, iFlyTek Surrey Joint Res Ctr Artificial Intelligen, Guildford GU2 7XH, England
[3] Meta AI, London, N1C 4BE, England
[4] Samsung AI Ctr, Cambridge CB1 2JH, England
关键词
Generalized few-shot semantic segmentation; prediction calibration; normalized score fusion; feature-score cross-covariance transformer; NETWORK;
D O I
10.1109/TIP.2023.3282070
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Generalized Few-shot Semantic Segmentation (GFSS) aims to segment each image pixel into either base classes with abundant training examples or novel classes with only a handful of (e.g., 1-5) training images per class. Compared to the widely studied Few-shot Semantic Segmentation (FSS), which is limited to segmenting novel classes only, GFSS is much under-studied despite being more practical. Existing approach to GFSS is based on classifier parameter fusion whereby a newly trained novel class classifier and a pre-trained base class classifier are combined to form a new classifier. As the training data is dominated by base classes, this approach is inevitably biased towards the base classes. In this work, we propose a novel Prediction Calibration Network (PCN) to address this problem. Instead of fusing the classifier parameters, we fuse the scores produced separately by the base and novel classifiers. To ensure that the fused scores are not biased to either the base or novel classes, a new Transformer-based calibration module is introduced. It is known that the lower-level features are useful of detecting edge information in an input image than higher level features. Thus, we build a cross-attention module that guides the classifier's final prediction using the fused multi-level features. However, transformers are computationally demanding. Crucially, to make the proposed cross-attention module training tractable at the pixel level, this module is designed based on feature-score cross-covariance and episodically trained to be generalizable at inference time. Extensive experiments on PASCAL-5(i) and COCO-20(i) show that our PCN outperforms the state-the-the-art alternatives by large margins.
引用
收藏
页码:3311 / 3323
页数:13
相关论文
共 50 条
  • [41] Axial Assembled Correspondence Network for Few-Shot Semantic Segmentation
    Liu, Yu
    Jiang, Bin
    Xu, Jiaming
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2023, 10 (03) : 711 - 721
  • [42] Few-shot Semantic Segmentation by Exploiting Dynamic and Regional Contexts
    Gu, Hongyu
    Zhuge, Yunzhi
    Zhang, Lu
    Qi, Jinqing
    Lu, Huchuan
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 834 - 839
  • [43] FFNet: Feature Fusion Network for Few-shot Semantic Segmentation
    Wang, Ya-Nan
    Tian, Xiangtao
    Zhong, Guoqiang
    COGNITIVE COMPUTATION, 2022, 14 (02) : 875 - 886
  • [44] FBINet: Few-Shot Semantic Segmentation With Foreground and Background Iteration
    Huang, Zhifu
    Chen, Ziwei
    Liu, Yu
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
  • [45] Channel Interaction with Local Enhancement for Few-Shot Semantic Segmentation
    Gao, Jie
    Luo, Xiaoliu
    Zhang, Taiping
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [46] FFNet: Feature Fusion Network for Few-shot Semantic Segmentation
    Ya-Nan Wang
    Xiangtao Tian
    Guoqiang Zhong
    Cognitive Computation, 2022, 14 : 875 - 886
  • [47] Word vector embedding and self-supplementing network for Generalized Few-shot Semantic Segmentation
    Wang, Xiaowei
    Chen, Qiong
    Yang, Yong
    NEUROCOMPUTING, 2025, 613
  • [48] Learning Foreground Information Bottleneck for few-shot semantic segmentation
    Hu, Yutao
    Huang, Xin
    Luo, Xiaoyan
    Han, Jungong
    Cao, Xianbin
    Zhang, Jun
    PATTERN RECOGNITION, 2024, 146
  • [49] DRNet: Disentanglement and Recombination Network for Few-Shot Semantic Segmentation
    Chang, Zhaobin
    Gao, Xiong
    Li, Na
    Zhou, Huiyu
    Lu, Yonggang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (07) : 5560 - 5574
  • [50] Cycle association prototype network for few-shot semantic segmentation
    Hao, Zhuangzhuang
    Shao, Ji
    Gong, Bo
    Yang, Jingwen
    Jing, Ling
    Chen, Yingyi
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 138