Prediction Calibration for Generalized Few-Shot Semantic Segmentation

被引:7
|
作者
Lu, Zhihe [1 ,2 ]
He, Sen [3 ]
Li, Da [4 ]
Song, Yi-Zhe [1 ,2 ]
Xiang, Tao [1 ,2 ]
机构
[1] Univ Surrey, Ctr Vis Speech & Signal Proc CVSSP, Guildford GU2 7XH, England
[2] Univ Surrey, iFlyTek Surrey Joint Res Ctr Artificial Intelligen, Guildford GU2 7XH, England
[3] Meta AI, London, N1C 4BE, England
[4] Samsung AI Ctr, Cambridge CB1 2JH, England
关键词
Generalized few-shot semantic segmentation; prediction calibration; normalized score fusion; feature-score cross-covariance transformer; NETWORK;
D O I
10.1109/TIP.2023.3282070
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Generalized Few-shot Semantic Segmentation (GFSS) aims to segment each image pixel into either base classes with abundant training examples or novel classes with only a handful of (e.g., 1-5) training images per class. Compared to the widely studied Few-shot Semantic Segmentation (FSS), which is limited to segmenting novel classes only, GFSS is much under-studied despite being more practical. Existing approach to GFSS is based on classifier parameter fusion whereby a newly trained novel class classifier and a pre-trained base class classifier are combined to form a new classifier. As the training data is dominated by base classes, this approach is inevitably biased towards the base classes. In this work, we propose a novel Prediction Calibration Network (PCN) to address this problem. Instead of fusing the classifier parameters, we fuse the scores produced separately by the base and novel classifiers. To ensure that the fused scores are not biased to either the base or novel classes, a new Transformer-based calibration module is introduced. It is known that the lower-level features are useful of detecting edge information in an input image than higher level features. Thus, we build a cross-attention module that guides the classifier's final prediction using the fused multi-level features. However, transformers are computationally demanding. Crucially, to make the proposed cross-attention module training tractable at the pixel level, this module is designed based on feature-score cross-covariance and episodically trained to be generalizable at inference time. Extensive experiments on PASCAL-5(i) and COCO-20(i) show that our PCN outperforms the state-the-the-art alternatives by large margins.
引用
收藏
页码:3311 / 3323
页数:13
相关论文
共 50 条
  • [1] Generalized Few-shot Semantic Segmentation
    Tian, Zhuotao
    Lai, Xin
    Jiang, Li
    Liu, Shu
    Shu, Michelle
    Zhao, Hengshuang
    Jia, Jiaya
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11553 - 11562
  • [2] A Strong Baseline for Generalized Few-Shot Semantic Segmentation
    Hajimiri, Sina
    Boudiaf, Malik
    Ben Ayed, Ismail
    Dolz, Jose
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11269 - 11278
  • [3] Learning Orthogonal Prototypes for Generalized Few-shot Semantic Segmentation
    Liu, Sun-Ao
    Zhang, Yiheng
    Qiu, Zhaofan
    Xie, Hongtao
    Zhang, Yongdong
    Yao, Ting
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11319 - 11328
  • [4] Generalized Few-Shot Semantic Segmentation for Remote Sensing Images
    Jia, Yuyu
    Li, Jiabo
    Wang, Qi
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [5] Generalized Few-shot Semantic Segmentation for LiDAR Point Clouds
    Wu, Pengze
    Mei, Jilin
    Zhao, Xijun
    Hu, Yu
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 7622 - 7628
  • [6] Generalized Few-Shot Semantic Segmentation in Remote Sensing: Challenge and Benchmark
    Broni-Bediako, Clifford
    Xia, Junshi
    Song, Jian
    Chen, Hongruixuan
    Siam, Mennatullah
    Yokoya, Naoto
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [7] Embedding Generalized Semantic Knowledge Into Few-Shot Remote Sensing Segmentation
    Wang, Qi
    Jia, Yuyu
    Huang, Wei
    Gao, Junyu
    Li, Qiang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63
  • [8] Few-shot Segmentation and Semantic Segmentation for Underwater Imagery
    Kabir, Imran
    Shaurya, Shubham
    Maigur, Vijayalaxmi
    Thakurdesai, Nikhil
    Latnekar, Mahesh
    Raunak, Mayank
    Crandall, David
    Reza, Md Alimoor
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 11451 - 11457
  • [9] Prototype expansion and feature calibration for few-shot point cloud semantic segmentation
    Zhang, Qieshi
    Wang, Tichao
    Hao, Fusheng
    Wu, Fuxiang
    Cheng, Jun
    NEUROCOMPUTING, 2023, 558
  • [10] LEARNING WITH MEMORY FOR FEW-SHOT SEMANTIC SEGMENTATION
    Lu, Hongchao
    Wei, Chao
    Deng, Zhidong
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 629 - 633