Intermediate Prototype Mining Transformer for Few-Shot Semantic Segmentation

被引:0
|
作者
Liu, Yuanwei [1 ]
Liu, Nian [2 ]
Yao, Xiwen [1 ]
Han, Junwei [1 ]
机构
[1] Northwestern Polytech Univ, Fremont, CA 94539 USA
[2] Mohamed bin Zayed Univ Artificial Intelligence, Abu Dhabi, U Arab Emirates
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022) | 2022年
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot semantic segmentation aims to segment the target objects in query under the condition of a few annotated support images. Most previous works strive to mine more effective category information from the support to match with the corresponding objects in query. However, they all ignored the category information gap between query and support images. If the objects in them show large intra-class diversity, forcibly migrating the category information from the support to the query is ineffective. To solve this problem, we are the first to introduce an intermediate prototype for mining both deterministic category information from the support and adaptive category knowledge from the query. Specifically, we design an Intermediate Prototype Mining Transformer (IPMT) to learn the prototype in an iterative way. In each IPMT layer, we propagate the object information in both support and query features to the prototype and then use it to activate the query feature map. By conducting this process iteratively, both the intermediate prototype and the query feature can be progressively improved. At last, the final query feature is used to yield precise segmentation prediction. Extensive experiments on both PASCAL-5(i) and COCO-20(i) datasets clearly verify the effectiveness of our IPMT and show that it outperforms previous state-of-the-art methods by a large margin. Code is available at https://github.com/LIUYUANWEI98/IPMT
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Intermediate prototype network for few-shot segmentation
    Luo, Xiaoliu
    Duan, Zhao
    Zhang, Taiping
    SIGNAL PROCESSING, 2023, 203
  • [2] Variational Prototype Inference for Few-Shot Semantic Segmentation
    Wang, Haochen
    Yang, Yandan
    Cao, Xianbin
    Zhen, Xiantong
    Snoek, Cees
    Shao, Ling
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 525 - 534
  • [3] A Transformer-based Adaptive Prototype Matching Network for Few-Shot Semantic Segmentation
    Chen, Sihan
    Chen, Yadang
    Zheng, Yuhui
    Yang, Zhi-Xin
    Wu, Enhua
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 659 - 667
  • [4] A lightweight siamese transformer for few-shot semantic segmentation
    Zhu, Hegui
    Zhou, Yange
    Jiang, Cong
    Yang, Lianping
    Jiang, Wuming
    Wang, Zhimu
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (13): : 7455 - 7469
  • [5] A lightweight siamese transformer for few-shot semantic segmentation
    Hegui Zhu
    Yange Zhou
    Cong Jiang
    Lianping Yang
    Wuming Jiang
    Zhimu Wang
    Neural Computing and Applications, 2024, 36 : 7455 - 7469
  • [6] Dynamic Prototype Convolution Network for Few-Shot Semantic Segmentation
    Liu, Jie
    Bao, Yanqi
    Xie, Guo-Sen
    Xiong, Huan
    Sonke, Jan-Jakob
    Gavves, Efstratios
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 11543 - 11552
  • [7] Cycle association prototype network for few-shot semantic segmentation
    Hao, Zhuangzhuang
    Shao, Ji
    Gong, Bo
    Yang, Jingwen
    Jing, Ling
    Chen, Yingyi
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 138
  • [8] PANet: Few-Shot Image Semantic Segmentation with Prototype Alignment
    Wang, Kaixin
    Liew, Jun Hao
    Zou, Yingtian
    Zhou, Daquan
    Feng, Jiashi
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9196 - 9205
  • [9] Focus on Query: Adversarial Mining Transformer for Few-Shot Segmentation
    Wang, Yuan
    Luo, Naisong
    Zhang, Tianzhu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [10] TPSN: Transformer-based multi-Prototype Search Network for few-shot semantic segmentation
    Wang, Wenjian
    Duan, Lijuan
    En, Qing
    Zhang, Baochang
    Liang, Fangfang
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 103