Query-Guided Prototype Evolution Network for Few-Shot Segmentation

被引:10
作者
Cong, Runmin [1 ,2 ,3 ]
Xiong, Hang [1 ,4 ]
Chen, Jinpeng [5 ]
Zhang, Wei [6 ,7 ]
Huang, Qingming [5 ]
Zhao, Yao [1 ,4 ]
机构
[1] Beijing Jiaotong Univ, Inst Informat Sci, Beijing 100044, Peoples R China
[2] Shandong Univ, Sch Control Sci & Engn, Jinan 250061, Peoples R China
[3] Minist Educ, Key Lab Machine Intelligence & Syst Control, Jinan 250061, Peoples R China
[4] Beijing Key Lab Adv Informat Sci & Network Technol, Beijing 100044, Peoples R China
[5] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[6] Shandong Univ, Sch Control Sci & Engn, Jinan 250061, Peoples R China
[7] Minist Educ, Key Lab Machine Intelligence & Syst Control, Jinan 250061, Peoples R China
关键词
Few-shot segmentation; few-shot learning; semantic segmentation; prototype generation;
D O I
10.1109/TMM.2024.3352921
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Previous Few-Shot Segmentation (FSS) approaches exclusively utilize support features for prototype generation, neglecting the specific requirements of the query. To address this, we present the Query-guided Prototype Evolution Network (QPENet), a new method that integrates query features into the generation process of foreground and background prototypes, thereby yielding customized prototypes attuned to specific queries. The evolution of the foreground prototype is accomplished through a support-query-support iterative process involving two new modules: Pseudo-prototype Generation (PPG) and Dual Prototype Evolution (DPE). The PPG module employs support features to create an initial prototype for the preliminary segmentation of the query image, resulting in a pseudo-prototype reflecting the unique needs of the current query. Subsequently, the DPE module performs reverse segmentation on support images using this pseudo-prototype, leading to the generation of evolved prototypes, which can be considered as custom solutions. As for the background prototype, the evolution begins with a global background prototype that represents the generalized features of all training images. We also design a Global Background Cleansing (GBC) module to eliminate potential adverse components mirroring the characteristics of the current foreground class. Experimental results on the PASCAL-5(i) and COCO-20(i) datasets attest to the substantial enhancements achieved by QPENet over prevailing state-of-the-art techniques, underscoring the validity of our ideas.
引用
收藏
页码:6501 / 6512
页数:12
相关论文
共 76 条
  • [1] Allen Kelsey R., 2019, PR MACH LEARN RES, V97
  • [2] Deep semantic segmentation of natural and medical images: a review
    Asgari Taghanaki, Saeid
    Abhishek, Kumar
    Cohen, Joseph Paul
    Cohen-Adad, Julien
    Hamarneh, Ghassan
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (01) : 137 - 178
  • [3] Few-Shot Segmentation Without Meta-Learning: A Good Transductive Inference Is All You Need?
    Boudiaf, Malik
    Kervadec, Hoel
    Masud, Ziko Imtiaz
    Piantanida, Pablo
    Ben Ayed, Ismail
    Dolz, Jose
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13974 - 13983
  • [4] Boyu Yang, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12353), P763, DOI 10.1007/978-3-030-58598-3_45
  • [5] KepSalinst: Using Peripheral Points to Delineate Salient Instances
    Chen, Jinpeng
    Cong, Runmin
    Ip, Horace Ho Shing
    Kwong, Sam
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (06) : 3392 - 3405
  • [6] Chen L.-C., 2014, Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs, DOI DOI 10.48550/ARXIV.1412.7062
  • [7] Chen L.-C., 2018, P EUR C COMP VIS ECC, P137, DOI [10.1007/978-3-030-01234-2_49, DOI 10.1007/978-3-030-01234-249, DOI 10.1007/978-3-030-01234-2_49]
  • [8] Saliency Guided Inter- and Intra-Class Relation Constraints for Weakly Supervised Semantic Segmentation
    Chen, Tao
    Yao, Yazhou
    Zhang, Lei
    Wang, Qiong
    Xie, Guo-Sen
    Shen, Fumin
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 1727 - 1737
  • [9] Semantically Meaningful Class Prototype Learning for One-Shot Image Segmentation
    Chen, Tao
    Xie, Guo-Sen
    Yao, Yazhou
    Wang, Qiong
    Shen, Fumin
    Tang, Zhenmin
    Zhang, Jian
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 968 - 980
  • [10] Chen ZT, 2019, AAAI CONF ARTIF INTE, P3379