FPIseg: Iterative segmentation network based on feature pyramid for few-shot segmentation

被引:0
作者
Wang, Ronggui [1 ]
Yang, Cong [1 ]
Yang, Juan [1 ,2 ]
Xue, Lixia [1 ]
机构
[1] Hefei Univ Technol, Sch Comp & Informat, Hefei, Peoples R China
[2] Hefei Univ Technol, Sch Comp & Informat, Hefei 230601, Peoples R China
基金
中国国家自然科学基金;
关键词
attention mechanism; feature engineering; feature pyramid network; few-shot semantic segmentation; prototype network;
D O I
10.1049/ipr2.12898
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot segmentation (FSS) enables rapid adaptation to the segmentation task of unseen-classes object based on a few labelled support samples. Currently, the focal point of research in the FSS field is to align features between support and query images, aiming to improve the segmentation performance. However, most existing FSS methods implement such support/query alignment by solely leveraging middle-level feature for generalization, ignoring the category semantic information contained in high-level feature, while pooling operation inevitably lose spatial information of the feature. To alleviate these issues, the authors propose the Iterative Segmentation Network Based on Feature Pyramid (FPIseg), which mainly consists of three modules: Feature Pyramid Fusion Module (FPFM), Region Feature Enhancement Module (RFEM), and Iterative Optimization Segmentation Module (IOSM). Firstly, FPFM fully utilizes the foreground information from the support image to implement support/query alignment under multi-scale, multi-level semantic backgrounds. Secondly, RFEM enhances the foreground detail information of aligned feature to improve generalization ability. Finally, ISOM iteratively segments the query image to optimize the prediction result and improve segmentation performance. Extensive experiments on the PASCAL-5(i) and COCO-20(i) datasets show that FPIseg achieves considerable segmentation performance under both 1-shot and 5-shot settings.
引用
收藏
页码:3801 / 3814
页数:14
相关论文
共 63 条
  • [1] [Anonymous], 2017, LEARNING DEEP REPRES
  • [2] Few-Shot Segmentation Without Meta-Learning: A Good Transductive Inference Is All You Need?
    Boudiaf, Malik
    Kervadec, Hoel
    Masud, Ziko Imtiaz
    Piantanida, Pablo
    Ben Ayed, Ismail
    Dolz, Jose
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13974 - 13983
  • [3] Boyu Yang, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12353), P763, DOI 10.1007/978-3-030-58598-3_45
  • [4] Chao P., 2017, LARGE KERNEL MATTERS, P4353
  • [5] Chen L.C., 2017, RETHINKING ATROUS CO, DOI DOI 10.48550/ARXIV.1706.05587
  • [6] CaMap: Camera-based Map Manipulation on Mobile Devices
    Chen, Liang
    Chen, Dongyi
    [J]. PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE2018), 2018,
  • [7] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
    Chen, Liang-Chieh
    Papandreou, George
    Kokkinos, Iasonas
    Murphy, Kevin
    Yuille, Alan L.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
  • [8] Attention to Scale: Scale-aware Semantic Image Segmentation
    Chen, Liang-Chieh
    Yang, Yi
    Wang, Jiang
    Xu, Wei
    Yuille, Alan L.
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3640 - 3649
  • [9] Semantic Instance Segmentation for Autonomous Driving
    De Brabandere, Bert
    Neven, Davy
    Van Gool, Luc
    [J]. 2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 478 - 480
  • [10] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848