FPIseg: Iterative segmentation network based on feature pyramid for few-shot segmentation

被引：0

作者：

Wang, Ronggui ^{[1
]}

Yang, Cong ^{[1
]}

Yang, Juan ^{[1
,2
]}

Xue, Lixia ^{[1
]}

机构：

[1] Hefei Univ Technol, Sch Comp & Informat, Hefei, Peoples R China

[2] Hefei Univ Technol, Sch Comp & Informat, Hefei 230601, Peoples R China

来源：

IET IMAGE PROCESSING | 2023年 / 17卷 / 13期

基金：

中国国家自然科学基金;

关键词：

attention mechanism; feature engineering; feature pyramid network; few-shot semantic segmentation; prototype network;

D O I：

10.1049/ipr2.12898

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Few-shot segmentation (FSS) enables rapid adaptation to the segmentation task of unseen-classes object based on a few labelled support samples. Currently, the focal point of research in the FSS field is to align features between support and query images, aiming to improve the segmentation performance. However, most existing FSS methods implement such support/query alignment by solely leveraging middle-level feature for generalization, ignoring the category semantic information contained in high-level feature, while pooling operation inevitably lose spatial information of the feature. To alleviate these issues, the authors propose the Iterative Segmentation Network Based on Feature Pyramid (FPIseg), which mainly consists of three modules: Feature Pyramid Fusion Module (FPFM), Region Feature Enhancement Module (RFEM), and Iterative Optimization Segmentation Module (IOSM). Firstly, FPFM fully utilizes the foreground information from the support image to implement support/query alignment under multi-scale, multi-level semantic backgrounds. Secondly, RFEM enhances the foreground detail information of aligned feature to improve generalization ability. Finally, ISOM iteratively segments the query image to optimize the prediction result and improve segmentation performance. Extensive experiments on the PASCAL-5(i) and COCO-20(i) datasets show that FPIseg achieves considerable segmentation performance under both 1-shot and 5-shot settings.

引用

页码：3801 / 3814

页数：14

共 63 条

[1] [Anonymous], 2017, LEARNING DEEP REPRES
[2] Few-Shot Segmentation Without Meta-Learning: A Good Transductive Inference Is All You Need?
Boudiaf, Malik
Kervadec, Hoel
Masud, Ziko Imtiaz
Piantanida, Pablo
Ben Ayed, Ismail
Dolz, Jose
[J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13974 - 13983
[3] Boyu Yang, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12353), P763, DOI 10.1007/978-3-030-58598-3_45
[4] Chao P., 2017, LARGE KERNEL MATTERS, P4353
[5] Chen L.C., 2017, RETHINKING ATROUS CO, DOI DOI 10.48550/ARXIV.1706.05587
[6] CaMap: Camera-based Map Manipulation on Mobile Devices
Chen, Liang
Chen, Dongyi
[J]. PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE2018), 2018,
[7] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
Chen, Liang-Chieh
Papandreou, George
Kokkinos, Iasonas
Murphy, Kevin
Yuille, Alan L.
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
[8] Attention to Scale: Scale-aware Semantic Image Segmentation
Chen, Liang-Chieh
Yang, Yi
Wang, Jiang
Xu, Wei
Yuille, Alan L.
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 3640 - 3649
[9] Semantic Instance Segmentation for Autonomous Driving
De Brabandere, Bert
Neven, Davy
Van Gool, Luc
[J]. 2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 478 - 480
[10] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

← 1 2 3 4 5 6 7 →