Bi-aggregation-aggregation and self-merging network for few-shot image semantic segmentation

被引:0
作者
Liu, Yu [1 ]
Yu, Ming [1 ]
Zhu, Ye [2 ]
机构
[1] Hebei Univ Technol, Sch Eletron & Informat Engn, Tianjin 300401, Peoples R China
[2] Hebei Univ Technol, Sch Artificial Intelligence & Data Sci, Tianjin 300401, Peoples R China
基金
中国国家自然科学基金;
关键词
few-shot semantic segmentation; similarity of features; bi-aggregation; intra-class diversity; self-merging;
D O I
10.37188/CJLCD.2024-0074
中图分类号
O7 [晶体学];
学科分类号
0702 ; 070205 ; 0703 ; 080501 ;
摘要
Few-shot image semantic segmentation is a very challenging task that attempts to segmem objects of new classes using only a few labeled samples. The mamatmam methods often have problems of low discriminative feature and prototype deviation. To alleviate these problems, a new lowshot image semantic segmentation method hased on a bisaggregation and self-merging network is proposed, which can fully mine the similarity of features and reduce prototype hias. Firstly, we propose a feature-mask bi-aggregation module to provide global semantic information for the feature aggregation and mask aggregation by constructing a dense similarity relation between the support features and the query features covering all spatial locations. Specifically, an enhanced feature and an initial mask with guiding information can be obtained for the query image by performing feature and mask bi-aggregation on the similarity matrices. Then, a self-merging decoder is proposed, which reduces the prototype bias by adding the initial mask-based self-prototype with the known support prototypes, and conveys rich category semantic information to the decoder by fusing the merged prototype with the enhancement feature. Finally, the prediction results obtained by the decoder are further optimized by the prediction results of the base classes. The mlou values of our method on the dataset PASCAL-5 achieve 68.33% and 71.5% in the 1-shot and 5-shot cases, respectively, and on the dataset COCO-20 nchieve 46.5% and 51.4% in the 1-shot and 5-shot cases, respectively, which is superior to the segmentation performance of the mainstream methods, and can segment the target region of the new class more accurately.
引用
收藏
页码:1421 / 1430
页数:10
相关论文
共 22 条
[1]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[2]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338
[3]   Self-support Few-Shot Semantic Segmentation [J].
Fan, Qi ;
Pei, Wenjie ;
Tai, Yu-Wing ;
Tang, Chi-Keung .
COMPUTER VISION, ECCV 2022, PT XIX, 2022, 13679 :701-719
[4]   Simultaneous Detection and Segmentation [J].
Hariharan, Bharath ;
Arbelaez, Pablo ;
Girshick, Ross ;
Malik, Jitendra .
COMPUTER VISION - ECCV 2014, PT VII, 2014, 8695 :297-312
[5]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[6]  
Hochreiter S, 2001, LECT NOTES COMPUT SC, V2130, P87
[7]   Dense Relation Distillation with Context-aware Aggregation for Few-Shot Object Detection [J].
Hu, Hanzhe ;
Bai, Shuai ;
Li, Aoxue ;
Cui, Jinshi ;
Wang, Liwei .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :10180-10189
[8]   FRKDNet:feature refine semantic segmentation network based on knowledge distillation [J].
Jiang Shi-yi ;
Xu Yang ;
Li Dan-yang ;
Fan Run-ze .
CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2023, 38 (11) :1590-1599
[9]   Feature Weighting and Boosting for Few-Shot Segmentation [J].
Khoi Nguyen ;
Todorovic, Sinisa .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :622-631
[10]   Learning What Not to Segment: A New Perspective on Few-Shot Segmentation [J].
Lang, Chunbo ;
Cheng, Gong ;
Tu, Binfei ;
Han, Junwei .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :8047-8057