Query-support semantic correlation mining for few-shot segmentation

Cited by: 9
Authors
Shao, Ji [1 ,2 ,3 ,4 ]
Gong, Bo [1 ,2 ,3 ,4 ]
Dai, Kanyuan [2 ,3 ,4 ,5 ]
Li, Daoliang [1 ,2 ,3 ,4 ]
Jing, Ling [1 ,2 ,3 ,4 ,5 ]
Chen, Yingyi [1 ,2 ,3 ,4 ]
Affiliations
[1] China Agr Univ, Coll Informat & Elect Engn, Beijing 100083, Peoples R China
[2] China Agr Univ, Natl Innovat Ctr Digital Fishery, Beijing 100083, Peoples R China
[3] Minist Agr & Rural Affairs, Key Lab Smart Farming Technol Aquat Anim & Livestock, Beijing 100083, Peoples R China
[4] China Agr Univ, Beijing Engn & Technol Res Ctr Internet Things Agr, Beijing 100083, Peoples R China
[5] China Agr Univ, Coll Sci, Beijing 100083, Peoples R China
Keywords
Few-shot semantic segmentation; Semantic segmentation; Few-shot learning; Semantic correlation mining; Attention mechanism;
DOI
10.1016/j.engappai.2023.106797
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Given one or a few annotated samples, few-shot segmentation can segment unseen target objects, thereby reducing the need for extensive data annotation. However, because the data are scarce and diverse, it is challenging to use a few support samples effectively to guide the segmentation of a query sample. Most existing approaches focus on mining information from the support samples while ignoring the crucial relationship between query and support samples. To address this limitation and capture richer mutual information, we propose a query-support semantic correlation mining module that serves as an elaborate guide for subsequent dense matching. Specifically, two kinds of query guide maps are generated that leverage global and local similarities to learn the comprehensive correlation between multi-scale support and query features. By incorporating this module, the network can perform dense matching more effectively and reduce the risk of misclassification. In addition, a dual-attention module is introduced to obtain a discriminative initial prototype representation and avert prototype bias. Furthermore, our method generalizes well in the k-shot setting. Extensive experiments on the PASCAL-5i and COCO-20i datasets demonstrate that the proposed method is effective, yielding segmentation results competitive with state-of-the-art methods.
Pages: 11
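
The abstract above describes the query-support correlation idea only at a high level. The snippet below is a minimal sketch, assuming (not confirmed by this record) that the global query guide map is computed as the pixel-wise cosine similarity between the query feature map and a masked-average-pooled support prototype, a common construction in few-shot segmentation; all function names, shapes, and the normalization step are illustrative assumptions rather than the paper's actual module.

# Sketch only: a plausible "global guide map" for few-shot segmentation,
# not the module proposed in the paper.
import torch
import torch.nn.functional as F


def support_prototype(supp_feat: torch.Tensor, supp_mask: torch.Tensor) -> torch.Tensor:
    """Masked average pooling: (B, C, H, W) features + (B, 1, H, W) mask -> (B, C) prototype."""
    area = supp_mask.sum(dim=(2, 3)).clamp(min=1e-6)          # (B, 1) foreground area
    return (supp_feat * supp_mask).sum(dim=(2, 3)) / area     # (B, C) foreground prototype


def global_guide_map(query_feat: torch.Tensor, proto: torch.Tensor) -> torch.Tensor:
    """Pixel-wise cosine similarity between query features and the support prototype,
    min-max normalized to [0, 1] so it can act as a soft foreground prior for dense matching."""
    proto = proto[:, :, None, None].expand_as(query_feat)     # (B, C, H, W)
    sim = F.cosine_similarity(query_feat, proto, dim=1).unsqueeze(1)  # (B, 1, H, W)
    s_min = sim.amin(dim=(2, 3), keepdim=True)
    s_max = sim.amax(dim=(2, 3), keepdim=True)
    return (sim - s_min) / (s_max - s_min + 1e-6)


if __name__ == "__main__":
    B, C, H, W = 2, 256, 60, 60
    query = torch.randn(B, C, H, W)                           # query feature map
    support = torch.randn(B, C, H, W)                         # support feature map
    mask = (torch.rand(B, 1, H, W) > 0.5).float()             # binary support foreground mask
    guide = global_guide_map(query, support_prototype(support, mask))
    print(guide.shape)                                        # torch.Size([2, 1, 60, 60])

In a multi-scale variant, such a map would be computed per feature level and concatenated with the query features before dense matching; the paper additionally uses a local-similarity guide map and a dual-attention prototype, which this sketch does not attempt to reproduce.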