Foreground Feature Attention Module Based on Unsupervised Saliency Detector for Few-Shot Learning

被引：5

作者：

Kong, Zhengmin ^{[1
]}

Fu, Zhuolin ^{[1
]}

Xiong, Feng ^{[1
]}

Zhang, Chenggang ^{[1
]}

机构：

[1] Wuhan Univ, Sch Elect Engn & Automat, Wuhan 430072, Peoples R China

来源：

IEEE ACCESS | 2021年 / 9卷

关键词：

Feature extraction; Detectors; Task analysis; Training; Deep learning; Complexity theory; Prototypes; Few-shot learning; foreground feature; unsupervised saliency detector; classification; OBJECT DETECTION;

D O I：

10.1109/ACCESS.2021.3069581

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In recent years, few-shot learning is proposed to solve the problem of lacking samples in deep learning. However, previous works are mainly concentrated on optimizing neural network structures or augmenting the dataset while ignoring the local relationship of the images. Considering that humans pay more attention to the foreground or prominent features of the images during image recognition, we proposed the foreground feature attention module (FFAM) based on an unsupervised saliency detector for few-shot learning. The FFAM consists of two parts: the foreground extraction module and the features attention module. More specifically, we first extract the foreground images by Robust Background Detector (RBD), one of the best unsupervised saliency detectors. Secondly, we employ the same embedding module to extract the features of both original images and foreground images. Finally, we introduce three improvements to enhance the foreground features and make our network focus on the foreground features without losing background information. Our proposed FFAM is more sensitive to the foreground features than previous approaches. Hence, it effectively recognizes those images with similar backgrounds. Extensive experiments are conducted on miniImagenet and tieredImagenet datasets. It is demonstrated that our proposed FFAM greatly improves the accuracy performance over baseline systems for both one-shot and few-shot classification tasks without increasing the network complexity.

引用

页码：51179 / 51188

页数：10

共 51 条

[1] LaSO: Label-Set Operations networks for multi-label few-shot learning [J].

Alfassy, Amit ;

Karlinsky, Leonid ;

Aides, Amit ;

Shtok, Joseph ;

Harary, Sivan ;

Feris, Rogerio ;

Giryes, Raja ;

Bronstein, Alex M. .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :6541-6550

[2]

[Anonymous], 2018, INT C LEARN REPR ICL

[3]

[Anonymous], 2018, 6 INT C LEARNING REP

[4]

[Anonymous], 2017, 34 INT C MACH LEARN

[5]

[Anonymous], 2014, NEURAL TURING MACHIN

[6]

Bateni P., 2020, IEEE C COMP VIS PATT, P14493

[7]

Benaim S, 2018, ADV NEUR IN, V31

[8] Salient Object Detection: A Benchmark [J].

Borji, Ali ;

Cheng, Ming-Ming ;

Jiang, Huaizu ;

Li, Jia .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2015, 24 (12) :5706-5722

[9] Salient object detection: A survey [J].

Borji, Ali ;

Cheng, Ming-Ming ;

Hou, Qibin ;

Jiang, Huaizu ;

Li, Jia .

COMPUTATIONAL VISUAL MEDIA, 2019, 5 (02) :117-150

[10] Memory Matching Networks for One-Shot Image Recognition [J].

Cai, Qi ;

Pan, Yingwei ;

Yao, Ting ;

Yan, Chenggang ;

Mei, Tao .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4080-4088

← 1 2 3 4 5 6 →