Meta-Generating Deep Attentive Metric for Few-Shot Classification

Cited by: 34

Authors:

Zhou, Fei ^{[1]}

Zhang, Lei ^{[2,3]}

Wei, Wei ^{[2,3,4]}

Affiliations:

[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Peoples R China

[2] Northwestern Polytech Univ, Shaanxi Prov Key Lab Speech & Image Informat Proc, Xian 710072, Peoples R China

[3] Northwestern Polytech Univ, Natl Engn Lab Integrated Aerosp Ground Ocean Big, Sch Comp Sci, Xian 710072, Peoples R China

[4] Northwestern Polytech Univ Shenzhen, Res & Dev Inst, Shenzhen 518057, Peoples R China

Source:

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2022 / Vol. 32 / Issue 10

Funding:

National Natural Science Foundation of China;

Keywords:

Measurement; Task analysis; Training; Gaussian distribution; Optimization; Standards; Feature extraction; Few-shot learning; deep attentive metric; meta-learning; NETWORK; MODEL;

DOI:

10.1109/TCSVT.2022.3173687

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronics and communication technology];

Discipline classification code:

0808 ; 0809 ;

Abstract:

Learning to generate a task-aware base learner has proven a promising direction for the few-shot learning (FSL) problem. Existing methods mainly focus on either generating an embedding model that is used with a fixed metric (e.g., cosine distance) for nearest-neighbour classification, or directly generating a linear classifier. However, because such a simple metric or classifier has limited discriminative capacity, these methods fail to generalize well to challenging cases. To mitigate this problem, we present a novel deep metric meta-generation method that takes an orthogonal direction, i.e., learning to adaptively generate a specific metric for a new FSL task based on the task description (e.g., a few labelled samples). In this study, we structure the metric as a three-layer deep attentive network that is flexible enough to produce a discriminative metric for each task. Moreover, unlike existing methods that use a uni-modal weight distribution conditioned on labelled samples for network generation, the proposed meta-learner establishes a multi-modal weight distribution conditioned on cross-class sample pairs using a tailored variational autoencoder, which can separately capture the specific inter-class discrepancy statistics for each class and jointly embed the statistics for all classes into metric generation. As a result, the generated metric can be appropriately adapted to a new FSL task with good generalization performance. To demonstrate this, we test the proposed method on three benchmark FSL datasets and obtain results competitive with state-of-the-art methods.
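For readers unfamiliar with the baseline that the abstract contrasts against, the following is a minimal sketch of nearest-neighbour classification with a fixed cosine-distance metric on a toy few-shot task. It is not the paper's method: the function names (`cosine_distance`, `classify`), the two-dimensional embeddings, and the class labels are all hypothetical and chosen only for illustration.

```python
import math


def cosine_distance(u, v):
    """Fixed metric: 1 - cosine similarity between two embedding vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)


def classify(query, support):
    """Return the label of the labelled support sample closest to the query.

    support: list of (embedding, label) pairs, i.e. the few labelled
    samples that describe the task (hypothetical toy data below).
    """
    _, label = min(support, key=lambda pair: cosine_distance(query, pair[0]))
    return label


# Toy 2-way 2-shot task with made-up 2-D embeddings.
support = [
    ([1.0, 0.1], "cat"),
    ([0.9, 0.2], "cat"),
    ([0.1, 1.0], "dog"),
    ([0.2, 0.8], "dog"),
]

print(classify([0.8, 0.3], support))  # prints "cat"
```

The metric here is the same for every task; the approach described in the abstract instead generates a task-specific metric, which this sketch does not attempt to reproduce.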


Pages: 6863-6873

Page count: 11
