Meta-Generating Deep Attentive Metric for Few-Shot Classification

被引:27
作者
Zhou, Fei [1 ]
Zhang, Lei [2 ,3 ]
Wei, Wei [2 ,3 ,4 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Peoples R China
[2] Northwestern Polytech Univ, Shaanxi Prov Key Lab Speech & Image Informat Proc, Xian 710072, Peoples R China
[3] Northwestern Polytech Univ, Natl Engn Lab Integrated Aerosp Ground Ocean Big, Sch Comp Sci, Xian 710072, Peoples R China
[4] Northwestern Polytech Univ Shenzhen, Res & Dev Inst, Shenzhen 518057, Peoples R China
基金
中国国家自然科学基金;
关键词
Measurement; Task analysis; Training; Gaussian distribution; Optimization; Standards; Feature extraction; Few-shot learning; deep attentive metric; meta-learning; NETWORK; MODEL;
D O I
10.1109/TCSVT.2022.3173687
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Learning to generate a task-aware base learner proves a promising direction to deal with few-shot learning (FSL) problem. Existing methods mainly focus on generating an embedding model utilized with a fixed metric (e.g., cosine distance) for nearest neighbour classification or directly generating a linear classifier. However, due to the limited discriminative capacity of such a simple metric or classifier, these methods fail to generalize to challenging cases appropriately. To mitigate this problem, we present a novel deep metric meta-generation method that turns to an orthogonal direction, i.e., learning to adaptively generate a specific metric for a new FSL task based on the task description (e.g., a few labelled samples). In this study, we structure the metric using a three-layers deep attentive network that is flexible enough to produce a discriminative metric for each task. Moreover, different from existing methods that utilize an uni-modal weight distribution conditioned on labelled samples for network generation, the proposed meta-learner establishes a multi-modal weight distribution conditioned on cross-class sample pairs using a tailored variational autoencoder, which can separately capture the specific inter-class discrepancy statistics for each class and jointly embed the statistics for all classes into metric generation. By doing this, the generated metric can be appropriately adapted to a new FSL task with pleasing generalization performance. To demonstrate this, we test the proposed method on three benchmark FSL datasets and gain competitive results with state-of-the-art competitors.
引用
收藏
页码:6863 / 6873
页数:11
相关论文
共 82 条
[1]  
Bertinetto L., 2018, INT C LEARNING REPRE
[2]  
Cao K., 2021, PROC INT C LEARN REP
[3]   Bayesian Correlation Filter Learning With Gaussian Scale Mixture Model for Visual Tracking [J].
Cao, Yuan ;
Shi, Guangming ;
Zhang, Tianzhu ;
Dong, Weisheng ;
Wu, Jinjian ;
Xie, Xuemei ;
Li, Xin .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (05) :3085-3098
[4]   Hierarchical Graph Neural Networks for Few-Shot Learning [J].
Chen, Cen ;
Li, Kenli ;
Wei, Wei ;
Zhou, Joey Tianyi ;
Zeng, Zeng .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (01) :240-252
[5]  
Chen MT, 2020, AAAI CONF ARTIF INTE, V34, P10559
[6]   Meta-Learning-Based Incremental Few-Shot Object Detection [J].
Cheng, Meng ;
Wang, Hanli ;
Long, Yu .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (04) :2158-2169
[7]   Know You at One Glance: A Compact Vector Representation for Low-Shot Learning [J].
Cheng, Yu ;
Zhao, Jian ;
Wang, Zhecan ;
Xu, Yan ;
Jayashree, Karlekar ;
Shen, Shengmei ;
Feng, Jiashi .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, :1924-1932
[8]  
Dahl GE, 2013, INT CONF ACOUST SPEE, P8609, DOI 10.1109/ICASSP.2013.6639346
[9]   On the Importance of Distractors for Few-Shot Classification [J].
Das, Rajshekhar ;
Wang, Yu-Xiong ;
Moura, Jose M. F. .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :9010-9020
[10]  
Finn C, 2017, PR MACH LEARN RES, V70