Multi-granularity episodic contrastive learning for few-shot learning

被引:25
作者
Zhu, Pengfei [1 ,2 ]
Zhu, Zhilin [1 ]
Wang, Yu [1 ,2 ]
Zhang, Jinglin [3 ]
Zhao, Shuai [1 ,4 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300350, Peoples R China
[2] Haihe Lab Informat Technol Applicat Innovat, Tianjin, Peoples R China
[3] Shandong Univ, Sch Control Sci & Engn, Jinan 250061, Peoples R China
[4] ICV Data Dept China Automot Data Co Ltd, Beijing, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Multi-granularity computing; Episodic contrastive learning; Few-shot learning; Deep learning;
D O I
10.1016/j.patcog.2022.108820
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot learning (FSL) aims at fast adaptation to novel classes with few training samples. Among FSL methods, meta-learning and transfer learning-based methods are the most powerful ones. However, most of them rely to some extent on cross-entropy loss, which leads to representations that are overly concerned with the classes already seen, and in turn leads to sub-optimal generalization on novel classes. In this study, we are inspired by meta-learning and transfer learning-based methods and believe good feature representations are vital for FSL. To this end, we propose a new multi-granularity episodic contrastive learning method (MGECL) that introduces contrastive learning into the episode training process. In particular, by enforcing our proposed contrastive loss on both class and instance granularities, the model is able to extract category-independent discriminative patterns and learn richer and more transferable feature representations. Extensive experiments demonstrate that our proposed method achieves state-of-the-art performance on three popular few-shot benchmarks. Our code is available at https://github.com/z1358/MGECL_PR. (C) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:9
相关论文
共 46 条
[1]   Hierarchical Graph Neural Networks for Few-Shot Learning [J].
Chen, Cen ;
Li, Kenli ;
Wei, Wei ;
Zhou, Joey Tianyi ;
Zeng, Zeng .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (01) :240-252
[2]  
Chen T, 2020, PR MACH LEARN RES, V119
[3]  
Chen W, 2019, 7 INT C LEARNING REP
[4]  
Chen Yinbo, 2021, ICCV
[5]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[6]  
Dhillon G.S., 2020, ICLR
[7]  
Finn C, 2017, PR MACH LEARN RES, V70
[8]   Attentive Weights Generation for Few Shot Learning via Information Maximization [J].
Guo, Yiluan ;
Cheung, Ngai-Man .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :13496-13505
[9]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[10]   Periphery-aware COVID-19 diagnosis with contrastive representation enhancement [J].
Hou, Junlin ;
Xu, Jilan ;
Jiang, Longquan ;
Du, Shanshan ;
Feng, Rui ;
Zhang, Yuejie ;
Shan, Fei ;
Xue, Xiangyang .
PATTERN RECOGNITION, 2021, 118