Imbalanced Few-Shot Learning Based on Meta-transfer Learning

被引:1
作者
Chu, Yan [1 ]
Sun, Xianghui [1 ]
Jiang Songhao [2 ,3 ]
Xie, Tianwen [1 ]
Wang, Zhengkui [4 ]
Shan, Wen [5 ]
机构
[1] Harbin Engn Univ, Harbin 150001, Peoples R China
[2] CNCERT CC, Beijing, Peoples R China
[3] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
[4] Singapore Inst Technol, Singapore, Singapore
[5] Singapore Univ Social Sci, Singapore, Singapore
来源
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VIII | 2023年 / 14261卷
关键词
meta-learning; few-shot learning; meta-transfer learning;
D O I
10.1007/978-3-031-44198-1_30
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-shot learning is a challenging task that aims to learn to adapt to new tasks with only a few labeled samples. Meta-learning is a promising approach to address this challenge, but the learned meta-knowledge on training sets may not always be useful due to class imbalance, task imbalance, and distribution imbalance. In this paper, we propose a novel few-shot learning method based on meta-transfer learning, which is called Meta-Transfer Task-Adaptive Meta-Learning (MT-TAML). Meta-transfer learning is used to transfer the weight parameters of a pre-trained deep neural network, which makes up for the deficiency of using shallow networks as the feature extractor. To address the imbalance problem in realistic few-shot learning scenarios, we introduce a learnable parameter balance meta-knowledge for each task. Additionally, we propose a novel task training strategy that selects the difficult class in each task and re-samples from it to form the difficult task, thereby improving the model's accuracy. Our experimental results show that MT-TAML outperforms existing few-shot learning methods by 2-4%. Furthermore, our ablation experiments confirm the effectiveness of the combination of meta-transfer learning and learnable equilibrium parameters.
引用
收藏
页码:357 / 369
页数:13
相关论文
共 22 条
[1]   Describing Textures in the Wild [J].
Cimpoi, Mircea ;
Maji, Subhransu ;
Kokkinos, Iasonas ;
Mohamed, Sammy ;
Vedaldi, Andrea .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :3606-3613
[2]   Mean shift: A robust approach toward feature space analysis [J].
Comaniciu, D ;
Meer, P .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (05) :603-619
[3]  
Finn C, 2017, PR MACH LEARN RES, V70
[4]  
Garcia V., 2018, INT C LEARN REPR ICL, P1
[5]   MS-Celeb-1M: A Dataset and Benchmark for Large-Scale Face Recognition [J].
Guo, Yandong ;
Zhang, Lei ;
Hu, Yuxiao ;
He, Xiaodong ;
Gao, Jianfeng .
COMPUTER VISION - ECCV 2016, PT III, 2016, 9907 :87-102
[6]  
Huang J., 2017, INT C MACHINE LEARNI
[7]  
Iwata T., 2022, Adv. Neural. Inf. Process. Syst., V35, P16637
[8]  
Jiang S., 2023, INT JOINT C ARTIFICI
[9]  
Jongejan Jonas, 2016, QUICK DRAW EXPT
[10]   Human-level concept learning through probabilistic program induction [J].
Lake, Brenden M. ;
Salakhutdinov, Ruslan ;
Tenenbaum, Joshua B. .
SCIENCE, 2015, 350 (6266) :1332-1338