Coarse-to-fine pseudo supervision guided meta-task optimization for few-shot object classification

被引:12
作者
Cui, Yawen [1 ]
Liao, Qing [2 ]
Hu, Dewen [3 ]
An, Wei [4 ]
Liu, Li [1 ,5 ]
机构
[1] Univ Oulu, Ctr Machine Vis & Signal Anal, Oulu, Finland
[2] Harbin Inst Technol, Dept Comp Sci & Technol, Shenzhen, Peoples R China
[3] Natl Univ Def Technol, Coll Intelligent Sci, Changsha, Peoples R China
[4] Natl Univ Def Technol, Coll Elect Sci & Technol, Changsha, Peoples R China
[5] Natl Univ Def Technol, Coll Syst Engn, Changsha, Peoples R China
基金
芬兰科学院; 中国国家自然科学基金;
关键词
Unsupervised few-shot learning; Meta-learning; Clustering; Object classification;
D O I
10.1016/j.patcog.2021.108296
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Few-Shot Learning (FSL) is a challenging and practical learning pattern, aiming to solve a target task which has only a few labeled examples. Currently, the field of FSL has made great progress, but largely in the supervised setting, where a large auxiliary labeled dataset is required for offline training. However, the unsupervised FSL (UFSL) problem where the auxiliary dataset is fully unlabeled has been seldom in-vestigated despite of its significant value. This paper focuses on the more general and challenging UFSL problem and presents a novel method named Coarse-to-Fine Pseudo Supervision-guided Meta-Learning (C2FPS-ML) for unsupervised few-shot object classification. It first obtains prior knowledge from an un-labeled auxiliary dataset during unsupervised meta-training, and then use the prior knowledge to as-sist the downstream few-shot classification task. Coarse-to-Fine Pseudo Supervisions in C2FPS-ML aim to optimize meta-task sampling process in unsupervised meta-training stage which is one of the domi-nant factors for improving the performance of meta-learning based FSL algorithms. Human can learn new concepts progressively or hierarchically following the coarse-to-fine manners. By simulating this human's behaviour, we develop two versions of C2FPS-ML for two different scenarios: one is natural object dataset and another one is other kinds of dataset (e.g., handwritten character dataset). For natural object dataset scenario, we propose to exploit the potential hierarchical semantics of the unlabeled auxiliary dataset to build a tree-like structure of visual concepts. For another scenario, progressive pseudo supervision is obtained by forming clusters in different similarity aspects and is represented by a pyramid-like struc-ture. The obtained structure is applied as the supervision to construct meta-tasks in meta-training stage, and prior knowledge from the unlabeled auxiliary dataset is learned from the coarse-grained level to the fine-grained level. The proposed method sets the new state of the art on the gold-standard miniImageNet and achieves remarkable results on Omniglot while simultaneously increases efficiency. (c) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页数:11
相关论文
共 49 条
[1]  
Antoniou Antreas, 2019, ARXIV PREPRINT ARXIV
[2]  
Berthelot D, 2019, ADV NEUR IN, V32
[3]  
Berthelot David, 2019, INT C LEARN REPR
[4]   Deep Clustering for Unsupervised Learning of Visual Features [J].
Caron, Mathilde ;
Bojanowski, Piotr ;
Joulin, Armand ;
Douze, Matthijs .
COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 :139-156
[5]   Multi-Level Semantic Feature Augmentation for One-Shot Learning [J].
Chen, Zitian ;
Fu, Yanwei ;
Zhang, Yinda ;
Jiang, Yu-Gang ;
Xue, Xiangyang ;
Sigal, Leonid .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (09) :4594-4605
[6]  
Chenghao Liu, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12363), P752, DOI 10.1007/978-3-030-58523-5_44
[7]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[8]  
Donahue J., 2017, P INT C LEARN REPR, P1
[9]  
Finn C, 2017, PR MACH LEARN RES, V70
[10]   Few-Shot SAR Target Classification via Metalearning [J].
Fu, Kun ;
Zhang, Tengfei ;
Zhang, Yue ;
Wang, Zhirui ;
Sun, Xian .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60