Few-shot learning with unsupervised part discovery and part-aligned similarity

被引：12

作者：

Chen, Wentao ^{[1
,2
]}

Zhang, Zhang ^{[2
,3
,4
]}

Wang, Wei ^{[2
,3
]}

Wang, Liang ^{[2
,3
]}

Wang, Zilei ^{[1
]}

Tan, Tieniu ^{[1
,2
,3
]}

机构：

[1] Univ Sci & Technol China, Hefei, Peoples R China

[2] CASIA, NLPR, Ctr Res Intelligent Percept & Comp, Beijing, Peoples R China

[3] Univ Chinese Acad Sci, Beijing, Peoples R China

[4] Inst Automat, 95 Zhongguancun East Rd, Beijing, Peoples R China

来源：

PATTERN RECOGNITION | 2023年 / 133卷

基金：

中国国家自然科学基金;

关键词：

Few-shot learning; Self-supervised learning; Part discovery network; Part-aligned similarity;

D O I：

10.1016/j.patcog.2022.108986

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Few-shot learning aims to recognize novel concepts with only a few examples. To this end, previous studies resort to acquiring a strong inductive bias via meta-learning on a group of similar tasks, which however needs a large labeled base dataset to sample training tasks. In this paper, we show that such inductive bias can be learned from a flat collection of unlabeled images, and instantiated as transfer-able representations among seen and unseen classes. Specifically, we propose a novel unsupervised Part Discovery Network (PDN) to learn transferable representations from unlabeled images, which automat-ically selects the most discriminative part from an input image and then maximizes its similarities to the global view of the input and other neighbors with similar semantics. To better leverage the learned representations for few-shot learning, we further propose Part-Aligned Similarity (PAS), the key of which is to measure image similarities based on a set of discriminative and aligned parts. We conduct extensive studies on five popular few-shot learning datasets to evaluate our approach. The experimental results show that our approach outperforms previous unsupervised methods by a large margin and is even com-parable with state-of-the-art supervised methods.(c) 2022 Elsevier Ltd. All rights reserved.

引用

页数：12

共 57 条

[1]

Afrasiyabi Arman, 2020, Computer Vision - ECCV 2020 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12350), P18, DOI 10.1007/978-3-030-58558-7_2

[2]

Bertinetto L., 2018, INT C LEARNING REPRE

[3]

Bin Liu, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12349), P438, DOI 10.1007/978-3-030-58548-8_26

[4]

Chen T, 2020, PR MACH LEARN RES, V119

[5]

Chen W., 2019, ARXIV191210917

[6]

Chen WT, 2021, PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, P2271

[7]

Chen XL, 2020, Arxiv, DOI arXiv:2003.04297

[8] Image Deformation Meta-Networks for One-Shot Learning [J].

Chen, Zitian ;

Fu, Yanwei ;

Wang, Yu-Xiong ;

Ma, Lin ;

Liu, Wei ;

Hebert, Martial .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :8672-8681

[9] Coarse-to-fine pseudo supervision guided meta-task optimization for few-shot object classification [J].

Cui, Yawen ;

Liao, Qing ;

Hu, Dewen ;

An, Wei ;

Liu, Li .

PATTERN RECOGNITION, 2022, 122

[10]

Dhillon G. S., 2019, 8 INT C LEARN REPR I

← 1 2 3 4 5 6 →