Few-shot learning in deep networks through global prototyping

Cited by: 15

Authors:

Blaes, Sebastian ^{[1]}

Burwick, Thomas ^{[1,2]}

Affiliations:

[1] Goethe Univ Frankfurt, Frankfurt Inst Adv Studies, Ruth Moufang Str 1, D-60438 Frankfurt, Germany

[2] Maastricht Univ, Maastricht Ctr Syst Biol MaCSBio, POB 616, NL-6200 MD Maastricht, Netherlands

Source:

NEURAL NETWORKS | 2017 / Vol. 94

Keywords:

Convolutional Neural Networks; Object Recognition; Deep Learning; Few-Shot Learning; Transfer Learning;

DOI:

10.1016/j.neunet.2017.07.001

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Training a deep convolutional neural network (CNN) to succeed in visual object classification usually requires a great number of examples. Here, starting from such a pre-learned CNN, we study the task of extending the network to classify additional categories on the basis of only a few examples ("few-shot learning"). We find that a simple and fast prototype-based learning procedure in the global feature layers ("Global Prototype Learning", GPL) leads to some remarkably good classification results for a large portion of the new classes. It requires only up to ten examples for the new classes to reach a plateau in performance. To understand this few-shot learning performance resulting from GPL as well as the performance of the original network, we use the t-SNE method (Maaten and Hinton, 2008) to visualize clusters of object category examples. This reveals the strong connection between classification performance and data distribution and explains why some new categories need only a few examples for learning while others resist good classification results even when trained with many more examples. © 2017 Elsevier Ltd. All rights reserved.
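The abstract does not spell out how GPL builds or compares its prototypes, so the following is only a generic nearest-prototype sketch, not the authors' procedure. It assumes that each new class is represented by the mean of a few feature vectors taken from a global layer of a pre-trained CNN, and that a query is assigned to the class with the closest prototype in Euclidean distance. The function names `build_prototypes` and `classify`, the toy two-dimensional features, and the class labels are all hypothetical.

```python
import math
from collections import defaultdict


def build_prototypes(features, labels):
    """Average the feature vectors of each class into one prototype.

    Assumption: a prototype is the per-class mean; the paper may define it differently.
    """
    sums = {}
    counts = defaultdict(int)
    for vec, label in zip(features, labels):
        if label not in sums:
            sums[label] = [0.0] * len(vec)
        sums[label] = [s + v for s, v in zip(sums[label], vec)]
        counts[label] += 1
    return {
        label: [s / counts[label] for s in total]
        for label, total in sums.items()
    }


def classify(vec, prototypes):
    """Return the label whose prototype is nearest in Euclidean distance (an assumed metric)."""
    return min(prototypes, key=lambda label: math.dist(vec, prototypes[label]))


# Toy stand-ins for global-layer features of a few examples per new class.
support_features = [[0.9, 0.1], [1.1, -0.1], [-1.0, 0.2], [-0.8, 0.0]]
support_labels = ["zebra", "zebra", "kettle", "kettle"]

prototypes = build_prototypes(support_features, support_labels)
prediction = classify([0.7, 0.05], prototypes)
print(prediction)
```

Because each prototype is just an average, adding a class under these assumptions needs no gradient-based retraining, which is one plausible reading of why the abstract calls the procedure simple and fast.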

Citation

Pages: 159-172

Page count: 14

28 references in total

[1]

[Anonymous], 2005, 2005 Advances in Neural Information Processing Systems

[2]

[Anonymous], 2016, CORR

[3]

[Anonymous], ARXIV160308754

[4] The devil is in the details: an evaluation of recent feature encoding methods [J].

Chatfield, Ken; Lempitsky, Victor; Vedaldi, Andrea; Zisserman, Andrew.

PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2011, 2011

[5]

de Vries H., 2016, EUR S ART NEUR NETW

[6]

Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

[7]

Donahue J, 2014, PR MACH LEARN RES, V32

[8]

Fritzke B., 1995, Advances in Neural Information Processing Systems 7, P625

[9] Computational Advantages of Deep Prototype-Based Learning [J].

Hecht, Thomas; Gepperth, Alexander.

ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2016, PT II, 2016, 9887: 121-127

[10]

Jetley Saumya, 2015, ARXIV151201192
