Zero-Shot Learning-A Comprehensive Evaluation of the Good, the Bad and the Ugly

被引:1108
作者
Xian, Yongqin [1 ]
Lampert, Christoph H. [2 ]
Schiele, Bernt [1 ]
Akata, Zeynep [1 ,3 ]
机构
[1] Max Planck Inst Informat, D-66123 Saarbrucken, Germany
[2] Inst Sci & Technol Austria IST, A-3400 Klosterneuburg, Austria
[3] Univ Amsterdam, NL-1012 WX Amsterdam, Netherlands
关键词
Generalized zero-shot learning; transductive learning; image classification; weakly-supervised learning;
D O I
10.1109/TPAMI.2018.2857768
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the importance of zero-shot learning, i.e., classifying images where there is a lack of labeled training data, the number of proposed approaches has recently increased steadily. We argue that it is time to take a step back and to analyze the status quo of the area. The purpose of this paper is three-fold. First, given the fact that there is no agreed upon zero-shot learning benchmark, we first define a new benchmark by unifying both the evaluation protocols and data splits of publicly available datasets used for this task. This is an important contribution as published results are often not comparable and sometimes even flawed due to, e.g., pre-training on zero-shot test classes. Moreover, we propose a new zero-shot learning dataset, the Animals with Attributes 2 (AWA2) dataset which we make publicly available both in terms of image features and the images themselves. Second, we compare and analyze a significant number of the state-of-the-art methods in depth, both in the classic zero-shot setting but also in the more realistic generalized zero-shot setting. Finally, we discuss in detail the limitations of the current status of the area which can be taken as a basis for advancing it.
引用
收藏
页码:2251 / 2265
页数:15
相关论文
共 75 条
[1]   Multi-Cue Zero-Shot Learning with Strong Supervision [J].
Akata, Zeynep ;
Malinowski, Mateusz ;
Fritz, Mario ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :59-68
[2]   Label-Embedding for Image Classification [J].
Akata, Zeynep ;
Perronnin, Florent ;
Harchaoui, Zaid ;
Schmid, Cordelia .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (07) :1425-1438
[3]  
Akata Z, 2015, PROC CVPR IEEE, P2927, DOI 10.1109/CVPR.2015.7298911
[4]   Label-Embedding for Attribute-Based Classification [J].
Akata, Zeynep ;
Perronnin, Florent ;
Harchaoui, Zaid ;
Schmid, Cordelia .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :819-826
[5]   Recovering the Missing Link: Predicting Class-Attribute Associations for Unsupervised Zero-Shot Learning [J].
Al-Halah, Ziad ;
Tapaswi, Makarand ;
Stiefelhagen, Rainer .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :5975-5984
[6]  
[Anonymous], ARXIV170509474
[7]  
[Anonymous], P 3 INT C LEARNING R
[8]  
[Anonymous], 2014, P 31 INT C MACH LEAR
[9]  
[Anonymous], PROC CVPR IEEE
[10]  
[Anonymous], 2010, CALTECH UCSD BIRDS