No Adversaries to Zero-Shot Learning: Distilling an Ensemble of Gaussian Feature Generators

被引:7
作者
Cavazza, Jacopo [1 ,2 ]
Murino, Vittorio [3 ,4 ]
Bue, Alessio Del [1 ,2 ]
机构
[1] Ist Italiano Tecnol, Pattern Anal & Comp Vis PAVIS, I-16152 Genoa, GE, Italy
[2] Ist Italiano Tecnol, Visual Geometry & Modelling VGM Dept, I-16152 Genoa, GE, Italy
[3] Univ Verona, Dept Comp Sci, I-37129 Verona, Italy
[4] Univ Genoa, DIBRIS, I-16126 Genoa, Italy
关键词
Ensemble; feature generation; inductive (generalized) zero-shot learning; neural distillation; object recognition;
D O I
10.1109/TPAMI.2023.3282971
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In zero-shot learning (ZSL), the task of recognizing unseen categories when no data for training is available, state-of-the-art methods generate visual features from semantic auxiliary information (e.g., attributes). In this work, we propose a valid alternative (simpler, yet better scoring) to fulfill the very same task. We observe that, if first- and second-order statistics of the classes to be recognized were known, sampling from Gaussian distributions would synthesize visual features that are almost identical to the real ones as per classification purposes. We propose a novel mathematical framework to estimate first- and second-order statistics, even for unseen classes: our framework builds upon prior compatibility functions for ZSL and does not require additional training. Endowed with such statistics, we take advantage of a pool of class-specific Gaussian distributions to solve the feature generation stage through sampling. We exploit an ensemble mechanism to aggregate a pool of softmax classifiers, each trained in a one-seen-class-out fashion to better balance the performance over seen and unseen classes. Neural distillation is finally applied to fuse the ensemble into a single architecture which can perform inference through one forward pass only. Our method, termed Distilled Ensemble of Gaussian Generators, scores favorably with respect to state-of-the-art works.
引用
收藏
页码:12167 / 12178
页数:12
相关论文
共 48 条
[1]   Label-Embedding for Image Classification [J].
Akata, Zeynep ;
Perronnin, Florent ;
Harchaoui, Zaid ;
Schmid, Cordelia .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (07) :1425-1438
[2]  
[Anonymous], 2016, PROC C EMPIRICAL MET
[3]   Adaptive Confidence Smoothing for Generalized Zero-Shot Learning [J].
Atzmon, Yuval ;
Chechik, Gal .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :11663-11672
[4]   Enhancing Visual Embeddings through Weakly Supervised Captioning for Zero-Shot Learning [J].
Bustreo, Matteo ;
Cavazza, Jacopo ;
Murino, Vittorio .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, :1298-1307
[5]   Classifier and Exemplar Synthesis for Zero-Shot Learning [J].
Changpinyo, Soravit ;
Chao, Wei-Lun ;
Gong, Boqing ;
Sha, Fei .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (01) :166-201
[6]   Predicting Visual Exemplars of Unseen Classes for Zero-Shot Learning [J].
Changpinyo, Soravit ;
Chao, Wei-Lun ;
Sha, Fei .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :3496-3505
[7]   Synthesized Classifiers for Zero-Shot Learning [J].
Changpinyo, Soravit ;
Chao, Wei-Lun ;
Gong, Boqing ;
Sha, Fei .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :5327-5336
[8]   Fine-Grained Generalized Zero-Shot Learning via Dense Attribute-Based Attention [J].
Dat Huynh ;
Elhamifar, Ehsan .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :4482-4492
[9]  
Dinu G., 2014, arXiv
[10]   Diversity with Cooperation: Ensemble Methods for Few-Shot Classification [J].
Dvornik, Nikita ;
Schmid, Cordelia ;
Mairal, Julien .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3722-3730