Generalized Zero-Shot Learning using Identifiable Variational Autoencoders

被引：8

作者：

Gull, Muqaddas ^{[1
]}

Arif, Omar ^{[1
]}

机构：

[1] Natl Univ Sci & Technol NUST, Sch Elect Engn & Comp Sci, Islamabad 44000, Pakistan

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2022年 / 191卷

关键词：

Zero-shot learning; Generalized zero-shot learning; Non-Linear ICA; Disentangled Representat i o n Learning;

D O I：

10.1016/j.eswa.2021.116268

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep learning tasks rely heavily on a large amount of training data, but collecting and annotating data daily is not practical. Therefore, Zero-shot learning (ZSL) has become important for the applications, where no labeled data is available during training. ZSL aims at recognizing unseen classes by semantic transfer of information from seen to unseen classes. In this paper, we have proposed an identifiable VAE (iVAE) based generative model to address conventional and generalized ZSL. The key to our approach is learning disentangled representations, where each dimension is statistically independent and responsible for generating data. Thus, VAE is a commonly used model for learning disentangled independent factors of variation from the data. Our goal is to learn a latent space representing significant information, that approximates the actual data distribution. Extensive experiments on five benchmark datasets, i.e. CUB, AWA1, AWA2, SUN and aPY, are performed for further evaluation in both settings.

引用

页数：8

共 67 条

[1]

Akata Z, 2015, PROC CVPR IEEE, P2927, DOI 10.1109/CVPR.2015.7298911

[2] Label-Embedding for Attribute-Based Classification [J].

Akata, Zeynep ;

Perronnin, Florent ;

Harchaoui, Zaid ;

Schmid, Cordelia .

2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :819-826

[3] Preserving Semantic Relations for Zero-Shot Learning [J].

Annadani, Yashas ;

Biswas, Soma .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7603-7612

[4] Predicting Deep Zero-Shot Convolutional Neural Networks using Textual Descriptions [J].

Ba, Jimmy Lei ;

Swersky, Kevin ;

Fidler, Sanja ;

Salakhutdinov, Ruslan .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :4247-4255

[5] Generating Visual Representations for Zero-Shot Classification [J].

Bucher, Maxime ;

Herbin, Stephane ;

Jurie, Frederic .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, :2666-2673

[6] An Empirical Study and Analysis of Generalized Zero-Shot Learning for Object Recognition in the Wild [J].

Chao, Wei-Lun ;

Changpinyo, Soravit ;

Gong, Boqing ;

Sha, Fei .

COMPUTER VISION - ECCV 2016, PT II, 2016, 9906 :52-68

[7] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].

Chen, Liang-Chieh ;

Papandreou, George ;

Kokkinos, Iasonas ;

Murphy, Kevin ;

Yuille, Alan L. .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848

[8]

Chen Ricky T. Q., 2018, Advances in Neural Information Processing Systems, V31

[9] Fine-Grained Generalized Zero-Shot Learning via Dense Attribute-Based Attention [J].

Dat Huynh ;

Elhamifar, Ehsan .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :4482-4492

[10]

Farhadi A, 2009, PROC CVPR IEEE, P1778, DOI 10.1109/CVPRW.2009.5206772

← 1 2 3 4 5 6 7 →