Zero-Shot Learning with Deep Canonical Correlation Analysis

被引:1
作者
Ji, Zhong [1 ]
Yu, Xuejie [1 ]
Pang, Yanwei [1 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
来源
COMPUTER VISION, PT III | 2017年 / 773卷
基金
中国国家自然科学基金;
关键词
Zero-shot learning; Image classification; Canonical Correlation Analysis; Deep neural network;
D O I
10.1007/978-981-10-7305-2_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Zero-shot learning (ZSL) improves the scalability of conventional image classification systems by allowing some testing categories having no training data. One key component is to learn a shared embedding space where both side information of object categories and visual representation of object images can be projected to for nearest neighbor search. Although great progress has been made, existing approaches mainly focus on shallow models, which cannot learn a strong generalized embedding space. To this end, this paper proposes a novel deep embedding model for ZSL, which formulates the embedding space with Deep Canonical Correlation Analysis (DCCA). Specifically, the side information and the visual representation are transformed via two independent deep neural networks, and then they are highly linearly correlated in the final output layer. Extensive experiments on two publicly popular datasets demonstrated the effectiveness and superiority the proposed approach.
引用
收藏
页码:209 / 219
页数:11
相关论文
共 16 条
[1]  
Akata Z., 2014, 27 IEEE C COMP VIS P, P2927
[2]  
[Anonymous], 2011, TECHNICAL REPORT
[3]  
[Anonymous], 2014, DEEP INSIDE CONVOLUT
[4]  
[Anonymous], 2013, NeurIPS
[5]  
[Anonymous], 2016, ARXIV160201024
[6]  
Farhadi A, 2009, PROC CVPR IEEE, P1778, DOI 10.1109/CVPRW.2009.5206772
[7]  
Guo Y., 2016, International Joint Conference on Artificial Intelligence, P3382
[8]   Manifold regularized cross-modal embedding for zero-shot learning [J].
Ji, Zhong ;
Yu, Yunlong ;
Pang, Yanwei ;
Guo, Jichang ;
Zhang, Zhongfei .
INFORMATION SCIENCES, 2017, 378 :48-58
[9]  
Lampert CH, 2009, PROC CVPR IEEE, P951, DOI 10.1109/CVPRW.2009.5206594
[10]  
Lazaridou A, 2014, PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, P1403