Manifold embedding for zero-shot recognition

被引:2
作者
Ji, Zhong [1 ]
Yu, Xuejie [1 ]
Yu, Yunlong [1 ]
He, Yuqing [1 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin, Peoples R China
来源
COGNITIVE SYSTEMS RESEARCH | 2019年 / 55卷
基金
中国国家自然科学基金;
关键词
Zero-shot recognition; Manifold embedding; Image recognition; Semantic embedding;
D O I
10.1016/j.cogsys.2018.12.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Zero-Shot Recognition (ZSR) has gained its popularity recently owing to its promising characteristic that extends the classifiers to the unseen classes. It is typically addressed by resorting to a class semantic space to transfer the knowledge from the seen classes to unseen ones. Therefore, constructing the effective interactions between the visual space and the class semantic space is the key for ZSR. In this paper, under the assumption that the distribution of the semantic categories in the semantic space has an intrinsic manifold structure, we propose two manifold embedding-based ZSR approaches to capture the intrinsic structures of both the visual space and the class semantic space, i.e., ME-ZSR and MCCA-ZSR. Specifically, ME-ZSR builds embedding from visual space to semantic space, while MCCA-ZSR explores to embed both visual and semantic features into a common space. The linear, closed-form solutions make both methods efficient to optimize. Extensive experiments on three popular datasets AwA, CUB and NAB validate the effectiveness of both methods. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:34 / 43
页数:10
相关论文
共 41 条
[1]  
Akata Z, 2015, PROC CVPR IEEE, P2927, DOI 10.1109/CVPR.2015.7298911
[2]  
[Anonymous], 2014, INF SOFTW TECHNOL
[3]  
[Anonymous], 2018, SIEM REAP
[4]  
[Anonymous], 2015, PROC CVPR IEEE
[5]   Predicting Deep Zero-Shot Convolutional Neural Networks using Textual Descriptions [J].
Ba, Jimmy Lei ;
Swersky, Kevin ;
Fidler, Sanja ;
Salakhutdinov, Ruslan .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :4247-4255
[6]   Synthesized Classifiers for Zero-Shot Learning [J].
Changpinyo, Soravit ;
Chao, Wei-Lun ;
Gong, Boqing ;
Sha, Fei .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :5327-5336
[7]  
Dhillon P., 2011, Advances in Neural Information Processing Systems, P199
[8]   Link the head to the "beak": Zero Shot Learning from Noisy Text Description at Part Precision [J].
Elhoseiny, Mohamed ;
Zhu, Yizhe ;
Zhang, Han ;
Elgammal, Ahmed .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6288-6297
[9]   Write a Classifier: Predicting Visual Classifiers from Unstructured Text [J].
Elhoseiny, Mohamed ;
Elgammal, Ahmed ;
Saleh, Babak .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2539-2553
[10]  
Frome A., 2013, Neural information processing systems, P2121