Learning the Redundancy-free Features for Generalized Zero-Shot Object Recognition

被引:74
作者
Han, Zongyan [1 ,2 ,3 ]
Fu, Zhenyong [1 ,2 ,3 ]
Yang, Jian [1 ,2 ,3 ]
机构
[1] Nanjing Univ Sci & Technol, PCALab, Nanjing, Jiangsu, Peoples R China
[2] Nanjing Univ Sci & Technol, Minist Educ, PCA Lab, Key Lab Intelligent Percept & Syst High Dimens In, Nanjing, Jiangsu, Peoples R China
[3] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Jiangsu Key Lab Image & Video Understanding Socia, Nanjing, Jiangsu, Peoples R China
来源
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020) | 2020年
基金
中国博士后科学基金;
关键词
D O I
10.1109/CVPR42600.2020.01288
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Zero-shot object recognition or zero-shot learning aims to transfer the object recognition ability among the semantically related categories, such as fine-grained animal or bird species. However, the images of different fine-grained objects tend to merely exhibit subtle differences in appearance, which will severely deteriorate zero-shot object recognition. To reduce the superfluous information in the fine-grained objects, in this paper, we propose to learn the redundancy-free features for generalized zero-shot learning. We achieve our motivation by projecting the original visual features into a new (redundancy-free) feature space and then restricting the statistical dependence between these two feature spaces. Furthermore, we require the projected features to keep and even strengthen the category relationship in the redundancy-free feature space. In this way, we can remove the redundant information from the visual features without losing the discriminative information. We extensively evaluate the performance on four benchmark datasets. The results show that our redundancy-free feature based generalized zero-shot learning (RFF-GZSL) approach can outperform the state-of-the-arts often by a large margin.
引用
收藏
页码:12862 / 12871
页数:10
相关论文
共 59 条
[21]   Multi-modal Cycle-Consistent Generalized Zero-Shot Learning [J].
Felix, Rafael ;
Kumar, B. G. Vijay ;
Reid, Ian ;
Carneiro, Gustavo .
COMPUTER VISION - ECCV 2018, PT VI, 2018, 11210 :21-37
[22]  
Frome A., 2013, Advances in neural information processing systems
[23]   Zero-Shot Learning on Semantic Class Prototype Graph [J].
Fu, Zhenyong ;
Xiang, Tao ;
Kodirov, Elyor ;
Gong, Shaogang .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (08) :2009-2022
[24]  
Fu ZY, 2015, PROC CVPR IEEE, P2635, DOI 10.1109/CVPR.2015.7298879
[25]  
Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672
[26]  
Gulrajani I., 2017, P 31 INT C NEUR INF, P5767
[27]  
HE KM, 2016, PROC CVPR IEEE, P770, DOI DOI 10.1109/CVPR.2016.90
[28]  
Huang He, 2019, CVPR
[29]  
King DB, 2015, ACS SYM SER, V1214, P1, DOI 10.1021/bk-2015-1214.ch001
[30]  
Kingma DP., 2014, ACS SYM SER