Deep ranking structural support vector machine for image tagging

被引:10
作者
Chen, Gang [1 ]
Xu, Ran [1 ]
Yang, Zhi [1 ]
机构
[1] SUNY Buffalo, Dept Comp Sci & Engn, Buffalo, NY 14260 USA
关键词
Image tagging; Maximum margin learning; Deep learning; Ranking;
D O I
10.1016/j.patrec.2017.09.012
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image tagging is an active research topic in computer vision and machine learning, due to its wide applications on semantic search and image retrieval. Although recent approaches based on deep neural networks can learn better representations to significantly boost performance, no structural information has been exploited. For example, "sky" and "ground" both have higher probability to appear simultaneously in the same outdoor scene. In this paper, we propose a deep ranking structural Support Vector Machine (RSSVM) for image tagging. We exploit deep learning for representation learning and propose a new ranking function over the learned features with label correlation. Specifically, we incorporate the global context information between labels into our ranking function and then formulate the multi-labeling problem as a ranking problem to handle structured output prediction. We transfer parameters from the existed convolutional neural network (CNN) model, and add additional two fully connected layers to build our deep neural structure. We evaluate our method on three widely used datasets, and show promising results over competitive baselines. (c) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:30 / 38
页数:9
相关论文
共 47 条
[1]  
[Anonymous], 2011, NIPS
[2]  
[Anonymous], P 12 EUR C COMP
[3]  
[Anonymous], 2001, ICCV
[4]  
[Anonymous], 2010, ICML
[5]  
[Anonymous], 2013, ICML
[6]  
[Anonymous], 2002, P ACM SIGKDD KDD 200
[7]  
[Anonymous], NIPS
[8]  
[Anonymous], 2007, CVPR
[9]  
[Anonymous], DEEP CONVOLUTIONAL R
[10]  
[Anonymous], 2003, NIPS