Weakly Supervised Deep Metric Learning for Community-Contributed Image Retrieval

被引:180
作者
Li, Zechao [1 ]
Tang, Jinhui [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep; image retrieval; metric learning; weakly supervised; DIMENSIONALITY REDUCTION;
D O I
10.1109/TMM.2015.2477035
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recent years have witnessed the explosive growth of community-contributed images with rich context information, which is beneficial to the task of image retrieval. It can help us to learn a suitable metric to alleviate the semantic gap. In this paper, we propose a new distance metric learning algorithm, namely weakly-supervised deep metric learning (WDML), under the deep learning framework. It utilizes a progressive learning manner to discover knowledge by jointly exploiting the heterogeneous data structures from visual contents and user-provided tags of social images. The semantic structure in the textual space is expected to be well preserved while the problem of the noisy, incomplete or subjective tags is addressed by leveraging the visual structure in the original visual space. Besides, a sparse model with the mixed norm is imposed on the transformation matrix of the first layer in the deep architecture to compress the noisy or redundant visual features. The proposed problem is formulated as an optimization problem with a well-defined objective function and a simple yet efficient iterative algorithm is proposed to solve it. Extensive experiments on real-world social image datasets are conducted to verify the effectiveness of the proposed method for image retrieval. Encouraging experimental results are achieved compared with several representative metric learning methods.
引用
收藏
页码:1989 / 1999
页数:11
相关论文
共 39 条
[1]  
[Anonymous], 2012, P 20 ACM INT C MULT
[2]  
[Anonymous], THE VERGE MAR
[3]  
[Anonymous], 2004, Adv. Neural Inf. Process. Syst.
[4]  
[Anonymous], 2006, IEEE COMP SOC C CVPR
[5]  
[Anonymous], 2009, P ACM INT C IM VID R
[6]  
[Anonymous], sciences-medecine
[7]  
[Anonymous], 2008, P 1 ACM INT C MULTIM
[8]  
[Anonymous], SOFTPEDIA AUG
[9]  
Baghshah MS, 2009, 21ST INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-09), PROCEEDINGS, P1217
[10]  
Bar-Hillel AB, 2005, J MACH LEARN RES, V6, P937