Semantic Boosting Cross-Modal Hashing for efficient multimedia retrieval

被引:22
作者
Wang, Ke [1 ,2 ]
Tang, Jun [1 ,2 ]
Wang, Nian [1 ]
Shao, Ling [3 ,4 ]
机构
[1] Anhui Univ, Minist Educ, Key Lab Intelligent Comp & Signal Proc, Hefei 230039, Peoples R China
[2] Anhui Univ, Sch Elect & Informat Engn, Hefei 230039, Peoples R China
[3] Northumbria Univ, Dept Comp Sci & Digital Technol, Newcastle Upon Tyne NE1 8ST, Tyne & Wear, England
[4] Nanjing Univ Informat Sci & Technol, Coll Elect & Informat Engn, Nanjing 210044, Jiangsu, Peoples R China
关键词
Cross-modal hashing; Multimedia retrieval; Boosting;
D O I
10.1016/j.ins.2015.10.028
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cross-modal hashing aims to embed data from different modalities into a common low-dimensional Hamming space, which serves as an important part in cross-modal retrieval. Although many linear projection methods were proposed to map cross-modal data into a common abstract space, the semantic similarity between cross-modal data was often ignored. To address this issue, we put forward a novel cross-modal hashing method named Semantic Boosting Cross-Modal Hashing (SBCMH). To preserve the semantic similarity, we first apply multi-class logistic regression to project heterogeneous data into a semantic space, respectively. To further narrow the semantic gap between different modalities, we then use a joint boosting framework to learn hash functions, and finally transform the mapped data representations into a measurable binary subspace. Comparative experiments on two public datasets demonstrate the effectiveness of the proposed SBCMH. (C) 2015 Elsevier Inc. All rights reserved.
引用
收藏
页码:199 / 210
页数:12
相关论文
共 39 条
[1]  
Andoni A, 2006, ANN IEEE SYMP FOUND, P459
[2]  
[Anonymous], 2005, Ph.D. thesis
[3]  
[Anonymous], 2010, P 18 ACM INT C MULT
[4]  
[Anonymous], 2013, P 21 ACM INT C MULTI, DOI DOI 10.1145/2502081.2502093
[5]  
Bronstein MM, 2010, PROC CVPR IEEE, P3594, DOI 10.1109/CVPR.2010.5539928
[6]  
Chen X., 2011, P 25 AAAI C ART INT, P313, DOI DOI 10.1109/CVPR.2016.425
[7]  
Chua T.-S., 2009, P ACM INT C IM VID R, P1
[8]   Collective Matrix Factorization Hashing for Multimodal Data [J].
Ding, Guiguang ;
Guo, Yuchen ;
Zhou, Jile .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :2083-2090
[9]  
Fan RE, 2008, J MACH LEARN RES, V9, P1871
[10]   Cross-modal Retrieval with Correspondence Autoencoder [J].
Feng, Fangxiang ;
Wang, Xiaojie ;
Li, Ruifan .
PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, :7-16