Semantic Boosting Cross-Modal Hashing for efficient multimedia retrieval

被引：22

作者：

Wang, Ke ^{[1
,2
]}

Tang, Jun ^{[1
,2
]}

Wang, Nian ^{[1
]}

Shao, Ling ^{[3
,4
]}

机构：

[1] Anhui Univ, Minist Educ, Key Lab Intelligent Comp & Signal Proc, Hefei 230039, Peoples R China

[2] Anhui Univ, Sch Elect & Informat Engn, Hefei 230039, Peoples R China

[3] Northumbria Univ, Dept Comp Sci & Digital Technol, Newcastle Upon Tyne NE1 8ST, Tyne & Wear, England

[4] Nanjing Univ Informat Sci & Technol, Coll Elect & Informat Engn, Nanjing 210044, Jiangsu, Peoples R China

来源：

INFORMATION SCIENCES | 2016年 / 330卷

关键词：

Cross-modal hashing; Multimedia retrieval; Boosting;

D O I：

10.1016/j.ins.2015.10.028

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Cross-modal hashing aims to embed data from different modalities into a common low-dimensional Hamming space, which serves as an important part in cross-modal retrieval. Although many linear projection methods were proposed to map cross-modal data into a common abstract space, the semantic similarity between cross-modal data was often ignored. To address this issue, we put forward a novel cross-modal hashing method named Semantic Boosting Cross-Modal Hashing (SBCMH). To preserve the semantic similarity, we first apply multi-class logistic regression to project heterogeneous data into a semantic space, respectively. To further narrow the semantic gap between different modalities, we then use a joint boosting framework to learn hash functions, and finally transform the mapped data representations into a measurable binary subspace. Comparative experiments on two public datasets demonstrate the effectiveness of the proposed SBCMH. (C) 2015 Elsevier Inc. All rights reserved.

引用

页码：199 / 210

页数：12

共 39 条

[1]

Andoni A, 2006, ANN IEEE SYMP FOUND, P459

[2]

[Anonymous], 2005, Ph.D. thesis

[3]

[Anonymous], 2010, P 18 ACM INT C MULT

[4]

[Anonymous], 2013, P 21 ACM INT C MULTI, DOI DOI 10.1145/2502081.2502093

[5]

Bronstein MM, 2010, PROC CVPR IEEE, P3594, DOI 10.1109/CVPR.2010.5539928

[6]

Chen X., 2011, P 25 AAAI C ART INT, P313, DOI DOI 10.1109/CVPR.2016.425

[7]

Chua T.-S., 2009, P ACM INT C IM VID R, P1

[8] Collective Matrix Factorization Hashing for Multimodal Data [J].

Ding, Guiguang ;

Guo, Yuchen ;

Zhou, Jile .

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :2083-2090

[9]

Fan RE, 2008, J MACH LEARN RES, V9, P1871

[10] Cross-modal Retrieval with Correspondence Autoencoder [J].

Feng, Fangxiang ;

Wang, Xiaojie ;

Li, Ruifan .

PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, :7-16

← 1 2 3 4 →