SCRATCH: A Scalable Discrete Matrix Factorization Hashing for Cross-Modal Retrieval

被引:46
作者
Li, Chuan-Xiang [1 ]
Chen, Zhen-Duo [1 ]
Zhang, Peng-Fei [1 ]
Luo, Xin [1 ]
Nie, Liqiang [2 ]
Zhang, Wei [3 ]
Xu, Xin-Shun [1 ]
机构
[1] Shandong Univ, Sch Software, Jinan 250101, Peoples R China
[2] Shandong Univ, Sch Comp Sci & Technol, Qingdao 266237, Peoples R China
[3] Shandong Univ, Sch Control Sci & Engn, Jinan 250061, Peoples R China
来源
PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18) | 2018年
基金
中国国家自然科学基金;
关键词
Cross-Modal Retrieval; Hashing; Matrix Factorization; Discrete Optimization; IMAGE; QUANTIZATION;
D O I
10.1145/3240508.3240547
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In recent years, many hashing methods have been proposed for the cross-modal retrieval task. However, there are still some issues that need to be further explored. For example, some of them relax the binary constraints to generate the hash codes, which may generate large quantization error. Although some discrete schemes have been proposed, most of them are time-consuming. In addition, most of the existing supervised hashing methods use an n x n similarity matrix during the optimization, making them unscalable. To address these issues, in this paper, we present a novel supervised cross-modal hashing method-Scalable disCRete mATrix faCtorization Hashing, SCRATCH for short. It leverages the collective matrix factorization on the kernelized features and the semantic embedding with labels to find a latent semantic space to preserve the intra- and inter-modality similarities. In addition, it incorporates the label matrix instead of the similarity matrix into the loss function. Based on the proposed loss function and the iterative optimization algorithm, it can learn the hash functions and binary codes simultaneously. Moreover, the binary codes can be generated discretely, reducing the quantization error generated by the relaxation scheme. Its time complexity is linear to the size of the dataset, making it scalable to large-scale datasets. Extensive experiments on three benchmark datasets, namely, Wiki, MIRFlickr-25K, and NUS-WIDE, have verified that our proposed SCRATCH model outperforms several state-of-the-art unsupervised and supervised hashing methods for cross-modal retrieval.
引用
收藏
页码:1 / 9
页数:9
相关论文
共 51 条
[21]  
Liu Hong, 2016, IJCAI, P1767, DOI DOI 10.1109/TIP.2016.2564638
[22]  
Liu W, 2012, PROC CVPR IEEE, P2074, DOI 10.1109/CVPR.2012.6247912
[23]   Query-Adaptive Reciprocal Hash Tables for Nearest Neighbor Search [J].
Liu, Xianglong ;
Deng, Cheng ;
Lang, Bo ;
Tao, Dacheng ;
Li, Xuelong .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (02) :907-919
[24]   Composite Correlation Quantization for Efficient Multimodal Retrieval [J].
Long, Mingsheng ;
Cao, Yue ;
Wang, Jianmin ;
Yu, Philip S. .
SIGIR'16: PROCEEDINGS OF THE 39TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2016, :579-588
[25]  
Luo X, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P2518
[26]   Scalable Supervised Discrete Hashing for Large-Scale Search [J].
Luo, Xin ;
Wu, Ye ;
Xu, Xin-Shun .
WEB CONFERENCE 2018: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW2018), 2018, :1603-1612
[27]   Fast Scalable Supervised Hashing [J].
Luo, Xin ;
Nie, Liqiang ;
He, Xiangnan ;
Wu, Ye ;
Chen, Zhen-Duo ;
Xu, Xin-Shun .
ACM/SIGIR PROCEEDINGS 2018, 2018, :735-744
[28]   Multimodal Similarity-Preserving Hashing [J].
Masci, Jonathan ;
Bronstein, Michael M. ;
Bronstein, Alexander M. ;
Schmidhuber, Juergen .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (04) :824-830
[29]  
Nie Liqiang, 2012, P 20 ACM INT C MULTI, P59, DOI DOI 10.1145/2393347.2393363
[30]   A GENERALIZED SOLUTION OF ORTHOGONAL PROCRUSTE PROBLEM [J].
SCHONEMA.PH .
PSYCHOMETRIKA, 1966, 31 (01) :1-1