Flexible Multi-modal Hashing for Scalable Multimedia Retrieval

被引:48
作者
Zhu, Lei [1 ]
Lu, Xu [1 ]
Cheng, Zhiyong [2 ]
Li, Jingjing [3 ]
Zhang, Huaxiang [1 ]
机构
[1] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan 250358, Peoples R China
[2] Qilu Univ Technol, Shandong Acad Sci, Natl Supercomp Ctr Jinan, Shandong Comp Sci Ctr, Jinan 250014, Peoples R China
[3] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-modal hashing; efficient discrete optimization;
D O I
10.1145/3365841
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-modal hashing methods could support efficient multimedia retrieval by combining multi-modal features for binary hash learning at the both offline training and online query stages. However, existing multi-modal methods cannot binarize the queries, when only one or part of modalities are provided. In this article, we propose a novel Flexible Multi-modal Hashing (FMH) method to address this problem. FMH learns multiple modality-specific hash codes and multi-modal collaborative hash codes simultaneously within a single model. The hash codes are flexibly generated according to the newly coming queries, which provide any one or combination of modality features. Besides, the hashing learning procedure is efficiently supervised by the pair-wise semantic matrix to enhance the discriminative capability. It could successfully avoid the challenging symmetric semantic matrix factorization and O(n(2)) storage cost of semantic matrix. Finally, we design a fast discrete optimization to learn hash codes directly with simple operations. Experiments validate the superiority of the proposed approach.
引用
收藏
页数:20
相关论文
共 44 条
[1]  
CHENG ZY, 2019, TOIS, V37
[2]  
Chua Tat-Seng, 2009, P CIVR, P48
[3]  
CUI CR, 2019, TMM, V21, P1209, DOI DOI 10.1109/TMM.2018.2875357
[4]   AMVH: Asymmetric Multi-Valued Hashing [J].
Da, Cheng ;
Xu, Shibiao ;
Ding, Kun ;
Meng, Gaofeng ;
Xiang, Shiming ;
Pan, Chunhong .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :898-906
[5]  
Di Wang Xinbo, 2018, IEEE TPAMI
[6]   Collective Matrix Factorization Hashing for Multimodal Data [J].
Ding, Guiguang ;
Guo, Yuchen ;
Zhou, Jile .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :2083-2090
[7]  
Fu Yanjie, 2018, ACM T INTEL SYST TEC, V9
[8]   Iterative Quantization: A Procrustean Approach to Learning Binary Codes for Large-Scale Image Retrieval [J].
Gong, Yunchao ;
Lazebnik, Svetlana ;
Gordo, Albert ;
Perronnin, Florent .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (12) :2916-2929
[9]   Canonical correlation analysis: An overview with application to learning methods [J].
Hardoon, DR ;
Szedmak, S ;
Shawe-Taylor, J .
NEURAL COMPUTATION, 2004, 16 (12) :2639-2664
[10]   Nonlinear Dimensionality Reduction by Locally Linear Inlaying [J].
Hou, Yuexian ;
Zhang, Peng ;
Xu, Xingxing ;
Zhang, Xiaowei ;
Li, Wenjie .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2009, 20 (02) :300-315