Efficient Parameter-Free Adaptive Multi-Modal Hashing

被引:22
作者
Zheng, Chaoqun [1 ]
Zhu, Lei [1 ]
Zhang, Shusen [2 ]
Zhang, Huaxiang [1 ]
机构
[1] Shandong Normal Univ, Sch Informat Sci & Engn, Jinan 250358, Peoples R China
[2] Southwestern Univ, Westa Coll, Chongqing 400715, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-modal hashing; parameter-free; fast discrete optimization; adaptive weights; BINARY-CODES; SCALE;
D O I
10.1109/LSP.2020.3008335
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Unsupervised multi-modal hashing has recently attracted broad attention in research area of large-scale multimedia retrieval for its low storage cost, high retrieval speed, and independence on semantic labels. However, the model learning process of existing methods still suffer from the problem of low efficiency: 1) Many existing methods measure the contributions of different modalities using fixed modality weights. In order to avoid over-fitting, they need an inefficient hyper-parameter adjustment process. 2) Most existing methods adopt inefficient optimization strategies to learn hash codes. In this letter, we propose an unsupervised Efficient Parameter-free Adaptive Multi-modal Hashing (EPAMH) model to adaptively capture the modality variations and preserve the discriminative semantics of multi-modal features into the binary hash codes. Moreover, we directly learn the binary codes with simple and efficient operations, which prevents the relaxing quantization errors and improves the model learning efficiency. Experiments prove the superior performance of EPAMH on three public multimedia retrieval datasets. Our source codes and testing datasets can be obtained at https://github.com/ChaoqunZheng/EPAMH.
引用
收藏
页码:1270 / 1274
页数:5
相关论文
共 28 条
[1]  
[Anonymous], 2014, Advances in Neural Information Processing Systems
[2]  
Chua T.-S., 2009, ACM INT C IM VID RET, P1
[3]   Iterative Quantization: A Procrustean Approach to Learning Binary Codes for Large-Scale Image Retrieval [J].
Gong, Yunchao ;
Lazebnik, Svetlana ;
Gordo, Albert ;
Perronnin, Florent .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (12) :2916-2929
[4]   Coherent Semantic-Visual Indexing for Large-Scale Image Retrieval in the Cloud [J].
Hong, Richang ;
Li, Lei ;
Cai, Junjie ;
Tao, Dapeng ;
Wang, Meng ;
Tian, Qi .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (09) :4128-4138
[5]   Multi-View Object Retrieval via Multi-Scale Topic Models [J].
Hong, Richang ;
Hu, Zhenzhen ;
Wang, Ruxin ;
Wang, Meng ;
Tao, Dacheng .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (12) :5814-5827
[6]  
Huiskes MJ, 2008, PROCEEDING 1 ACM INT
[7]   Deep Cross-Modal Hashing [J].
Jiang, Qing-Yuan ;
Li, Wu-Jun .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :3270-3278
[8]  
Jolliffe I.T., 1986, Principal Component Analysis, DOI DOI 10.1007/B98835
[9]  
Kumar S., 2011, P IJCAI, P1360, DOI [DOI 10.5591/978-1-57735-516-8/IJCAI11-230, DOI 10.5591/978-1-57735-516-8/IJCAI11-23]
[10]  
Li XL, 2017, AAAI CONF ARTIF INTE, P2203