Adaptive Label Correlation Based Asymmetric Discrete Hashing for Cross-Modal Retrieval

被引:61
作者
Li, Huaxiong [1 ]
Zhang, Chao [1 ]
Jia, Xiuyi [2 ]
Gao, Yang [3 ]
Chen, Chunlin [1 ]
机构
[1] Nanjing Univ, Dept Control & Syst Engn, Nanjing 210093, Jiangsu, Peoples R China
[2] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210014, Jiangsu, Peoples R China
[3] Nanjing Univ, Dept Comp Sci & Technol, Natl Key Lab Novel Software Technol, Nanjing 210023, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantics; Hash functions; Binary codes; Correlation; Training; Quantization (signal); Optimization; Cross-modal retrieval; discrete hashing; label correlation; similarity preservation; BINARY-CODES; QUANTIZATION;
D O I
10.1109/TKDE.2021.3102119
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hashing methods have captured much attention for cross-modal retrieval in recent years. Most existing approaches mainly focus on preserving the semantic similarity across heterogeneous modalities in a shared Hamming subspace, while the label information and potential correlations of multi-label semantics are not fully excavated. In this article, a novel Adaptive Label correlation based asymmEtric Cross-modal Hashing method, i.e., ALECH, is proposed for cross-modal retrieval. ALECH decomposes hash learning into two steps, hash codes learning and hash functions learning. For hash codes learning, the high-order semantic label correlations are adaptively exploited to guide the latent feature learning, while simultaneously generating the binary codes in a discrete manner. The asymmetric strategy is utilized to connect the latent feature space and Hamming space, and preserve the pairwise semantic similarity. Different from other two-step methods that directly adopt simple least-squares regression to learn hash functions based on binary codes, ALECH leverages both hash codes and semantic labels for hash functions learning which further preserves the similarity. Experiments on several benchmark datasets demonstrate that the proposed ALECH method outperforms the state-of-the-art cross-hashing methods.
引用
收藏
页码:1185 / 1199
页数:15
相关论文
共 59 条
[1]  
[Anonymous], 2009, Advances in Neural Information Processing Systems
[2]   Correlation Autoencoder Hashing for Supervised Cross-Modal Search [J].
Cao, Yue ;
Long, Mingsheng ;
Wang, Jianmin ;
Zhu, Han .
ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2016, :197-204
[3]   Extended ADMM and BCD for nonseparable convex minimization models with quadratic coupling terms: convergence analysis and insights [J].
Chen, Caihua ;
Li, Min ;
Liu, Xin ;
Ye, Yinyu .
MATHEMATICAL PROGRAMMING, 2019, 173 (1-2) :37-77
[4]   Inertial Proximal ADMM for Linearly Constrained Separable Convex Optimization [J].
Chen, Caihua ;
Chan, Raymond H. ;
Ma, Shiqian ;
Yang, Junfeng .
SIAM JOURNAL ON IMAGING SCIENCES, 2015, 8 (04) :2239-2267
[5]  
Chen H, 2020, ARXIV
[6]   Strongly Constrained Discrete Hashing [J].
Chen, Yong ;
Tian, Zhibao ;
Zhang, Hui ;
Wang, Jun ;
Zhang, Dell .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :3596-3611
[7]   SCRATCH: A Scalable Discrete Matrix Factorization Hashing Framework for Cross-Modal Retrieval [J].
Chen, Zhen-Duo ;
Li, Chuan-Xiang ;
Luo, Xin ;
Nie, Liqiang ;
Zhang, Wei ;
Xu, Xin-Shun .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (07) :2262-2275
[8]   Triplet-Based Deep Hashing Network for Cross-Modal Retrieval [J].
Deng, Cheng ;
Chen, Zhaojia ;
Liu, Xianglong ;
Gao, Xinbo ;
Tao, Dacheng .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (08) :3893-3903
[9]   Large-Scale Cross-Modality Search via Collective Matrix Factorization Hashing [J].
Ding, Guiguang ;
Guo, Yuchen ;
Zhou, Jile ;
Gao, Yue .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (11) :5427-5440
[10]   Online hash tracking with spatio-temporal saliency auxiliary [J].
Fang, Jianwu ;
Xu, Hongke ;
Wang, Qi ;
Wu, Tianjun .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2017, 160 :57-72