A High-Dimensional Sparse Hashing Framework for Cross-Modal Retrieval

被引:28
作者
Wang, Yongxin [1 ]
Chen, Zhen-Duo [2 ]
Luo, Xin [2 ]
Xu, Xin-Shun [2 ]
机构
[1] Shandong Jianzhu Univ, Sch Comp Sci & Technol, Jinan 250101, Peoples R China
[2] Shandong Univ, Sch Software, Jinan 250101, Peoples R China
基金
中国国家自然科学基金;
关键词
Codes; Semantics; Encoding; Task analysis; Optimization; Streaming media; Sparse matrices; Sparse hashing; high-dimensional hashing; cross-modal hashing; online hashing; fine-grained similarity;
D O I
10.1109/TCSVT.2022.3195874
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In recent years, many achievements have been made in improving the performance of supervised cross-modal hashing. However, it remains an open issue on how to fully explore the data information to achieve fine-grained retrieval performance. Most methods employ logical labels or a binary similarity matrix to supervise the hash learning, losing a lot of useful information. From another point of view, the low expressiveness of dense hash code severely limits its preservation of fine-grained data information. With this motivation, in this paper, we propose a high-dimensional sparse hashing framework for cross-modal retrieval, i.e., High-dimensional Sparse Cross-modal Hashing, HSCH for short. It leverages not only high-level semantic labels but also low-level multi-modal features to construct a fine-grained similarity. In particular, based on two well-designed rules, i.e., multi-level and prioritized, it is able to avoid semantic conflicts. Additionally, it leverages the strong power of high-dimensional sparse hash codes to preserve the fine-grained similarity. Then, it efficiently solves the sparse and discrete constraints of sparse hash codes through an efficient discrete optimization algorithm. In light of this, it is much more efficient and scalable to large-scale datasets. More importantly, the computational complexity of HSCH in the retrieval phase is as efficient as those naive hashing methods that use dense hash codes. Moreover, to support online learning scenarios, this paper also extends HSCH into an online version, i.e., HSCH_on. Extensive experiments on three benchmark datasets demonstrate the superiority of our framework compared with some state-of-the-art cross-modal hashing approaches in terms of both accuracy and efficiency.
引用
收藏
页码:8822 / 8836
页数:15
相关论文
共 65 条
  • [1] [Anonymous], 2009, P ACM INT C IM VID R
  • [2] MIHash: Online Hashing with Mutual Information
    Cakir, Fatih
    He, Kun
    Bargal, Sarah Adel
    Sclaroff, Stan
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 437 - 445
  • [3] Online supervised hashing
    Cakir, Fatih
    Bargal, Sarah Adel
    Sclaroff, Stan
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2017, 156 : 162 - 173
  • [4] Charikar Moses S., 2002, P 34 ACM S THEORY CO, P380
  • [5] The devil is in the details: an evaluation of recent feature encoding methods
    Chatfield, Ken
    Lempitsky, Victor
    Vedaldi, Andrea
    Zisserman, Andrew
    [J]. PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2011, 2011,
  • [6] Fine-Grained Hashing With Double Filtering
    Chen, Zhen-Duo
    Luo, Xin
    Wang, Yongxin
    Guo, Shanqing
    Xu, Xin-Shun
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 1671 - 1683
  • [7] SCRATCH: A Scalable Discrete Matrix Factorization Hashing Framework for Cross-Modal Retrieval
    Chen, Zhen-Duo
    Li, Chuan-Xiang
    Luo, Xin
    Nie, Liqiang
    Zhang, Wei
    Xu, Xin-Shun
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (07) : 2262 - 2275
  • [8] A Two-Step Cross-Modal Hashing by Exploiting Label Correlations and Preserving Similarity in Both Steps
    Chen, Zhen-Duo
    Wang, Yongxin
    Li, Hui-Qiong
    Luo, Xin
    Nie, Liqiang
    Xu, Xin-Shun
    [J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 1694 - 1702
  • [9] Cong Bai, 2020, ICMR '20: Proceedings of the 2020 International Conference on Multimedia Retrieval, P525, DOI 10.1145/3372278.3390711
  • [10] A neural algorithm for a fundamental computing problem
    Dasgupta, Sanjoy
    Stevens, Charles F.
    Navlakha, Saket
    [J]. SCIENCE, 2017, 358 (6364) : 793 - 796