Discrete asymmetric zero-shot hashing with application to cross-modal retrieval

被引:24
作者
Shu, Zhenqiu [1 ]
Yong, Kailing [1 ]
Yu, Jun [2 ]
Gao, Shengxiang [1 ]
Mao, Cunli [1 ]
Yu, Zhengtao [1 ]
机构
[1] Kunming Univ Sci & Technol, Sch Fac Informat Engn & Automation, Kunming, Peoples R China
[2] Zhengzhou Univ Light Ind, Coll Comp & Commun Engn, Zhengzhou, Peoples R China
关键词
Zero -shot hashing; Asymmetric; Cross -modal retrieval; Class attributes; Pairwise similarity;
D O I
10.1016/j.neucom.2022.09.037
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, cross-modal retrieval technology has attracted extensive attention with the massive growth of multimedia data. However, most cross-modal hashing methods mainly focus on exploring the retrieval of seen classes, while ignoring the retrieval of unseen classes. Therefore, traditional cross -modal hashing methods cannot achieve satisfactory performances in zero-shot retrieval. To mitigate this challenge, in this paper, we propose a novel zero-shot cross-modal retrieval method called discrete asym-metric zero-shot hashing(DAZSH), which fully exploits the supervised knowledge of multimodal data. Specifically, it integrates pairwise similarity, class attributes and semantic labels to guide zero-shot hash-ing learning. Moreover, our proposed DAZSH method combines the data features with the class attributes to obtain a semantic category representation for each category. Therefore, the relationships between seen and unseen classes can be effectively captured by learning a category representation vector for each instance. Therefore, the supervised knowledge can be transferred from the seen classes to the unseen classes. In addition, we develop an efficient discrete optimization strategy to solve the proposed model. Massive experiments on three benchmark datasets show that our proposed approach has achieved promising results in cross-modal retrieval tasks. The source code of this paper can be obtained from https://github.com/szq0816/DAZSH.(c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:366 / 379
页数:14
相关论文
共 50 条
  • [31] Two-Stage Supervised Discrete Hashing for Cross-Modal Retrieval
    Zhang, Donglin
    Xiao-Jun Wu
    Xu, Tianyang
    Kittler, Josef
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (11): : 7014 - 7026
  • [32] Deep cross-modal discriminant adversarial learning for zero-shot sketch-based image retrieval
    Jiao, Shichao
    Han, Xie
    Xiong, Fengguang
    Yang, Xiaowen
    Han, Huiyan
    He, Ligang
    Kuang, Liqun
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (16) : 13469 - 13483
  • [33] Hypergraph-Based Discrete Hashing Learning for Cross-Modal Retrieval
    Tang, Dianjuan
    Cui, Hui
    Shi, Dan
    Ji, Hua
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT I, 2018, 11164 : 776 - 786
  • [34] SCRATCH: A Scalable Discrete Matrix Factorization Hashing for Cross-Modal Retrieval
    Li, Chuan-Xiang
    Chen, Zhen-Duo
    Zhang, Peng-Fei
    Luo, Xin
    Nie, Liqiang
    Zhang, Wei
    Xu, Xin-Shun
    PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, : 1 - 9
  • [35] Latent semantic-enhanced discrete hashing for cross-modal retrieval
    Liu, Yun
    Ji, Shujuan
    Fu, Qiang
    Zhao, Jianli
    Zhao, Zhongying
    Gong, Maoguo
    APPLIED INTELLIGENCE, 2022, 52 (14) : 16004 - 16020
  • [36] Latent semantic-enhanced discrete hashing for cross-modal retrieval
    Yun Liu
    Shujuan Ji
    Qiang Fu
    Jianli Zhao
    Zhongying Zhao
    Maoguo Gong
    Applied Intelligence, 2022, 52 : 16004 - 16020
  • [37] Asymmetric Supervised Fusion-Oriented Hashing for Cross-Modal Retrieval
    Yang, Zhan
    Deng, Xiyin
    Guo, Lin
    Long, Jun
    IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (02) : 851 - 864
  • [38] Deep cross-modal discriminant adversarial learning for zero-shot sketch-based image retrieval
    Shichao Jiao
    Xie Han
    Fengguang Xiong
    Xiaowen Yang
    Huiyan Han
    Ligang He
    Liqun Kuang
    Neural Computing and Applications, 2022, 34 : 13469 - 13483
  • [39] Efficient discrete latent semantic hashing for scalable cross-modal retrieval
    Lu, Xu
    Zhu, Lei
    Cheng, Zhiyong
    Song, Xuemeng
    Zhang, Huaxiang
    SIGNAL PROCESSING, 2019, 154 : 217 - 231
  • [40] Semi-supervised discrete hashing for efficient cross-modal retrieval
    Wang, Xingzhi
    Liu, Xin
    Peng, Shu-Juan
    Zhong, Bineng
    Chen, Yewang
    Du, Ji-Xiang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (35-36) : 25335 - 25356