Multimedia retrieval by deep hashing with multilevel similarity learning

被引:5
|
作者
Liu, Qiuli [1 ,3 ]
Jin, Lu [2 ]
Li, Zechao [2 ]
Tang, Jinhui [2 ]
机构
[1] Univ Elect Sci & Technol China, Sch Informat & Software Engn, Chengdu 611731, Sichuan, Peoples R China
[2] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Jiangsu, Peoples R China
[3] Beijing Normal Univ, Beijing 100875, Peoples R China
基金
中国国家自然科学基金;
关键词
Multimedia retrieval; Deep neural networks; Hashing and multilevel similarity; correlation measurement;
D O I
10.1016/j.jvcir.2018.11.011
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep multimodal hashing has received increasing research attention in recent years due to its superior performance for large-scale multimedia retrieval. However, limited e orts have been made to explore the complex multilevel semantic structure for deep multimodal hashing. In this paper, we propose a novel deep multimodal hashing method, termed as Deep Hashing with Multilevel Similarity Learning (DHMSL), for learning compact and discriminative hash codes, which explores multilevel semantic similarity correlations of multimedia data. In DHMSL, multilevel similarity correlation is explored to learn the unified binary hash codes by exploiting the local structure and semantic label information simultaneously. Meanwhile, the bit balance and quantization constraints are taken into account to further make the unified hash codes compact. With the unified binary codes learned, two deep neural networks are jointly trained to simultaneously learn feature representations and two sets of nonlinear hash functions. Specifically, the well-designed loss functions are introduced to minimize the prediction errors of the feature representations as well as the errors between the unified binary codes and outputs of the networks. Extensive experiments on two widely-used multimodal datasets demonstrate that the proposed method can achieve the state-of-the-art performance for both image-query-text and text-query-image tasks. (C) 2019 Elsevier Inc. All rights reserved.
引用
收藏
页码:150 / 158
页数:9
相关论文
共 50 条
  • [1] Scalable Multimedia Retrieval by Deep Learning Hashing with Relative Similarity Learning
    Gao, Lianli
    Song, Jingkuan
    Zou, Fuhao
    Zhang, Dongxiang
    Shao, Jie
    MM'15: PROCEEDINGS OF THE 2015 ACM MULTIMEDIA CONFERENCE, 2015, : 903 - 906
  • [2] Deep Hashing Similarity Learning for Cross-Modal Retrieval
    Ma, Ying
    Wang, Meng
    Lu, Guangyun
    Sun, Yajun
    IEEE ACCESS, 2024, 12 : 8609 - 8618
  • [3] Improved Deep Classwise Hashing With Centers Similarity Learning for Image Retrieval
    Zhang, Ming
    Yan, Hong
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 10516 - 10523
  • [4] Deep Semantic Reconstruction Hashing for Similarity Retrieval
    Wang, Yunbo
    Ou, Xianfeng
    Liang, Jian
    Sun, Zhenan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (01) : 387 - 400
  • [5] Deep Hashing Network for Efficient Similarity Retrieval
    Zhu, Han
    Long, Mingsheng
    Wang, Jianmin
    Cao, Yue
    THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 2415 - 2421
  • [6] Central Similarity Multi-view Hashing for Multimedia Retrieval
    Zhu, Jian
    Cheng, Wen
    Cui, Yu
    Tang, Chang
    Dai, Yuyang
    Li, Yong
    Zeng, Lingfang
    WEB AND BIG DATA, PT II, APWEB-WAIM 2023, 2024, 14332 : 486 - 500
  • [7] Deep Semantic Correlation Learning based Hashing for Multimedia Cross-Modal Retrieval
    Gong, Xiaolong
    Huang, Linpeng
    Wang, Fuwei
    2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 117 - 126
  • [8] Deep Top Similarity Preserving Hashing for Image Retrieval
    Li, Qiang
    Fu, Haiyan
    Kong, Xiangwei
    IMAGE AND GRAPHICS (ICIG 2017), PT II, 2017, 10667 : 206 - 215
  • [9] Deep hashing with top similarity preserving for image retrieval
    Li, Qiang
    Fu, Haiyan
    Kong, Xiangwei
    Tian, Qi
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (18) : 24121 - 24141
  • [10] Deep hashing with top similarity preserving for image retrieval
    Qiang Li
    Haiyan Fu
    Xiangwei Kong
    Qi Tian
    Multimedia Tools and Applications, 2018, 77 : 24121 - 24141