Learning to Hash With Dimension Analysis Based Quantizer for Image Retrieval

被引:10
作者
Cao, Yuan [1 ]
Qi, Heng [2 ]
Gui, Jie [3 ]
Li, Keqiu [4 ]
Tang, Yuan Yan [5 ]
Kwok, James Tin-Yau [6 ]
机构
[1] Ocean Univ China, Sch Informat Sci & Engn, Qingdao 266100, Peoples R China
[2] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian 116023, Peoples R China
[3] Southeast Univ, Sch Cyber Sci & Engn, Nanjing 211189, Peoples R China
[4] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300350, Peoples R China
[5] Univ Macao, Dept Comp & Informat Sci, Macau 999078, Peoples R China
[6] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Hong Kong 999077, Peoples R China
基金
国家重点研发计划;
关键词
Quantization (signal); Data acquisition; Image retrieval; Nearest neighbor methods; Hamming distance; Symmetric matrices; Binary codes; Approximate nearest neighbor search; hashing algorithms; image retrieval; quantization; BIG DATA; BINARY; SEARCH; CODES;
D O I
10.1109/TMM.2020.3033118
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The last few years have witnessed the rise of the big data era, in which approximate nearest neighbor search is a fundamental problem in many applications, such as large-scale image retrieval. Recently, many research results demonstrate that hashing can achieve promising performance due to its appealing storage and search efficiency. Since the complex optimization problems for loss functions are difficult to solve, most hashing methods decompose the hash codes learning problem into two steps: projection and quantization. In the quantization step, binary codes are widely used because ranking them by Hamming distance is very efficient. However, the huge information loss produced by the quantization step should be reduced in applications, such as image retrieval where high search accuracy is required. Since many two-step hashing methods produce uneven projected dimensions in the projection step, in this paper, we propose a novel dimension analysis based quantization method (DAQ) on two-step hashing methods for image retrieval. We first perform an importance analysis of the projected dimensions and select a subset of them that are more informative than the others, then we divide the selected projected dimensions into several regions with our quantizer. Every region is quantized with its corresponding codebook. Finally, the similarity between two hash codes is estimated by Manhattan distance between their corresponding codebooks, which is also efficient. We conduct experiments on three public benchmarks containing up to one million descriptors and show that the proposed DAQ method consistently leads to significant accuracy improvements over state-of-the-art quantization methods.
引用
收藏
页码:3907 / 3918
页数:12
相关论文
共 46 条
  • [1] [Anonymous], 2012, P INT C NEUR INF PRO
  • [2] Beygelzimer A., 2006, ICML, DOI DOI 10.1145/1143844.1143857
  • [3] Binary Hashing for Approximate Nearest Neighbor Search on Big Data: A Survey
    Cao, Yuan
    Qi, Heng
    Zhou, Wenrui
    Kato, Jien
    Li, Keqiu
    Liu, Xiulong
    Gui, Jie
    [J]. IEEE ACCESS, 2018, 6 : 2039 - 2054
  • [4] Nonlinear Discrete Hashing
    Chen, Zhixiang
    Lu, Jiwen
    Feng, Jianjiang
    Zhou, Jie
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (01) : 123 - 135
  • [5] Image retrieval: Ideas, influences, and trends of the new age
    Datta, Ritendra
    Joshi, Dhiraj
    Li, Jia
    Wang, James Z.
    [J]. ACM COMPUTING SURVEYS, 2008, 40 (02)
  • [6] Cross-Modal Hashing via Rank-Order Preserving
    Ding, Kun
    Fan, Bin
    Huo, Chunlei
    Xiang, Shiming
    Pan, Chunhong
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (03) : 571 - 585
  • [7] Compact Hash Codes for Efficient Visual Descriptors Retrieval in Large Scale Databases
    Ercoli, Simone
    Bertini, Marco
    Del Bimbo, Alberto
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (11) : 2521 - 2532
  • [8] Friedman J. H, T MATH SOFTW, V3, P209
  • [9] Learning Binary Codes for High-Dimensional Data Using Bilinear Projections
    Gong, Yunchao
    Kumar, Sanjiv
    Rowley, Henry A.
    Lazebnik, Svetlana
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 484 - 491
  • [10] Gong YC, 2011, PROC CVPR IEEE, P817, DOI 10.1109/CVPR.2011.5995432