Extremely Low Bit-Rate Nearest Neighbor Search Using a Set Compression Tree

被引:8
作者
Arandjelovic, Relja [1 ]
Zisserman, Andrew [1 ]
机构
[1] Univ Oxford, Dept Engn Sci, Oxford OX1 3PJ, England
关键词
Approximate search; large-scale image search; image indexing; very large databases; quantization; QUANTIZATION; RECOGNITION; APPROXIMATE; OBJECT; SCENE; SHAPE;
D O I
10.1109/TPAMI.2014.2339821
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The goal of this work is a data structure to support approximate nearest neighbor search on very large scale sets of vector descriptors. The criteria we wish to optimize are: (i) that the memory footprint of the representation should be very small (so that it fits into main memory); and (ii) that the approximation of the original vectors should be accurate. We introduce a novel encoding method, named a Set Compression Tree (SCT), that satisfies these criteria. It is able to accurately compress 1 million descriptors using only a few bits per descriptor. The large compression rate is achieved by not compressing on a per-descriptor basis, but instead by compressing the set of descriptors jointly. We describe the encoding, decoding and use for nearest neighbor search, all of which are quite straightforward to implement. The method, tested on standard benchmarks (SIFT1M and 80 Million Tiny Images), achieves superior performance to a number of state-of-the-art approaches, including Product Quantization, Locality Sensitive Hashing, Spectral Hashing, and Iterative Quantization. For example, SCT has a lower error using 5 bits than any of the other approaches, even when they use 16 or more bits per descriptor. We also include a comparison of all the above methods on the standard benchmarks.
引用
收藏
页码:2396 / 2406
页数:11
相关论文
共 27 条
  • [1] Shape quantization and recognition with randomized trees
    Amit, Y
    Geman, D
    [J]. NEURAL COMPUTATION, 1997, 9 (07) : 1545 - 1588
  • [2] Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions
    Andoni, Alexandr
    Indyk, Piotr
    [J]. COMMUNICATIONS OF THE ACM, 2008, 51 (01) : 117 - 122
  • [3] [Anonymous], [No title captured]
  • [4] The Inverted Multi-Index
    Babenko, Artem
    Lempitsky, Victor
    [J]. 2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, : 3069 - 3076
  • [5] Bosch A, 2007, IEEE I CONF COMP VIS, P1863
  • [6] Chandrasekhar V., 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops), P32, DOI 10.1109/ICCVW.2011.6130219
  • [7] Cormen C., 1990, INTRO ALGORITHMS
  • [8] Ge T., 2013, P IEEE C COMP VIS PA, P744
  • [9] Gong YC, 2011, PROC CVPR IEEE, P817, DOI 10.1109/CVPR.2011.5995432
  • [10] Jegou H, 2008, LECT NOTES COMPUT SC, V5302, P304, DOI 10.1007/978-3-540-88682-2_24