Quicker ADC : Unlocking the Hidden Potential of Product Quantization With SIMD

被引:6
作者
Andre, Fabien
Kermarrec, Anne-Marie [1 ,2 ]
Le Scouarnec, Nicolas [3 ]
机构
[1] Ecole Polytech Fed Lausanne, CH-1015 Lausanne, Switzerland
[2] Mediego, F-35510 Cesson Sevigne, France
[3] Broadpeak, F-35510 Cesson Sevigne, France
关键词
Image databases; information search and retrieval; nearest neighbor search; product quantization; SIMD; SEARCH;
D O I
10.1109/TPAMI.2019.2952606
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Efficient Nearest Neighbor (NN) search in high-dimensional spaces is a foundation of many multimedia retrieval systems. A common approach is to rely on Product Quantization, which allows the storage of large vector databases in memory and efficient distance computations. Yet, implementations of nearest neighbor search with Product Quantization have their performance limited by the many memory accesses they perform. Following this observation, Andre et al. proposed Quick ADC with up to 6x faster implementations of PQ m x 4 product quantizers (PQ) leveraging specific SIMD instructions. Quicker ADC is a generalization of Quick ADC not limited to PQ m x 4 codes and supporting AVX-512, the latest revision of SIMD instruction set. In doing so, Quicker ADC faces the challenge of using efficiently 5,6 and 7-bit shuffles that do not align to computer bytes or words. To this end, we introduce (i) irregular product quantizers combining sub-quantizers of different granularity and (ii) split tables allowing lookup tables larger than registers. We evaluate Quicker ADC with multiple indexes including Inverted Multi-Indexes and IVF HNSW and show that it outperforms the reference optimized implementations (i.e., FAISS and polysemous codes) for numerous configurations. Finally, we release an open-source fork of FAISS enhanced with Quicker ADC.
引用
收藏
页码:1666 / 1677
页数:12
相关论文
共 36 条
[1]   Accelerated Nearest Neighbor Search with Quick ADC [J].
Andre, Fabien ;
Kermarrec, Anne-Marie ;
Le Scouarnec, Nicolas .
PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR'17), 2017, :164-171
[2]  
André F, 2015, PROC VLDB ENDOW, V9, P288
[3]   AnnArbor: Approximate Nearest Neighbors Using Arborescence Coding [J].
Babenko, Artem ;
Lempitsky, Victor .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :4895-4903
[4]   Efficient Indexing of Billion-Scale datasets of deep descriptors [J].
Babenko, Artem ;
Lempitsky, Victor .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2055-2063
[5]  
Babenko A, 2015, PROC CVPR IEEE, P4240, DOI 10.1109/CVPR.2015.7299052
[6]   The Inverted Multi-Index [J].
Babenko, Artem ;
Lempitsky, Victor .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (06) :1247-1260
[7]   Additive Quantization for Extreme Vector Compression [J].
Babenko, Artem ;
Lempitsky, Victor .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :931-938
[8]   The Inverted Multi-Index [J].
Babenko, Artem ;
Lempitsky, Victor .
2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, :3069-3076
[9]   Revisiting the Inverted Indices for Billion-Scale Approximate Nearest Neighbors [J].
Baranchuk, Dmitry ;
Babenko, Artem ;
Malkov, Yury .
COMPUTER VISION - ECCV 2018, PT XII, 2018, 11216 :209-224
[10]   Bolt: Accelerated Data Mining with Fast Vector Compression [J].
Blalock, Davis W. ;
Guttag, John V. .
KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, :727-735