Accelerated Approximate Nearest Neighbors Search Through Hierarchical Product Quantization

被引:8
作者
Abdelhadi, Ameer M. S. [1 ]
Bouganis, Christos-Savvas [1 ]
Constantinides, George A. [1 ]
机构
[1] Imperial Coll London, Dept Elect & Elect Engn, London SW7 2AZ, England
来源
2019 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (ICFPT 2019) | 2019年
基金
英国工程与自然科学研究理事会;
关键词
Approximate search; similarity search; nearest neighbor search; online indexing; high-dimensional indexing; product quantization; vector quantization; artificial intelligence; ALGORITHMS;
D O I
10.1109/ICFPT47387.2019.00019
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
A fundamental recurring task in many machine learning applications is the search for the Nearest Neighbor in high dimensional metric spaces. Towards answering queries in large scale problems, state-of-the-art methods employ Approximate Nearest Neighbors (ANN) search, a search that returns the nearest neighbor with high probability, as well as techniques that compress the dataset. Product-Quantization (PQ) based ANN search methods have demonstrated state-of-the-art performance in several problems, including classification, regression and information retrieval. The dataset is encoded into a Cartesian product of multiple low-dimensional codebooks, enabling faster search and higher compression. Being intrinsically parallel, PQ-based ANN search approaches are amendable for hardware acceleration. This paper proposes a novel Hierarchical PQ (HPQ) based ANN search method as well as an FPGA-tailored architecture for its implementation that outperforms current state of the art systems. HPQ gradually refines the search space, reducing the number of data compares and enabling a pipelined search. The mapping of the architecture on a Stratix 10 FPGA device demonstrates over x 250 speedups over current state-of-the-art systems, opening the space for addressing larger datasets and/or improving the query times of current systems.
引用
收藏
页码:90 / 98
页数:9
相关论文
共 34 条
  • [1] Modular Switched Multiported SRAM-Based Memories
    Abdelhadi, Ameer M. S.
    Lemieux, Guy G. F.
    [J]. ACM TRANSACTIONS ON RECONFIGURABLE TECHNOLOGY AND SYSTEMS, 2016, 9 (03)
  • [2] A Multi-Ported Memory Compiler Utilizing True Dual-port BRAMs
    Abdelhadi, Ameer M. S.
    Lemieux, Guy G. F.
    [J]. 2016 IEEE 24TH ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2016, : 140 - 147
  • [3] Andoni A, 2006, ANN IEEE SYMP FOUND, P459
  • [4] [Anonymous], 2012, P INT C NEUR INF PRO
  • [5] Babenko A, 2015, PROC CVPR IEEE, P4240, DOI 10.1109/CVPR.2015.7299052
  • [6] Additive Quantization for Extreme Vector Compression
    Babenko, Artem
    Lempitsky, Victor
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 931 - 938
  • [7] Batcher KE., 1968, Proceedings of the Spring Joint Computer Conference, P307, DOI DOI 10.1145/1468075.1468121
  • [8] Bewley A., 2013, PROC AUSTRAL C ROBOT, V2, P1
  • [9] Beyer K, 1999, LECT NOTES COMPUT SC, V1540, P217
  • [10] In defense of Nearest-Neighbor based image classification
    Boiman, Oren
    Shechtman, Eli
    Irani, Michal
    [J]. 2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, : 1992 - +