GGNN: Graph-Based GPU Nearest Neighbor Search

被引:22
作者
Groh, Fabian [1 ]
Ruppert, Lukas [2 ]
Wieschollek, Patrick [1 ]
Lensch, Hendrik P. A. [2 ]
机构
[1] Amazon Tubingen, D-72076 Tubingen, Germany
[2] Univ Tubingen, Comp Grap Grp, D-72074 Tubingen, Germany
关键词
Graphics processing units; Indexes; Quantization (signal); Nearest neighbor methods; Search problems; Parallel processing; Big Data; Nearest neighbor searches; graph and tree search strategies; information retrieval; approximate search; similarity search; big data; PRODUCT QUANTIZATION; CONSTRUCTION; ALGORITHMS; VECTOR;
D O I
10.1109/TBDATA.2022.3161156
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Approximate nearest neighbor (ANN) search in high dimensions is an integral part of several computer vision systems and gains importance in deep learning with explicit memory representations. Since PQT (Wieschollek et al., 2016), FAISS (Johnson et al., 2021), and SONG (Zhao et al., 2020) started to leverage the massive parallelism offered by GPUs, GPU-based implementations are a crucial resource for today's state-of-the-art ANN methods. While most of these methods allow for faster queries, less emphasis is devoted to accelerating the construction of the underlying index structures. In this paper, we propose a novel GPU-friendly search structure based on nearest neighbor graphs and information propagation on graphs. Our method is designed to take advantage of GPU architectures to accelerate the hierarchical construction of the index structure and for performing the query. Empirical evaluation shows that GGNN significantly surpasses the state-of-the-art CPU- and GPU-based systems in terms of build-time, accuracy and search speed.
引用
收藏
页码:267 / 279
页数:13
相关论文
共 38 条
[1]   Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions [J].
Andoni, Alexandr ;
Indyk, Piotr .
COMMUNICATIONS OF THE ACM, 2008, 51 (01) :117-122
[2]  
[Anonymous], 2014, P 2014 C EMP METH NA, DOI DOI 10.3115/V1/D14-1162
[3]   ANN-Benchmarks: A benchmarking tool for approximate nearest neighbor algorithms [J].
Aumuller, Martin ;
Bernhardsson, Erik ;
Faithfull, Alexander .
INFORMATION SYSTEMS, 2020, 87
[4]   Efficient Indexing of Billion-Scale datasets of deep descriptors [J].
Babenko, Artem ;
Lempitsky, Victor .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2055-2063
[5]  
Babenko A, 2015, PROC CVPR IEEE, P4240, DOI 10.1109/CVPR.2015.7299052
[6]   Additive Quantization for Extreme Vector Compression [J].
Babenko, Artem ;
Lempitsky, Victor .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :931-938
[7]   The Inverted Multi-Index [J].
Babenko, Artem ;
Lempitsky, Victor .
2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, :3069-3076
[8]   Recommender systems survey [J].
Bobadilla, J. ;
Ortega, F. ;
Hernando, A. ;
Gutierrez, A. .
KNOWLEDGE-BASED SYSTEMS, 2013, 46 :109-132
[9]  
Chen J, 2009, J MACH LEARN RES, V10, P1989
[10]   Vector and line quantization for billion-scale similarity search on GPUs [J].
Chen, Wei ;
Chen, Jincai ;
Zou, Fuhao ;
Li, Yuan-Fang ;
Lu, Ping ;
Wang, Qiang ;
Zhao, Wei .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 99 :295-307