Angular Distance-Guided Neighbor Selection for Graph-Based Approximate Nearest Neighbor Search

被引:0
作者
Jung, Sungjun [1 ]
Park, Yongsang [1 ]
Lee, Haeun [1 ]
Oh, Young H. [2 ]
Lee, Jae W. [1 ]
机构
[1] Seoul Natl Univ, Seoul, South Korea
[2] Ajou Univ, Suwon, South Korea
来源
PROCEEDINGS OF THE ACM WEB CONFERENCE 2025, WWW 2025 | 2025年
基金
新加坡国家研究基金会;
关键词
Approximate Nearest Neighbor Search; Similarity Search; Graph-based Approximate Nearest Neighbor Search; ALGORITHM; SCALABILITY; TREES; IMAGE; QUERY;
D O I
10.1145/3696410.3714870
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Graph-based approximate nearest neighbor search (ANNS) algorithms are widely used to identify the most similar vectors to a given query vector. Graph-based ANNS consists of two stages: constructing a graph and searching on the graph for a given query vector. While reducing the query response time is of great practical importance, less attention has been paid to improving the online search method than the offline graph construction method. This paper provides an extensive experimental analysis on the popular greedy search and other search optimization strategies. We also propose a novel angular distance-guided search method for graph-based ANNS (ADA-NNS) to improve search efficiency. The key innovation of ADA-NNS is introducing a low-cost neighbor selection mechanism based on an approximate similarity score derived from angular distance estimation, which effectively filters out less relevant neighbors. We compare state-of-the-art search techniques, including FINGER, on six datasets using different similarity metrics. It provides a comprehensive perspective on their tradeoffs in terms of throughput, latency, and recall. Our evaluation shows that ADA-NNS achieves 34%-107% higher queries per second (QPS) than the greedy search at 95% recall@10 on HNSW, one of the most popular graph structures for ANNS.
引用
收藏
页码:4014 / 4023
页数:10
相关论文
共 44 条
[1]   Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions [J].
Andoni, Alexandr ;
Indyk, Piotr .
COMMUNICATIONS OF THE ACM, 2008, 51 (01) :117-122
[2]  
[Anonymous], 2014, P 2014 C EMP METH NA, DOI [DOI 10.3115/V1/D14-1162, 10.3115/v1/D14-1162]
[3]   HD-Index: Pushing the Scalability-Accuracy Boundary for Approximate kNN Search in High-Dimensional Spaces [J].
Arora, Akhil ;
Sinha, Sakshi ;
Kumar, Piyush ;
Bhattacharya, Arnab .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2018, 11 (08) :906-919
[4]   An optimal algorithm for approximate nearest neighbor searching in fixed dimensions [J].
Arya, S ;
Mount, DM ;
Netanyahu, NS ;
Silverman, R ;
Wu, AY .
JOURNAL OF THE ACM, 1998, 45 (06) :891-923
[5]   ANN-Benchmarks: A benchmarking tool for approximate nearest neighbor algorithms [J].
Aumuller, Martin ;
Bernhardsson, Erik ;
Faithfull, Alexander .
INFORMATION SYSTEMS, 2020, 87
[6]   Efficient Indexing of Billion-Scale datasets of deep descriptors [J].
Babenko, Artem ;
Lempitsky, Victor .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2055-2063
[7]   MULTIDIMENSIONAL BINARY SEARCH TREES USED FOR ASSOCIATIVE SEARCHING [J].
BENTLEY, JL .
COMMUNICATIONS OF THE ACM, 1975, 18 (09) :509-517
[8]  
Boytsov Leonid, 2018, about us
[9]  
Charikar M. S., 2002, PROC 34 ACM STOC, P380
[10]  
Chen Patrick, 2023, P ACM WEB C 2023, P3225, DOI 10.1145/3543507.3583318