Pruned Bi-directed K-nearest Neighbor Graph for Proximity Search

被引:28
作者
Iwasaki, Masajiro [1 ]
机构
[1] Yahoo Japan Corp, Tokyo, Japan
来源
SIMILARITY SEARCH AND APPLICATIONS, SISAP 2016 | 2016年 / 9939卷
关键词
QUANTIZATION; SHAPE;
D O I
10.1007/978-3-319-46759-7_2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we address the problems with fast proximity searches for high-dimensional data by using a graph as an index. Graph-based methods that use the k-nearest neighbor graph (KNNG) as an index perform better than tree-based and hash-based methods in terms of search precision and query time. To further improve the performance of the KNNG, the number of edges should be increased. However, increasing the number takes up more memory, while the rate of performance improvement gradually falls off. Here, we propose a pruned bi-directed KNNG (PBKNNG) in order to improve performance without increasing the number of edges. Different directed edges for existing edges between a pair of nodes are added to the KNNG, and excess edges are selectively pruned from each node. We show that the PBKNNG outperforms the KNNG for SIFT and GIST image descriptors. However, the drawback of the KNNG is that its construction cost is fatally expensive. As an alternative, we show that a graph can be derived from an approximate neighborhood graph, which costs much less to construct than a KNNG, in the same way as the PBKNNG and that it also outperforms a KNNG.
引用
收藏
页码:20 / 33
页数:14
相关论文
共 24 条
[1]  
[Anonymous], 2004, P 20 ACM S COMP
[2]  
[Anonymous], 2009, NEURIPS, P1753
[3]  
[Anonymous], 2006, 2006 IEEE COMP SOC C
[4]  
[Anonymous], 2011, P 22 INT JOINT C ART
[5]  
[Anonymous], 1993, P 4 ANN ACM SIAM S D
[6]   An optimal algorithm for approximate nearest neighbor searching in fixed dimensions [J].
Arya, S ;
Mount, DM ;
Netanyahu, NS ;
Silverman, R ;
Wu, AY .
JOURNAL OF THE ACM, 1998, 45 (06) :891-923
[7]   MULTIDIMENSIONAL BINARY SEARCH TREES USED FOR ASSOCIATIVE SEARCHING [J].
BENTLEY, JL .
COMMUNICATIONS OF THE ACM, 1975, 18 (09) :509-517
[8]  
Ciaccia P, 1997, PROCEEDINGS OF THE TWENTY-THIRD INTERNATIONAL CONFERENCE ON VERY LARGE DATABASES, P426
[9]  
Dong W., 2011, WWW
[10]  
Gionis A, 1999, PROCEEDINGS OF THE TWENTY-FIFTH INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES, P518