Principal Component Hashing: An Accelerated Approximate Nearest Neighbor Search

被引:0
作者
Matsushita, Yusuke [1 ]
Wada, Toshikazu [1 ]
机构
[1] Wakayama Univ, Grad Sch Syst Engn, Wakayama 6408510, Japan
来源
ADVANCES IN IMAGE AND VIDEO TECHNOLOGY, PROCEEDINGS | 2009年 / 5414卷
关键词
Approximate Nearest Neighbor Search; High dimensional space; p-stable Locality Sensitive Hashing; ALGORITHM;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Nearest Neighbor (NN) search is a basic algorithm for data mining and machine learning applications. However, its acceleration in high dimensional space is a difficult problem. For solving this problem, approximate NN search algorithms have been investigated. Especially, LSH is getting highlighted recently, because it has a clear relationship between relative error ratio and the computational complexity. However, the p-stable LSH computes hash values independent of the data distributions, and hence, sometimes the search fails or consumes considerably long time. For solving this problem, we propose Principal Component Hashing (PCH), which exploits the distribution of the stored data. Through experiments, we confirmed that PCH is faster than ANN and LSH at the same accuracy.
引用
收藏
页码:374 / 385
页数:12
相关论文
共 12 条
[1]  
Andoni A, 2006, ANN IEEE SYMP FOUND, P459
[2]  
[Anonymous], ANN: A library for approximate nearest neighbor searching
[3]  
[Anonymous], 1658 INRIA
[4]  
[Anonymous], P 21 C VER LARG DAT
[5]   An optimal algorithm for approximate nearest neighbor searching in fixed dimensions [J].
Arya, S ;
Mount, DM ;
Netanyahu, NS ;
Silverman, R ;
Wu, AY .
JOURNAL OF THE ACM, 1998, 45 (06) :891-923
[6]   MULTIDIMENSIONAL BINARY SEARCH TREES USED FOR ASSOCIATIVE SEARCHING [J].
BENTLEY, JL .
COMMUNICATIONS OF THE ACM, 1975, 18 (09) :509-517
[7]   NEAREST NEIGHBOR PATTERN CLASSIFICATION [J].
COVER, TM ;
HART, PE .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1967, 13 (01) :21-+
[8]  
Datar Mayur, 2004, P 20 ACM S COMP GEOM
[9]  
Indyk P., 1998, Proceedings of the Thirtieth Annual ACM Symposium on Theory of Computing, P604, DOI 10.1145/276698.276876
[10]   A NEW VERSION OF THE NEAREST-NEIGHBOR APPROXIMATING AND ELIMINATING SEARCH ALGORITHM (AESA) WITH LINEAR PREPROCESSING TIME AND MEMORY REQUIREMENTS [J].
MICO, ML ;
ONCINA, J ;
VIDAL, E .
PATTERN RECOGNITION LETTERS, 1994, 15 (01) :9-17