Adaptive bit allocation hashing for approximate nearest neighbor search

被引:6
作者
Guo, Qin-Zhen [1 ]
Zeng, Zhi [1 ]
Zhang, Shuwu [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, Beijing 100190, Peoples R China
关键词
Hashing; Adaptive bit allocation; Approximate nearest neighbor search; Hamming embedding; Image retrieval; VECTOR QUANTIZATION; IMAGE; REPRESENTATION; ALGORITHM; MIXTURE; SCENE;
D O I
10.1016/j.neucom.2014.10.042
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Using hashing algorithms to learn binary codes representation of data for fast approximate nearest neighbor (ANN) search has attracted more and more attention. Most existing hashing methods employ various hash functions to encode data. The resulting binary codes can be obtained by concatenating bits produced by those hash functions. These methods usually have two main steps: projection and thresholding. One problem with these methods is that every dimension of the projected data is regarded as of same importance and encoded by one bit, which may result in ineffective codes. In this paper, we introduce an adaptive bit allocation hashing (ABAH) method to encode data for ANN search. The basic idea is, according to the dispersions of all the dimensions after projection we use different numbers of bits to encode them. In our method, more bits will be adaptively allocated to encode dimensions with larger dispersion while fewer bits for dimensions with smaller dispersion. This novel bit allocation scheme makes our hashing method effectively preserve the neighborhood structure in the original data space. Extensive experiments show that the proposed ABAH significantly outperforms other state-of-the-art methods for ANN search task. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:719 / 728
页数:10
相关论文
共 43 条
[1]  
[Anonymous], P C ADV NEUR INF PRO
[2]  
[Anonymous], 2008, P 1 ACM INT C MULTIM
[3]  
[Anonymous], 2010, P 16 ACM SIGKDD INT, DOI 10.1145/1835804.1835946
[4]  
[Anonymous], 2012, P INT C MACH LEARN
[5]   An optimal algorithm for approximate nearest neighbor searching in fixed dimensions [J].
Arya, S ;
Mount, DM ;
Netanyahu, NS ;
Silverman, R ;
Wu, AY .
JOURNAL OF THE ACM, 1998, 45 (06) :891-923
[6]  
BENTLEY JL, 1990, PROCEEDINGS OF THE SIXTH ANNUAL SYMPOSIUM ON COMPUTATIONAL GEOMETRY, P187, DOI 10.1145/98524.98564
[7]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[8]   A Fast Approximate Nearest Neighbor Search Algorithm in the Hamming Space [J].
Esmaeili, Mani Malek ;
Ward, Rabab Kreidieh ;
Fatourechi, Mehrdad .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (12) :2481-2488
[9]  
Gionis A, 1999, PROCEEDINGS OF THE TWENTY-FIFTH INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES, P518
[10]  
Gong YC, 2011, PROC CVPR IEEE, P817, DOI 10.1109/CVPR.2011.5995432