Locality-Sensitive Hashing for Chi2 Distance

被引:53
作者
Gorisse, David [1 ]
Cord, Matthieu [2 ]
Precioso, Frederic [3 ]
机构
[1] Yakaz Lab, F-75002 Paris, France
[2] UPMC Sorbonne Univ, LIP6, F-75005 Paris, France
[3] CNRS, Lab Informat Signaux & Syst Sophia Antipolis, UNS I3S UMR6070, Sophia Antipolis 6903, France
关键词
Sublinear algorithm; approximate nearest neighbors; locality sensitive hashing; chi2; distance; image retrieval; NEAREST-NEIGHBOR; IMAGE;
D O I
10.1109/TPAMI.2011.193
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the past 10 years, new powerful algorithms based on efficient data structures have been proposed to solve the problem of Nearest Neighbors search (or Approximate Nearest Neighbors search). If the Euclidean Locality Sensitive Hashing algorithm, which provides approximate nearest neighbors in a euclidean space with sublinear complexity, is probably the most popular, the euclidean metric does not always provide as accurate and as relevant results when considering similarity measure as the Earth-Mover Distance and chi(2) distances. In this paper, we present a new LSH scheme adapted to chi(2) distance for approximate nearest neighbors search in high-dimensional spaces. We define the specific hashing functions, we prove their local-sensitivity, and compare, through experiments, our method with the Euclidean Locality Sensitive Hashing algorithm in the context of image retrieval on real image databases. The results prove the relevance of such a new LSH scheme either providing far better accuracy in the context of image retrieval than euclidean scheme for an equivalent speed, or providing an equivalent accuracy but with a high gain in terms of processing speed.
引用
收藏
页码:402 / 409
页数:8
相关论文
共 21 条
  • [1] Andoni A, 2006, ANN IEEE SYMP FOUND, P459
  • [2] [Anonymous], P INT WORKSH STAT CO
  • [3] [Anonymous], 2007, P 33 INT C VER LARG
  • [4] [Anonymous], P 17 IEEE INT C IM P
  • [5] [Anonymous], 2008, ADV NEURAL INF PROCE
  • [6] [Anonymous], P IEEE INT C COMP VI
  • [7] [Anonymous], IEEE T MULTIMEDIA
  • [8] [Anonymous], P 19 INT C PATT REC
  • [9] Support vector machines for histogram-based image classification
    Chapelle, O
    Haffner, P
    Vapnik, VN
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1999, 10 (05): : 1055 - 1064
  • [10] Histograms of oriented gradients for human detection
    Dalal, N
    Triggs, B
    [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 886 - 893