DB-LSH 2.0: Locality-Sensitive Hashing With Query-Based Dynamic Bucketing

被引:3
|
作者
Tian, Yao [1 ]
Zhao, Xi [1 ]
Zhou, Xiaofang [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Comp Sci & Engn, Kowloon, Hong Kong, Peoples R China
关键词
Costs; Indexes; Search problems; Time complexity; Nearest neighbor methods; Hash functions; Extraterrestrial measurements; Approximate nearest neighbor search; high-dimensional spaces; locality sensitive hashing; APPROXIMATE NEAREST-NEIGHBOR; SEARCH; TREES;
D O I
10.1109/TKDE.2023.3295831
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Locality-sensitive hashing (LSH) is a promising family of methods for the high-dimensional approximate nearest neighbor (ANN) search problem due to its sub-linear query time and strong theoretical guarantee. Existing LSH methods either suffer from large index sizes and hash boundary problems, or incur a linear cost for high-quality candidate identification. This dilemma is addressed in a novel method called DB-LSH proposed in this paper. It organizes the projected spaces with multi-dimensional indexes instead of fixed-width hash buckets, which significantly reduces space costs. High-quality candidates can be generated efficiently by dynamically constructing query-based hypercubic buckets with the required widths through index-based window queries. A novel incremental search strategy called DBI-LSH is also developed to further boost the query performance, which incrementally accesses the next best point for higher accuracy and efficiency. Considering the intermediate query information of each query, DBA-LSH is designed to adaptively tune termination conditions without scarifying the success probability. Our theoretical analysis proves that DB-LSH has a smaller query cost than the existing work while DBA-LSH and DBI-LSH have lower expected query costs than DB-LSH. An extensive range of experiments on real-world data show the superiority of our approaches over the state-of-the-art methods in both efficiency and accuracy.
引用
收藏
页码:1000 / 1015
页数:16
相关论文
共 38 条
  • [1] DB-LSH: Locality-Sensitive Hashing with Query-based Dynamic Bucketing
    Tian, Yao
    Zhao, Xi
    Thou, Xiaofang
    2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 2250 - 2262
  • [2] BCH-LSH: a new scheme of locality-sensitive hashing
    Ma, Yuena
    Feng, Xiaoyi
    Liu, Yang
    Li, Shuhong
    IET IMAGE PROCESSING, 2018, 12 (06) : 850 - 855
  • [3] Can LSH (locality-sensitive hashing) be replaced by neural network?
    Liu, Renyang
    Zhao, Jun
    Chu, Xing
    Liang, Yu
    Zhou, Wei
    He, Jing
    SOFT COMPUTING, 2024, 28 (02) : 887 - 902
  • [4] Can LSH (locality-sensitive hashing) be replaced by neural network?
    Renyang Liu
    Jun Zhao
    Xing Chu
    Yu Liang
    Wei Zhou
    Jing He
    Soft Computing, 2024, 28 : 1041 - 1053
  • [5] Query-aware locality-sensitive hashing scheme for l p norm
    Huang, Qiang
    Feng, Jianlin
    Fang, Qiong
    Ng, Wilfred
    Wang, Wei
    VLDB JOURNAL, 2017, 26 (05) : 683 - 708
  • [6] P-QALSH: Parallelizing Query Aware Locality-Sensitive Hashing for Big Data
    Huang, Yikai
    Yao, Zhili
    Feng, Jianlin
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 629 - 635
  • [7] NV-QALSH: An NVM-Optimized Implementation of Query-Aware Locality-Sensitive Hashing
    Yao, Zhili
    Zhang, Jiaqiao
    Feng, Jianlin
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2021, PT II, 2021, 12924 : 58 - 69
  • [8] Fast Access for Star Catalog Based on Locality-Sensitive Hashing
    Zhu H.
    Liang B.
    Zhang T.
    Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University, 2018, 36 (05): : 988 - 994
  • [9] MIXED-LSH: Reduction of Remote Accesses in Distributed Locality-Sensitive Hashing based on L1-distance
    Koga, Hisashi
    Oguri, Masayuki
    Watanabe, Toshinori
    2012 IEEE 26TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS (AINA), 2012, : 175 - 182
  • [10] S2JS']JSD-LSH: A Locality-Sensitive Hashing Schema for Probability Distributions
    Mao, Xian-Ling
    Feng, Bo-Si
    Hao, Yi-Jing
    Nie, Liqiang
    Huang, Heyan
    Wen, Guihua
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3244 - 3251