LSH BANDING FOR LARGE-SCALE RETRIEVAL WITH MEMORY AND RECALL CONSTRAINTS

被引:2
|
作者
Covell, Michele [1 ]
Baluja, Shumeet [1 ]
机构
[1] Google Inc, Google Res, Mountain View, CA 94043 USA
来源
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS | 2009年
关键词
Multimedia databases; Information retrieval; Fingerprint identification; Pattern matching;
D O I
10.1109/ICASSP.2009.4959971
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Locality Sensitive Hashing (LSH) is widely used for efficient retrieval of candidate matches in very large audio, video, and image systems. However, extremely large reference databases necessitate a guaranteed limit on the memory used by the table lookup itself, no matter how the entries crowd different parts of the signature space, a guarantee that LSH does not give. In this paper, we provide such guaranteed limits, primarily through the design of the LSH bands. When combined with data-adaptive bin splitting (needed on only 0.04% of the occupied bins) this approach provides the required guarantee on memory usage. At the same time, it avoids the reduced recall that more extensive use of bin splitting would give.
引用
收藏
页码:1865 / 1868
页数:4
相关论文
共 50 条
  • [21] DC-GNN: Decoupled Graph Neural Networks for Improving and Accelerating Large-Scale E-commerce Retrieval
    Feng, Chenchen
    He, Yu
    Wen, Shiyang
    Liu, Guojun
    Wang, Liang
    Xu, Jian
    Zheng, Bo
    COMPANION PROCEEDINGS OF THE WEB CONFERENCE 2022, WWW 2022 COMPANION, 2022, : 32 - 40
  • [22] Semantic signatures for large-scale visual localization
    Li Weng
    Valérie Gouet-Brunet
    Bahman Soheilian
    Multimedia Tools and Applications, 2021, 80 : 22347 - 22372
  • [23] Semantic signatures for large-scale visual localization
    Weng, Li
    Gouet-Brunet, Valerie
    Soheilian, Bahman
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (15) : 22347 - 22372
  • [24] Entropy-Balanced Bitmap Tree for Shape-Based Object Retrieval From Large-Scale Satellite Imagery Databases
    Scott, Grant J.
    Klaric, Matthew N.
    Davis, Curt H.
    Shyu, Chi-Ren
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2011, 49 (05): : 1603 - 1616
  • [25] Large-scale Bayesian logistic regression for text categorization
    Genkin, Alexander
    Lewis, David D.
    Madigan, David
    TECHNOMETRICS, 2007, 49 (03) : 291 - 304
  • [26] LASH: Large-Scale Academic Deep Semantic Hashing
    Guo, Jia-Nan
    Mao, Xian-Ling
    Lan, Tian
    Tu, Rong-Xin
    Wei, Wei
    Huang, Heyan
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (02) : 1734 - 1746
  • [27] Graphics recognition for a large-scale airplane information system
    Baum, LS
    Boose, JH
    Kelley, RJ
    GRAPHICS RECOGNITION: ALGORITHMS AND SYSTEMS, 1998, 1389 : 291 - 301
  • [28] The anatomy of a large-scale hypertextual Web search engine
    Brin, S
    Page, L
    COMPUTER NETWORKS AND ISDN SYSTEMS, 1998, 30 (1-7): : 107 - 117
  • [29] Improving large-scale search engines with semantic annotations
    Fuentes-Lorenzo, Damaris
    Fernandez, Norberto
    Fisteus, Jesus A.
    Sanchez, Luis
    EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (06) : 2287 - 2296
  • [30] Periscoping: Private Key Distribution for Large-Scale Mixnets
    Liu, Shuhao
    Chen, Li
    Fu, Yuanzhong
    IEEE INFOCOM 2024-IEEE CONFERENCE ON COMPUTER COMMUNICATIONS, 2024, : 681 - 690