LSH BANDING FOR LARGE-SCALE RETRIEVAL WITH MEMORY AND RECALL CONSTRAINTS

被引:2
作者
Covell, Michele [1 ]
Baluja, Shumeet [1 ]
机构
[1] Google Inc, Google Res, Mountain View, CA 94043 USA
来源
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS | 2009年
关键词
Multimedia databases; Information retrieval; Fingerprint identification; Pattern matching;
D O I
10.1109/ICASSP.2009.4959971
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Locality Sensitive Hashing (LSH) is widely used for efficient retrieval of candidate matches in very large audio, video, and image systems. However, extremely large reference databases necessitate a guaranteed limit on the memory used by the table lookup itself, no matter how the entries crowd different parts of the signature space, a guarantee that LSH does not give. In this paper, we provide such guaranteed limits, primarily through the design of the LSH bands. When combined with data-adaptive bin splitting (needed on only 0.04% of the occupied bins) this approach provides the required guarantee on memory usage. At the same time, it avoids the reduced recall that more extensive use of bin splitting would give.
引用
收藏
页码:1865 / 1868
页数:4
相关论文
共 50 条
  • [31] The anatomy of a large-scale hypertextual Web search engine
    Brin, S
    Page, L
    COMPUTER NETWORKS AND ISDN SYSTEMS, 1998, 30 (1-7): : 107 - 117
  • [32] A Pattern Matching Method for Large-Scale Multipurpose Process Scheduling
    He, Yaohua
    Hui, Chi-Wai
    AICHE JOURNAL, 2011, 57 (03) : 671 - 694
  • [33] Adaptive Powerball Stochastic Conjugate Gradient for Large-Scale Learning
    Yang, Zhuang
    IEEE TRANSACTIONS ON BIG DATA, 2023, 9 (06) : 1598 - 1606
  • [34] Large-scale Multi-modal Search and QA at Alibaba
    Jin, Rong
    PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 8 - 8
  • [35] Large-Scale Speaker Diarization for Long Recordings and Small Collections
    Huijbregts, Marijn
    van Leeuwen, David A.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (02): : 404 - 413
  • [36] Audio-visual large-scale video copy detection
    Liu, Yang
    Xu, Changsheng
    Lu, Hanqing
    INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 2011, 88 (18) : 3803 - 3816
  • [37] MULTI-CONCEPT LEARNING WITH LARGE-SCALE MULTIMEDIA LEXICONS
    Xie, Lexing
    Yan, Rong
    Yang, Jun
    2008 15TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-5, 2008, : 2148 - 2151
  • [38] Accelerating fingerprint identification using FPGA for large-scale applications
    Shafiq, Mohsin
    Taj, Imtiaz A.
    Ghafoor, Mubeen
    Tariq, Syed Ali
    Abbas, Assad
    Zomaya, Albert Y.
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2020, 141 : 35 - 48
  • [39] Fast Registration Methodology for Fastener Assembly of Large-Scale Structure
    Xu, Jing
    Chen, Rui
    Chen, Heping
    Zhang, Song
    Chen, Ken
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2017, 64 (01) : 717 - 726
  • [40] Towards Automatic Large-Scale Identification of Birds in Audio Recordings
    Lasseck, Mario
    EXPERIMENTAL IR MEETS MULTILINGUALITY, MULTIMODALITY, AND INTERACTION, 2015, 9283 : 364 - 375