LSH BANDING FOR LARGE-SCALE RETRIEVAL WITH MEMORY AND RECALL CONSTRAINTS

被引:2
|
作者
Covell, Michele [1 ]
Baluja, Shumeet [1 ]
机构
[1] Google Inc, Google Res, Mountain View, CA 94043 USA
来源
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS | 2009年
关键词
Multimedia databases; Information retrieval; Fingerprint identification; Pattern matching;
D O I
10.1109/ICASSP.2009.4959971
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Locality Sensitive Hashing (LSH) is widely used for efficient retrieval of candidate matches in very large audio, video, and image systems. However, extremely large reference databases necessitate a guaranteed limit on the memory used by the table lookup itself, no matter how the entries crowd different parts of the signature space, a guarantee that LSH does not give. In this paper, we provide such guaranteed limits, primarily through the design of the LSH bands. When combined with data-adaptive bin splitting (needed on only 0.04% of the occupied bins) this approach provides the required guarantee on memory usage. At the same time, it avoids the reduced recall that more extensive use of bin splitting would give.
引用
收藏
页码:1865 / 1868
页数:4
相关论文
共 50 条
  • [1] Large-Scale Data Processing for Information Retrieval Applications
    Khandel, Pooya
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 3489 - 3489
  • [2] NLP and Large-Scale Information Retrieval on Mathematical Texts
    Dong, Yihe
    MATHEMATICAL SOFTWARE - ICMS 2018, 2018, 10931 : 156 - 164
  • [3] Large-scale retrieval for medical image analytics: A comprehensive review
    Li, Zhongyu
    Zhang, Xiaofan
    Mueller, Henning
    Zhang, Shaoting
    MEDICAL IMAGE ANALYSIS, 2018, 43 : 66 - 84
  • [4] Learning Neural Bag-of-Features for Large-Scale Image Retrieval
    Passalis, Nikolaos
    Tefas, Anastasios
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2017, 47 (10): : 2641 - 2652
  • [5] A heuristic approach for λ-representative information retrieval from large-scale data
    Zhang, Jin
    Wei, Qiang
    Chen, Guoqing
    INFORMATION SCIENCES, 2014, 277 : 825 - 841
  • [6] Attention-Based Graph Summarization for Large-Scale Information Retrieval
    Shabani, Nasrin
    Beheshti, Amin
    Jolfaei, Alireza
    Wu, Jia
    Haghighi, Venus
    Najafabadi, Maryam Khanian
    Foo, Jin
    IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2024, 70 (03) : 6224 - 6235
  • [7] Arabic Question Answering System for Information Retrieval on Large-scale Image Objects
    Al-Zubi, Sawsan
    Awaysheh, Feras M.
    Al-Shboul, Bashar Awad
    2021 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT DATA SCIENCE TECHNOLOGIES AND APPLICATIONS (IDSTA), 2021, : 162 - 170
  • [8] VisIRR: Visual Analytics for Information Retrieval and Recommendation with Large-Scale Document Data
    Choo, Jaegul
    Lee, Changhyun
    Kim, Hannah
    Lee, Hanseung
    Liu, Zhicheng
    Kannan, Ramakrishnan
    Stolper, Charles D.
    Stasko, John
    Drake, Barry L.
    Park, Haesun
    2014 IEEE CONFERENCE ON VISUAL ANALYTICS SCIENCE AND TECHNOLOGY (VAST), 2014, : 243 - 244
  • [9] Retrieval-Enhanced Generative Model for Large-Scale Knowledge Graph Completion
    Yu, Donghan
    Yang, Yiming
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 2334 - 2338
  • [10] A cloud-based framework for large-scale traditional Chinese medical record retrieval
    Liu, Lijun
    Liu, Li
    Fu, Xiaodong
    Huang, Qingsong
    Zhang, Xianwen
    Zhang, Yin
    JOURNAL OF BIOMEDICAL INFORMATICS, 2018, 77 : 21 - 33