LSH BANDING FOR LARGE-SCALE RETRIEVAL WITH MEMORY AND RECALL CONSTRAINTS

被引:2
作者
Covell, Michele [1 ]
Baluja, Shumeet [1 ]
机构
[1] Google Inc, Google Res, Mountain View, CA 94043 USA
来源
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS | 2009年
关键词
Multimedia databases; Information retrieval; Fingerprint identification; Pattern matching;
D O I
10.1109/ICASSP.2009.4959971
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Locality Sensitive Hashing (LSH) is widely used for efficient retrieval of candidate matches in very large audio, video, and image systems. However, extremely large reference databases necessitate a guaranteed limit on the memory used by the table lookup itself, no matter how the entries crowd different parts of the signature space, a guarantee that LSH does not give. In this paper, we provide such guaranteed limits, primarily through the design of the LSH bands. When combined with data-adaptive bin splitting (needed on only 0.04% of the occupied bins) this approach provides the required guarantee on memory usage. At the same time, it avoids the reduced recall that more extensive use of bin splitting would give.
引用
收藏
页码:1865 / 1868
页数:4
相关论文
共 50 条
  • [41] KNOW: Developing large-scale multilingual technologies for language understanding
    Agirre, Eneko
    Castellon, Irene
    Padro, Lluis
    Climent, Salvador
    Rigau, German
    Alonso, Laura
    Cuadros, Montse
    Coll-Florit, Marta
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2009, (43): : 377 - 378
  • [42] Construction of Deep Resolution and Retrieval Platform for Large Scale Scientific and Technical Literature
    Wu Suyan
    Li Wenbo
    Wu Jiangrui
    2018 IEEE 3RD INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA), 2018, : 375 - 379
  • [43] Mining Music from Large-Scale, Peer-to-Peer Networks
    Shavitt, Yuval
    Weinsberg, Ela
    Weinsberg, Udi
    IEEE MULTIMEDIA, 2011, 18 (01) : 14 - 22
  • [45] Speeding up and enhancing a large-scale fingerprint identification system on GPU*
    Hong Hai Le
    Ngoc Hoa Nguyen
    Tri-Thanh Nguyen
    JOURNAL OF INFORMATION AND TELECOMMUNICATION, 2018, 2 (02) : 147 - 162
  • [46] Self-adaptive approximate queries for large-scale information aggregation
    Brunner, Rene
    Freitag, Felix
    Navarro, Leandro
    Rana, Omer F.
    INTERNATIONAL JOURNAL OF WEB AND GRID SERVICES, 2012, 8 (03) : 225 - 247
  • [47] Diagnosing Performance Issues for Large-Scale Microservice Systems With Heterogeneous Graph
    Tao, Lei
    Lu, Xianglin
    Zhang, Shenglin
    Luan, Jiaqi
    Li, Yingke
    Li, Mingjie
    Li, Zeyan
    Yu, Qingyang
    Xie, Hucheng
    Xu, Ruijie
    Hu, Chenyuan
    Yang, Canqun
    Pei, Dan
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2024, 17 (05) : 2223 - 2235
  • [48] In Vivo Evaluation of Large-scale IR-based Traceability Recovery
    Borg, Markus
    2011 15TH EUROPEAN CONFERENCE ON SOFTWARE MAINTENANCE AND REENGINEERING (CSMR), 2011, : 365 - 368
  • [49] Listen or interact? A Large-scale survey on music listening and management behaviours
    Kamalzadeh, Mohsen
    Baur, Dominikus
    Moeller, Torsten
    JOURNAL OF NEW MUSIC RESEARCH, 2016, 45 (01) : 42 - 67
  • [50] Large-Scale Similarity-Based Join Processing in Multimedia Databases
    Kosch, Harald
    Woelfl, Andreas
    ADVANCES IN MULTIMEDIA MODELING, 2012, 7131 : 418 - 428