Hash Bit Selection for Nearest Neighbor Search

被引:36
作者
Liu, Xianglong [1 ]
He, Junfeng [2 ]
Chang, Shih-Fu [3 ]
机构
[1] Beihang Univ, State Key Lab Software Dev Environm, Beijing 10091, Peoples R China
[2] Google, Mountain View, CA 94043 USA
[3] Columbia Univ, Dept Elect Engn, New York, NY 10027 USA
基金
中国国家自然科学基金;
关键词
Nearest neighbor search; hash bit selection; bit reliability; bit complementarity; normalized dominant set; locality-sensitive hashing; QUANTIZATION;
D O I
10.1109/TIP.2017.2695895
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To overcome the barrier of storage and computation when dealing with gigantic-scale data sets, compact hashing has been studied extensively to approximate the nearest neighbor search. Despite the recent advances, critical design issues remain open in how to select the right features, hashing algorithms, and/or parameter settings. In this paper, we address these by posing an optimal hash bit selection problem, in which an optimal subset of hash bits are selected from a pool of candidate bits generated by different features, algorithms, or parameters. Inspired by the optimization criteria used in existing hashing algorithms, we adopt the bit reliability and their complementarity as the selection criteria that can be carefully tailored for hashing performance in different tasks. Then, the bit selection solution is discovered by finding the best tradeoff between search accuracy and time using a modified dynamic programming method. To further reduce the computational complexity, we employ the pairwise relationship among hash bits to approximate the highorder independence property, and formulate it as an efficient quadratic programming method that is theoretically equivalent to the normalized dominant set problem in a vertex-and edge-weighted graph. Extensive large-scale experiments have been conducted under several important application scenarios of hash techniques, where our bit selection framework can achieve superior performance over both the naive selection methods and the state-of-the-art hashing algorithms, with significant accuracy gains ranging from 10% to 50%, relatively.
引用
收藏
页码:5367 / 5380
页数:14
相关论文
共 50 条
  • [21] Fast Nearest Neighbor Search with Keywords
    Tao, Yufei
    Sheng, Cheng
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (04) : 878 - 888
  • [22] A scalable solution to the nearest neighbor search problem through local-search methods on neighbor graphs
    Tellez, Eric S.
    Ruiz, Guillermo
    Chavez, Edgar
    Graff, Mario
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2021, 24 (02) : 763 - 777
  • [23] Extremely Low Bit-Rate Nearest Neighbor Search Using a Set Compression Tree
    Arandjelovic, Relja
    Zisserman, Andrew
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (12) : 2396 - 2406
  • [24] Hash Bit Selection Based on Collaborative Neurodynamic Optimization
    Li, Xinqi
    Wang, Jun
    Kwong, Sam
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (10) : 11144 - 11155
  • [25] Efficient Nearest Neighbor Search by Removing Anti-hub
    Tanaka, Kimihiro
    Matsui, Yusuke
    Satoh, Shin'ichi
    [J]. PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 285 - 293
  • [26] Distributed Adaptive Binary Quantization for Fast Nearest Neighbor Search
    Liu, Xianglong
    Li, Zhujin
    Deng, Cheng
    Tao, Dacheng
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (11) : 5324 - 5336
  • [27] In-Memory Nearest Neighbor Search with FeFET Multi-Bit Content-Addressable Memories
    Kazemi, Arman
    Sharifi, Mohammad Mehdi
    Laguna, Ann Franchesca
    Mueller, Franz
    Rajaei, Ramin
    Olivo, Ricardo
    Kaempfe, Thomas
    Niemier, Michael
    Hu, X. Sharon
    [J]. PROCEEDINGS OF THE 2021 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2021), 2021, : 1084 - 1089
  • [28] Locally Optimized Hashing for Nearest Neighbor Search
    Tokui, Seiya
    Sato, Issei
    Nakagawa, Hiroshi
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PART II, 2015, 9078 : 498 - 509
  • [29] Singleton indexes for nearest neighbor. search
    Tellez, E. S.
    Ruiz, G.
    Chavez, E.
    [J]. INFORMATION SYSTEMS, 2016, 60 : 50 - 68
  • [30] Secure Approximate Nearest Neighbor Search with Locality-Sensitive Hashing
    Song, Shang
    Liu, Lin
    Chen, Rongmao
    Peng, Wei
    Wang, Yi
    [J]. COMPUTER SECURITY - ESORICS 2023, PT III, 2024, 14346 : 411 - 430