Fast spectral analysis for approximate nearest neighbor search

被引:1
|
作者
Wang, Jing [1 ]
Shen, Jie [2 ]
机构
[1] Amazon, New York, NY 10001 USA
[2] Stevens Inst Technol, Hoboken, NJ 07030 USA
关键词
Approximate nearest neighbor search; Spectral analysis; Hashing; Noise; Subspace; QUANTIZATION; ALGORITHMS;
D O I
10.1007/s10994-021-06124-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In large-scale machine learning, of central interest is the problem of approximate nearest neighbor (ANN) search, where the goal is to query particular points that are close to a given object under certain metric. In this paper, we develop a novel data-driven ANN search algorithm where the data structure is learned by fast spectral technique based on s landmarks selected by approximate ridge leverage scores. We show that with overwhelming probability, our algorithm returns the (1 + epsilon/4)-ANN for any approximation parameter epsilon is an element of (0, 1). A remarkable feature of our algorithm is that it is computationally efficient. Specifically, learning k-length hash codes requires O((s(3) + ns(2)) log n) running time and O(d(2)) extra space, and returning the (1 + epsilon/4)-ANN of the query needs O(k log n) running time. The experimental results on computer vision and natural language understanding tasks demonstrate the significant advantage of our algorithm compared to state-of-the-art methods.
引用
收藏
页码:2297 / 2322
页数:26
相关论文
共 50 条
  • [21] ANNA: Specialized Architecture for Approximate Nearest Neighbor Search
    Lee, Yejin
    Choi, Hyunji
    Min, Sunhong
    Lee, Hyunseung
    Beak, Sangwon
    Jeong, Dawoon
    Lee, Jae W.
    Ham, Tae Jun
    2022 IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2022), 2022, : 169 - 183
  • [22] Self-Organizing Binary Encoding for Approximate Nearest Neighbor Search
    Ozan, Ezgi Can
    Kiranyaz, Serkan
    Gabbouj, Moncef
    Hu, Xiaohua
    2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 1103 - 1107
  • [23] Binary Hashing for Approximate Nearest Neighbor Search on Big Data: A Survey
    Cao, Yuan
    Qi, Heng
    Zhou, Wenrui
    Kato, Jien
    Li, Keqiu
    Liu, Xiulong
    Gui, Jie
    IEEE ACCESS, 2018, 6 : 2039 - 2054
  • [24] Optimized K-means Hashing for Approximate Nearest Neighbor Search
    Guo, Qin-Zhen
    Zeng, Zhi
    Zhang, Shuwu
    Zhang, Yuan
    Zhang, Guixuan
    MATERIAL SCIENCE, CIVIL ENGINEERING AND ARCHITECTURE SCIENCE, MECHANICAL ENGINEERING AND MANUFACTURING TECHNOLOGY II, 2014, 651-653 : 2168 - 2171
  • [25] Cluster Guided Truncated Hashing for Enhanced Approximate Nearest Neighbor Search
    Liu, Mingyang
    Yang, Zuyuan
    Han, Wei
    Xie, Shengli
    IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 181 - 185
  • [26] Tree-based compact hashing for approximate nearest neighbor search
    Hou, Guangdong
    Cui, Runpeng
    Pan, Zheng
    Zhang, Changshui
    NEUROCOMPUTING, 2015, 166 : 271 - 281
  • [27] A novel cell partition method by introducing Silhouette Coefficient for fast approximate nearest neighbor search
    Song, Wenwen
    Wang, Yang
    Pan, Zhibin
    INFORMATION SCIENCES, 2023, 642
  • [28] WARank: Weighted Asymmetric Ranking for Approximate Nearest Neighbor Search
    Cao, Yuan
    Qi, Heng
    Li, Keqiu
    Jin, Yingwei
    Li, Zhiyang
    CIT/IUCC/DASC/PICOM 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY - UBIQUITOUS COMPUTING AND COMMUNICATIONS - DEPENDABLE, AUTONOMIC AND SECURE COMPUTING - PERVASIVE INTELLIGENCE AND COMPUTING, 2015, : 297 - 304
  • [29] Residual Vector Product Quantization for approximate nearest neighbor search
    Niu, Lushuai
    Xu, Zhi
    Zhao, Longyang
    He, Daojing
    Ji, Jianqiu
    Yuan, Xiaoli
    Xue, Mian
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 232
  • [30] Product quantization with dual codebooks for approximate nearest neighbor search
    Pan, Zhibin
    Wang, Liangzhuang
    Wang, Yang
    Liu, Yuchen
    NEUROCOMPUTING, 2020, 401 : 59 - 68