A Learning-to-Rank Formulation of Clustering-Based Approximate Nearest Neighbor Search

被引:1
|
作者
Vecchiato, Thomas [1 ]
Lucchese, Claudio [1 ]
Nardini, Franco Maria [2 ]
Bruch, Sebastian [3 ]
机构
[1] Ca Foscari Univ Venice, Venice, Italy
[2] ISTI CNR, Pisa, Italy
[3] Pinecone, New York, NY USA
来源
PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024 | 2024年
关键词
Approximate Nearest Neighbor Search; Inverted File; Learning to; Rank; EFFICIENT;
D O I
10.1145/3626772.3657931
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A critical piece of the modern information retrieval puzzle is approximate nearest neighbor search. Its objective is to return a set of k . data points that are closest to a query point, with its accuracy measured by the proportion of exact nearest neighbors captured in the returned set. One popular approach to this question is clustering: The indexing algorithm partitions data points into non-overlapping subsets and represents each partition by a point such as its centroid. The query processing algorithm first identifies the nearest clusters-a process known as routing-then performs a nearest neighbor search over those clusters only. In this work, we make a simple observation: The routing function solves a ranking problem. Its quality can therefore be assessed with a ranking metric, making the function amenable to learning-to-rank. Interestingly, ground-truth is often freely available: Given a query distribution in a top-k. configuration, the ground-truth is the set of clusters that contain the exact top-k. vectors. We develop this insight and apply it to Maximum Inner Product Search (MIPS). As we demonstrate empirically on various datasets, learning a simple linear function consistently improves the accuracy of clustering-based MIPS.
引用
收藏
页码:2261 / 2265
页数:5
相关论文
共 50 条
  • [31] Approximate Nearest Neighbor Search Using Enhanced Accumulative Quantization
    Ai, Liefu
    Cheng, Hongjun
    Wang, Xiaoxiao
    Chen, Chunsheng
    Liu, Deyang
    Zheng, Xin
    Wang, Yuanzhi
    ELECTRONICS, 2022, 11 (14)
  • [32] ADAPTIVE BIT ALLOCATION HASHING FOR APPROXIMATE NEAREST NEIGHBOR SEARCH
    Guo, Qin-Zhen
    Zeng, Zhi
    Zhang, Shuwu
    Zhang, Yuan
    Wang, Fangyuan
    2013 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2013), 2013,
  • [33] Principal Component Hashing: An Accelerated Approximate Nearest Neighbor Search
    Matsushita, Yusuke
    Wada, Toshikazu
    ADVANCES IN IMAGE AND VIDEO TECHNOLOGY, PROCEEDINGS, 2009, 5414 : 374 - 385
  • [34] Tree and Graph Based Two-Stages Routing for Approximate Nearest Neighbor Search
    Li, Jiannan
    Zhang, Zhenyu
    Wang, Xiaoling
    Li, Haoyang
    WEB AND BIG DATA, APWEB-WAIM 2024, PT III, 2024, 14963 : 376 - 390
  • [35] Quantization-Based Approximate Nearest Neighbor Search with Optimized Multiple Residual Codebooks
    Uchida, Yusuke
    Takagi, Koichi
    Kawada, Ryoichi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (07): : 1510 - 1514
  • [36] Optimized residual vector quantization for efficient approximate nearest neighbor search
    Ai, Liefu
    Yu, Junqing
    Wu, Zebin
    He, Yunfeng
    Guan, Tao
    MULTIMEDIA SYSTEMS, 2017, 23 (02) : 169 - 181
  • [37] A new fast inverted file-based algorithm for approximate nearest neighbor search without accuracy reduction
    Liu, Yuchen
    Pan, Zhibin
    Wang, Liangzhuang
    Wang, Yang
    INFORMATION SCIENCES, 2022, 608 : 613 - 629
  • [38] Self-Organizing Binary Encoding for Approximate Nearest Neighbor Search
    Ozan, Ezgi Can
    Kiranyaz, Serkan
    Gabbouj, Moncef
    Hu, Xiaohua
    2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 1103 - 1107
  • [39] Optimized residual vector quantization for efficient approximate nearest neighbor search
    Liefu Ai
    Junqing Yu
    Zebin Wu
    Yunfeng He
    Tao Guan
    Multimedia Systems, 2017, 23 : 169 - 181
  • [40] Binary Hashing for Approximate Nearest Neighbor Search on Big Data: A Survey
    Cao, Yuan
    Qi, Heng
    Zhou, Wenrui
    Kato, Jien
    Li, Keqiu
    Liu, Xiulong
    Gui, Jie
    IEEE ACCESS, 2018, 6 : 2039 - 2054