Approximate nearest neighbor search by cyclic hierarchical product quantization

被引:0
作者
Xu, Zhi [1 ]
Zhou, Mengdong [1 ]
Liu, Yuxuan [2 ]
Zhao, Longyang [1 ]
Liu, Jiajia [3 ]
机构
[1] Guilin Univ Elect Technol, Sch Comp Sci & Informat Secur, Guilin, Peoples R China
[2] Guilin Univ Elect Technol, Sch Mech & Elect Engn, Guilin, Peoples R China
[3] Civil Aviat Flight Univ China, Sch Inst Elect & Elect Engn, Guanghan, Peoples R China
关键词
Vector quantization; Approximate nearest neighbor search; Hierarchical quantization; Residual quantization;
D O I
10.1007/s11760-025-04030-w
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Vector quantization (VQ) is a widely used Approximate Nearest Neighbor (ANN) search method. By constructing multiple codebooks, VQ can create more codeword vectors with lower memory consumption, enabling the indexing of large-scale database. In recent years, many VQ-based methods have been proposed, but the codeword vectors constructed in these methods are often underutilized due to insufficient data support, and the unimodal data distribution within the partition is not considered. To address these issues, we propose a new quantization method, Cyclic Hierarchical Product Quantization (CHPQ). This method first constructs a hierarchical quantization structure in each subspace, with each hierarchical structure composed of several sub-quantizers. Then, the codebook is locally optimized under the sub-quantizers according to the data distribution of each Voronoi cell, significantly improving quantization performance compared to other methods and greatly enhancing the accuracy of ANN search. Additionally, this paper proposes a new hierarchical quantization structure, termed cyclic hierarchical structure, which can generate more diverse codeword vectors in different space partitions compared to the traditional hierarchical quantization structure. Experiment results demonstrate that CHPQ outperforms existing methods in terms of retrieval accuracy while maintaining comparable computational efficiency.
引用
收藏
页数:11
相关论文
共 26 条
[1]   Optimized residual vector quantization for efficient approximate nearest neighbor search [J].
Ai, Liefu ;
Yu, Junqing ;
Wu, Zebin ;
He, Yunfeng ;
Guan, Tao .
MULTIMEDIA SYSTEMS, 2017, 23 (02) :169-181
[2]   The Inverted Multi-Index [J].
Babenko, Artem ;
Lempitsky, Victor .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (06) :1247-1260
[3]   Additive Quantization for Extreme Vector Compression [J].
Babenko, Artem ;
Lempitsky, Victor .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :931-938
[4]   Searching in high-dimensional spaces -: Index structures for improving the performance of multimedia Databases [J].
Böhm, C ;
Berchtold, S ;
Keim, D .
ACM COMPUTING SURVEYS, 2001, 33 (03) :322-373
[5]   Tensor Quantization: High-Dimensional Data Compression [J].
Chang, Shih Yu ;
Wu, Hsiao-Chun .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (08) :5566-5580
[6]   Approximate Nearest Neighbor Search by Residual Vector Quantization [J].
Chen, Yongjian ;
Guan, Tao ;
Wang, Cheng .
SENSORS, 2010, 10 (12) :11259-11273
[7]  
Furcy D, 2005, 19TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-05), P125
[8]   Optimized Product Quantization for Approximate Nearest Neighbor Search [J].
Ge, Tiezheng ;
He, Kaiming ;
Ke, Qifa ;
Sun, Jian .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :2946-2953
[9]   Quantization [J].
Gray, RM ;
Neuhoff, DL .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1998, 44 (06) :2325-2383
[10]   Embedding-based Retrieval in Facebook Search [J].
Huang, Jui-Ting ;
Sharma, Ashish ;
Sun, Shuying ;
Xia, Li ;
Zhang, David ;
Pronin, Philip ;
Padmanabhan, Janani ;
Ottaviano, Giuseppe ;
Yang, Linjun .
KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, :2553-2561