Codebook-softened product quantization for high accuracy approximate nearest neighbor search

被引:4
作者
Fan, Jingya [1 ]
Pan, Zhibin [1 ,3 ,4 ]
Wang, Liangzhuang [1 ]
Wang, Yang [2 ]
机构
[1] Xi An Jiao Tong Univ, Fac Elect & Informat Engn, Xian 710049, Shaanxi, Peoples R China
[2] Xian Univ Technol, Dept Informat Sci, Xian 710048, Shaanxi, Peoples R China
[3] Zhengzhou Xinda Inst Adv Technol, Zhengzhou 450002, Peoples R China
[4] Xi An Jiao Tong Univ, Res Inst, Hangzhou 311215, Zhejiang, Peoples R China
基金
中国国家自然科学基金;
关键词
Approximate nearest neighbor search; Product quantization; Vector quantization; High-dimensional space; Soften codebooks; BINARY QUANTIZATION; NORMALITY; CODES;
D O I
10.1016/j.neucom.2022.08.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Product quantization (PQ) is a fundamental technique for approximate nearest neighbor (ANN) search in many applications such as information retrieval, computer vision and pattern recognition. In the existing PQ-based methods for approximate nearest neighbor search, the reachable best search accuracy is achieved by using fixed codebooks. The search performance is limited by the quality of the hard codebooks. Unlike the existing methods, in this paper, we present a novel codebook-softened product quantization (CSPQ) method to achieve more quantization levels by softening codebooks. We firstly analyze how well the database vectors match the trained codebooks by examining quantization error for each database vector, and select the bad-matching database vectors. Then, we give the trained codebooks bbit freedom to soften codebooks. Finally, by minimizing quantization errors, the bad-matching vectors are encoded by softened codebooks and the labels of best-matching codebooks are recorded. Experimental results on SIFT1M, GIST1M and SIFT10M show that, despite its simplicity, our proposed method achieves higher accuracy compared with PQ and it can be combined with other nonexhaustive frameworks to achieve fast search. (c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:107 / 116
页数:10
相关论文
共 50 条
[31]   Codewords-Expanded Enhanced Residual Vector Quantization for Approximate Nearest Neighbor Search [J].
Ai L. ;
Cheng H. ;
Tao Y. ;
Yu J. ;
Zheng X. ;
Liu D. .
Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2022, 34 (03) :459-469
[32]   A New Cell-Level Search Based Non-Exhaustive Approximate Nearest Neighbor (ANN) Search Algorithm in the Framework of Product Quantization [J].
Wang, Yang ;
Pan, Zhibin ;
Li, Rui .
IEEE ACCESS, 2019, 7 :37059-37070
[33]   HDIdx: High-dimensional indexing for efficient approximate nearest neighbor search [J].
Wan, Ji ;
Tang, Sheng ;
Zhang, Yongdong ;
Li, Jintao ;
Wu, Pengcheng ;
Hoi, Steven C. H. .
NEUROCOMPUTING, 2017, 237 :401-404
[34]   ANNA: Specialized Architecture for Approximate Nearest Neighbor Search [J].
Lee, Yejin ;
Choi, Hyunji ;
Min, Sunhong ;
Lee, Hyunseung ;
Beak, Sangwon ;
Jeong, Dawoon ;
Lee, Jae W. ;
Ham, Tae Jun .
2022 IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2022), 2022, :169-183
[35]   Asymmetric Mapping Quantization for Nearest Neighbor Search [J].
Hong, Weixiang ;
Tang, Xueyan ;
Meng, Jingjing ;
Yuan, Junsong .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (07) :1783-1790
[36]   Distributed Adaptive Binary Quantization for Fast Nearest Neighbor Search [J].
Liu, Xianglong ;
Li, Zhujin ;
Deng, Cheng ;
Tao, Dacheng .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (11) :5324-5336
[37]   SONG: Approximate Nearest Neighbor Search on GPU [J].
Zhao, Weijie ;
Tan, Shulong ;
Li, Ping .
2020 IEEE 36TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2020), 2020, :1033-1044
[38]   Principal Component Hashing: An Accelerated Approximate Nearest Neighbor Search [J].
Matsushita, Yusuke ;
Wada, Toshikazu .
ADVANCES IN IMAGE AND VIDEO TECHNOLOGY, PROCEEDINGS, 2009, 5414 :374-385
[39]   A fast binary encoding mechanism for approximate nearest neighbor search [J].
Zhao, Hongwei ;
Wang, Zhen ;
Liu, Pingping ;
Wu, Bin .
NEUROCOMPUTING, 2016, 178 :112-122
[40]   Adaptive bit allocation hashing for approximate nearest neighbor search [J].
Guo, Qin-Zhen ;
Zeng, Zhi ;
Zhang, Shuwu .
NEUROCOMPUTING, 2015, 151 :719-728