STACKED PRODUCT QUANTIZATION FOR NEAREST NEIGHBOR SEARCH ON LARGE DATASETS

被引:0
作者
Wang, Jun [1 ]
Li, Zhiyang [1 ]
Du, Yegang [2 ]
Qu, Wenyu [3 ]
机构
[1] Dalian Maritime Univ, Dalian, Peoples R China
[2] Japan Adv Inst Sci & Technol, Nomi, Japan
[3] Tianjin Univ, Tianjin, Peoples R China
来源
2016 IEEE TRUSTCOM/BIGDATASE/ISPA | 2016年
关键词
Nearest neighbor search; big data; high dimensional vector; vector quantization;
D O I
10.1109/TrustCom.2016.248
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
High dimensional vector quantization plays an important role in KNN search on large datasets. In recent years, there have a large literature on vector quantization such as product quantization( PQ), optimized product quantization(OPQ), additive quantization (AQ), stacked quantization(SQ). However, these vector quantization faced with large quantization error or low efficiency codebook learning and encoding. In this paper, we propose a new vector quantization method called SPQ which combines the strength of PQ and SQ. On one hand, compared with PQ, we can get a more precise subcodebook in each subspace. On the other hand, we can generate codebook within consuming less time and memory than SQ. Extensive experiments on benchmark datasets demonstrate that SPQ can generate codebook and encoding faster than SQ while maintain the same quantization error. Furthermore we show that SPQ have good scalability, which compare favorably with the sate-of-the-art.
引用
收藏
页码:1621 / 1627
页数:7
相关论文
共 13 条
[1]  
[Anonymous], 2014, ARXIV14112173
[2]  
[Anonymous], 2014, COMPUTER SCI
[3]  
Aulin Tor, 2014, ENCY STAT SCI, V36, P209
[4]   Additive Quantization for Extreme Vector Compression [J].
Babenko, Artem ;
Lempitsky, Victor .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :931-938
[5]  
Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
[6]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[7]   Optimized Product Quantization for Approximate Nearest Neighbor Search [J].
Ge, Tiezheng ;
He, Kaiming ;
Ke, Qifa ;
Sun, Jian .
2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :2946-2953
[8]   Iterative Quantization: A Procrustean Approach to Learning Binary Codes for Large-Scale Image Retrieval [J].
Gong, Yunchao ;
Lazebnik, Svetlana ;
Gordo, Albert ;
Perronnin, Florent .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (12) :2916-2929
[9]  
Hartigan J. A., 1979, Applied Statistics, V28, P100, DOI 10.2307/2346830
[10]   Product Quantization for Nearest Neighbor Search [J].
Jegou, Herve ;
Douze, Matthijs ;
Schmid, Cordelia .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (01) :117-128