The Inverted Multi-Index

被引:133
作者
Babenko, Artem [1 ,2 ]
Lempitsky, Victor [3 ]
机构
[1] Yandex, Moscow, Russia
[2] Natl Res Univ, Higher Sch Econ, London, England
[3] Skolkovo Inst Sci & Technol, Moscow, Russia
关键词
Image retrieval; Index; nearest neighbor search; product quantization; OBJECT; SCENE;
D O I
10.1109/TPAMI.2014.2361319
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A new data structure for efficient similarity search in very large datasets of high-dimensional vectors is introduced. This structure called the inverted multi-index generalizes the inverted index idea by replacing the standard quantization within inverted indices with product quantization. For very similar retrieval complexity and pre-processing time, inverted multi-indices achieve a much denser subdivision of the search space compared to inverted indices, while retaining their memory efficiency. Our experiments with large datasets of SIFT and GIST vectors demonstrate that because of the denser subdivision, inverted multi-indices are able to return much shorter candidate lists with higher recall. Augmented with a suitable reranking procedure, multi-indices were able to significantly improve the speed of approximate nearest neighbor search on the dataset of 1 billion SIFT vectors compared to the best previously published systems, while achieving better recall and incurring only few percent of memory overhead.
引用
收藏
页码:1247 / 1260
页数:14
相关论文
共 39 条
[1]  
[Anonymous], 2006, 2006 IEEE COMP SOC C
[2]  
[Anonymous], 2008, Introduction to information retrieval
[3]  
[Anonymous], ACM MULTIMEDIA, DOI DOI 10.1145/1027527.1027729
[4]   The Inverted Multi-Index [J].
Babenko, Artem ;
Lempitsky, Victor .
2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2012, :3069-3076
[5]  
Babenko Artem., 2014, CoRR
[6]   MULTIDIMENSIONAL BINARY SEARCH TREES USED FOR ASSOCIATIVE SEARCHING [J].
BENTLEY, JL .
COMMUNICATIONS OF THE ACM, 1975, 18 (09) :509-517
[7]   In defense of Nearest-Neighbor based image classification [J].
Boiman, Oren ;
Shechtman, Eli ;
Irani, Michal .
2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, :1992-+
[8]  
Charikar M., 2002, P THIR 4 ANN ACM S T, P380
[9]   Total recall: Automatic query expansion with a generative feature model for object retrieval [J].
Chum, Ondrej ;
Philbin, James ;
Sivic, Josef ;
Isard, Michael ;
Zisserman, Andrew .
2007 IEEE 11TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS 1-6, 2007, :496-+
[10]  
Chum Ondrej., 2007, CIVR 07, P549, DOI DOI 10.1145/1282280.1282359