Large-Scale Image Retrieval with Compressed Fisher Vectors

被引:341
作者
Perronnin, Florent
Liu, Yan
Sanchez, Jorge
Poirier, Herve
机构
来源
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2010年
关键词
D O I
10.1109/CVPR.2010.5540009
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of large-scale image search has been traditionally addressed with the bag-of-visual-words (BOV). In this article, we propose to use as an alternative the Fisher kernel framework. We first show why the Fisher representation is well-suited to the retrieval problem: it describes an image by what makes it different from other images. One drawback of the Fisher vector is that it is high-dimensional and, as opposed to the BOV, it is dense. The resulting memory and computational costs do not make Fisher vectors directly amenable to large-scale retrieval. Therefore, we compress Fisher vectors to reduce their memory footprint and speed-up the retrieval. We compare three binarization approaches: a simple approach devised for this representation and two standard compression techniques. We show on two publicly available datasets that compressed Fisher vectors perform very well using as little as a few hundreds of bits per image, and significantly better than a very recent compressed BOV approach.
引用
收藏
页码:3384 / 3391
页数:8
相关论文
共 27 条
[1]  
[Anonymous], 2008, NIPS
[2]  
[Anonymous], 2002, STOC
[3]  
[Anonymous], 2007, ICCV
[4]  
[Anonymous], IJCV
[5]  
[Anonymous], 2007, CVPR
[6]  
[Anonymous], 2009, ADV NEURAL INFORM PR
[7]  
[Anonymous], 2009, ICCV
[8]  
[Anonymous], BMVC
[9]  
[Anonymous], 2007, CVPR
[10]  
[Anonymous], 2007, CVPR