Large-Scale Image Retrieval with Compressed Fisher Vectors

被引:319
作者
Perronnin, Florent
Liu, Yan
Sanchez, Jorge
Poirier, Herve
机构
来源
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2010年
关键词
D O I
10.1109/CVPR.2010.5540009
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of large-scale image search has been traditionally addressed with the bag-of-visual-words (BOV). In this article, we propose to use as an alternative the Fisher kernel framework. We first show why the Fisher representation is well-suited to the retrieval problem: it describes an image by what makes it different from other images. One drawback of the Fisher vector is that it is high-dimensional and, as opposed to the BOV, it is dense. The resulting memory and computational costs do not make Fisher vectors directly amenable to large-scale retrieval. Therefore, we compress Fisher vectors to reduce their memory footprint and speed-up the retrieval. We compare three binarization approaches: a simple approach devised for this representation and two standard compression techniques. We show on two publicly available datasets that compressed Fisher vectors perform very well using as little as a few hundreds of bits per image, and significantly better than a very recent compressed BOV approach.
引用
收藏
页码:3384 / 3391
页数:8
相关论文
共 27 条
  • [1] [Anonymous], 2008, NIPS
  • [2] [Anonymous], 2002, STOC
  • [3] [Anonymous], 2007, ICCV
  • [4] [Anonymous], IJCV
  • [5] [Anonymous], 2007, CVPR
  • [6] [Anonymous], 2009, ADV NEURAL INFORM PR
  • [7] [Anonymous], 2009, ICCV
  • [8] [Anonymous], BMVC
  • [9] [Anonymous], 2007, CVPR
  • [10] [Anonymous], 2007, CVPR