Triangulation embedding and democratic aggregation for image search

Cited by: 142
Authors
Jegou, Herve [1]
Zisserman, Andrew [2]
Affiliations
[1] Inria, Rennes, France
[2] Univ Oxford, Dept Engn Sci, Oxford OX1 2JD, England
Source
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2014
DOI
10.1109/CVPR.2014.417
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We consider the design of a single vector representation for an image that embeds and aggregates a set of local patch descriptors such as SIFT. More specifically we aim to construct a dense representation, like the Fisher Vector or VLAD, though of small or intermediate size. We make two contributions, both aimed at regularizing the individual contributions of the local descriptors in the final representation. The first is a novel embedding method that avoids the dependency on absolute distances by encoding directions. The second contribution is a "democratization" strategy that further limits the interaction of unrelated descriptors in the aggregation stage. These methods are complementary and give a substantial performance boost over the state of the art in image search with short or mid-size vectors, as demonstrated by our experiments on standard public image retrieval benchmarks.
Pages: 3310-3317
Page count: 8