Aggregating binary local descriptors for image retrieval

被引:5
作者
Amato, Giuseppe [1 ]
Falchi, Fabrizio [1 ]
Vadicamo, Lucia [1 ]
机构
[1] CNR, Inst Informat Sci & Technol ISTI, Via Moruzzi 1, I-56124 Pisa, Italy
关键词
Binary local feature; Fisher vector; VLAD; Bag of words; Convolutional neural network; Content-based image retrieval; FISHER VECTOR; QUANTIZATION; FEATURES;
D O I
10.1007/s11042-017-4450-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Content-Based Image Retrieval based on local features is computationally expensive because of the complexity of both extraction and matching of local feature. On one hand, the cost for extracting, representing, and comparing local visual descriptors has been dramatically reduced by recently proposed binary local features. On the other hand, aggregation techniques provide a meaningful summarization of all the extracted feature of an image into a single descriptor, allowing us to speed up and scale up the image search. Only a few works have recently mixed together these two research directions, defining aggregation methods for binary local features, in order to leverage on the advantage of both approaches.In this paper, we report an extensive comparison among state-of-the-art aggregation methods applied to binary features. Then, we mathematically formalize the application of Fisher Kernels to Bernoulli Mixture Models. Finally, we investigate the combination of the aggregated binary features with the emerging Convolutional Neural Network (CNN) features. Our results show that aggregation methods on binary features are effective and represent a worthwhile alternative to the direct matching. Moreover, the combination of the CNN with the Fisher Vector (FV) built upon binary features allowed us to obtain a relative improvement over the CNN results that is in line with that recently obtained using the combination of the CNN with the FV built upon SIFTs. The advantage of using the FV built upon binary features is that the extraction process of binary features is about two order of magnitude faster than SIFTs.
引用
收藏
页码:5385 / 5415
页数:31
相关论文
共 80 条
  • [1] Fast Explicit Diffusion for Accelerated Features in Nonlinear Scale Spaces
    Alcantarilla, Pablo F.
    Nuevo, Jesus
    Bartoli, Adrien
    [J]. PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2013, 2013,
  • [2] Visual Recognition of Ancient Inscriptions Using Convolutional Neural Network and Fisher Vector
    Amato, Giuseppe
    Falchi, Fabrizio
    Vadicamo, Lucia
    [J]. ACM JOURNAL ON COMPUTING AND CULTURAL HERITAGE, 2016, 9 (04):
  • [3] Deep Permutations: Deep Convolutional Neural Networks and Permutation-Based Indexing
    Amato, Giuseppe
    Falchi, Fabrizio
    Gennaro, Claudio
    Vadicamo, Lucia
    [J]. SIMILARITY SEARCH AND APPLICATIONS, SISAP 2016, 2016, 9939 : 93 - 106
  • [4] [Anonymous], 2006, 2006 IEEE COMP SOC C
  • [5] [Anonymous], PAMI
  • [6] [Anonymous], 2013, NIPS
  • [7] [Anonymous], ARXIVABS13101531
  • [8] [Anonymous], PROCEEDINGS OF THE 1
  • [9] [Anonymous], IEEE C COMP VIS PATT
  • [10] [Anonymous], RR8325