Sparse coding based VLAD for efficient image retrieval

被引:0
作者
Reddy, Mopuri K. [1 ]
Talur, Jayasimha [1 ]
Babu, R. Venkatesh [1 ]
机构
[1] Indian Inst Sci, SERC, Video Analyt Lab, Bangalore, Karnataka, India
来源
2014 IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTING AND COMMUNICATION TECHNOLOGIES (IEEE CONECCT) | 2014年
关键词
Vector of Locally Aggregated Descriptors (VLAD); Image Retrieval; SIFT; Sparse Coding; mean average precision (mAP);
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Representing images and videos in the form of compact codes has emerged as an important research interest in the vision community, in the context of web scale image/video search. Recently proposed Vector of Locally Aggregated Descriptors (VLAD), has been shown to outperform the existing retrieval techniques, while giving a desired compact representation. VLAD aggregates the local features of an image in the feature space. In this paper, we propose to represent the local features extracted from an image, as sparse codes over an over-complete dictionary, which is obtained by K-SVD based dictionary training algorithm. The proposed VLAD aggregates the residuals in the space of these sparse codes, to obtain a compact representation for the image. Experiments are performed over the 'Holidays' database using SIFT features. The performance of the proposed method is compared with the original VLAD. The 4% increment in the mean average precision (mAP) indicates the better retrieval performance of the proposed sparse coding based VLAD.
引用
收藏
页数:4
相关论文
共 9 条
  • [1] [Anonymous], 2007, P IEEE CVPR
  • [2] Jegou H, 2008, LECT NOTES COMPUT SC, V5302, P304, DOI 10.1007/978-3-540-88682-2_24
  • [3] Jégou H, 2010, PROC CVPR IEEE, P3304, DOI 10.1109/CVPR.2010.5540039
  • [4] Packing bag-of-features
    Jegou, Herve
    Douze, Matthijs
    Schmid, Cordelia
    [J]. 2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, : 2357 - 2364
  • [5] Distinctive image features from scale-invariant keypoints
    Lowe, DG
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 60 (02) : 91 - 110
  • [6] Mairal J., 2009, P 26 ANN INT C MACH, P689
  • [7] Large-Scale Image Retrieval with Compressed Fisher Vectors
    Perronnin, Florent
    Liu, Yan
    Sanchez, Jorge
    Poirier, Herve
    [J]. 2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 3384 - 3391
  • [8] Video Google: A text retrieval approach to object matching in videos
    Sivic, J
    Zisserman, A
    [J]. NINTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS I AND II, PROCEEDINGS, 2003, : 1470 - +
  • [9] Turpin A., 2006, Proceedings of the Twenty-Ninth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P11, DOI 10.1145/1148170.1148176