Practical scalable image analysis and indexing using Hadoop

被引:7
作者
Hare, Jonathon S. [1 ]
Samangooei, Sina [1 ]
Lewis, Paul H. [1 ]
机构
[1] Univ Southampton, Sch Elect & Comp Sci, Southampton SO17 1BJ, Hants, England
关键词
MapReduce; Hadoop; Bag of visual words; Image retrieval; SCALE;
D O I
10.1007/s11042-012-1256-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The ability to handle very large amounts of image data is important for image analysis, indexing and retrieval applications. Sadly, in the literature, scalability aspects are often ignored or glanced over, especially with respect to the intricacies of actual implementation details. In this paper we present a case-study showing how a standard bag-of-visual-words image indexing pipeline can be scaled across a distributed cluster of machines. In order to achieve scalability, we investigate the optimal combination of hybridisations of the MapReduce distributed computational framework which allows the components of the analysis and indexing pipeline to be effectively mapped and run on modern server hardware. We then demonstrate the scalability of the approach practically with a set of image analysis and indexing tools built on top of the Apache Hadoop MapReduce framework. The tools used for our experiments are freely available as open-source software, and the paper fully describes the nuances of their implementation.
引用
收藏
页码:1215 / 1248
页数:34
相关论文
共 49 条
  • [41] Scaling analysis of a neocortex inspired cognitive model on the Cray XD1
    Rice, Kenneth L.
    Taha, Tarek M.
    Vutsinas, Christopher N.
    [J]. JOURNAL OF SUPERCOMPUTING, 2009, 47 (01) : 21 - 43
  • [42] Shan Y, 2010, FPGA 10, P93
  • [43] Video Google: A text retrieval approach to object matching in videos
    Sivic, J
    Zisserman, A
    [J]. NINTH IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION, VOLS I AND II, PROCEEDINGS, 2003, : 1470 - +
  • [44] Content-based image retrieval at the end of the early years
    Smeulders, AWM
    Worring, M
    Santini, S
    Gupta, A
    Jain, R
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (12) : 1349 - 1380
  • [45] Subramanya A, 2009, NIPS WORKSH LEARN CO
  • [46] White B., 2010, MDMKDD, P9
  • [47] Xu NY, 2009, ACM T RECONFIG TECHN, V1, P19
  • [48] Ye J., 2009, P 18 ACM C INF KNOWL, P2061, DOI 10.1145/1645953.1646301
  • [49] Yibin Li, 2009, 2009 IEEE International Conference on Automation and Logistics (ICAL), P1957, DOI 10.1109/ICAL.2009.5262626