Combining CPU and GPU architectures for fast similarity search

被引:23
作者
Krulis, Martin [1 ]
Skopal, Tomas [1 ]
Lokoc, Jakub [1 ]
Beecks, Christian [2 ]
机构
[1] Charles Univ Prague, Fac Math & Phys, SIRET Res Grp, Prague, Czech Republic
[2] Rhein Westfal TH Aachen, Data Management & Data Explorat Grp, Aachen, Germany
关键词
Similarity search; Database indexing; Parallel computing; GPU; Pivot table; Metric; Ptolemaic; Multimedia databases; DISTANCE;
D O I
10.1007/s10619-012-7092-4
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Signature Quadratic Form Distance on feature signatures represents a flexible distance-based similarity model for effective content-based multimedia retrieval. Although metric indexing approaches are able to speed up query processing by two orders of magnitude, their applicability to large-scale multimedia databases containing billions of images is still a challenging issue. In this paper, we propose a parallel approach that balances the utilization of CPU and many-core GPUs for efficient similarity search with the Signature Quadratic Form Distance. In particular, we show how to process multiple distance computations and other parts of the search procedure in parallel, achieving maximal performance of the combined CPU/GPU system. The experimental evaluation demonstrates that our approach implemented on a common workstation with 2 GPU cards outperforms traditional parallel implementation on a high-end 48-core NUMA server in terms of efficiency almost by an order of magnitude. If we consider also the price of the high-end server that is ten times higher than that of the GPU workstation then, based on price/performance ratio, the GPU-based similarity search beats the CPU-based solution by almost two orders of magnitude. Although proposed for the SQFD, our approach of fast GPU-based similarity search is applicable for any distance function that is efficiently parallelizable in the SIMT execution model.
引用
收藏
页码:179 / 207
页数:29
相关论文
共 31 条
  • [1] [Anonymous], 2004, P 2004 EUR ACM SIGGR, DOI DOI 10.1145/1057432.1057436
  • [2] Beecks C., 2011, P ACM INT C MULT RET, P24
  • [3] Beecks C, 2012, LECT NOTES COMPUT SC, V7131, P346
  • [4] A COMPARATIVE STUDY OF SIMILARITY MEASURES FOR CONTENT-BASED MULTIMEDIA RETRIEVAL
    Beecks, Christian
    Uysal, Merih Seran
    Seidl, Thomas
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2010), 2010, : 1552 - 1557
  • [5] Beecks Christian, 2009, P 17 ACM INT C MULT, P697, DOI DOI 10.1145/1631272.1631391
  • [6] Beecks Christian., 2010, ACM International Conference on Image and Video Retrieval, P438, DOI [10.1145/1816041.1816105, DOI 10.1145/1816041.1816105]
  • [7] Bolettieri P, 2009, COPHIR TEST COLLECTI
  • [8] Bustos B, 2006, LECT NOTES COMPUT SC, V3994, P196
  • [9] Searching in metric spaces
    Chávez, E
    Navarro, G
    BaezaYates, R
    Marroquín, JL
    [J]. ACM COMPUTING SURVEYS, 2001, 33 (03) : 273 - 321
  • [10] Features for image retrieval: an experimental comparison
    Deselaers, Thomas
    Keysers, Daniel
    Ney, Hermann
    [J]. INFORMATION RETRIEVAL, 2008, 11 (02): : 77 - 107