Parallelization and Optimization of SIFT on GPU Using CUDA

被引:13
作者
Zhou, Yonglong [1 ]
Mei, Kuizhi [1 ]
Ji, Xiang [1 ]
Dong, Peixiang [1 ]
机构
[1] Xi An Jiao Tong Univ, Xian 710049, Peoples R China
来源
2013 IEEE 15TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2013 IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (HPCC_EUC) | 2013年
关键词
D O I
10.1109/HPCC.and.EUC.2013.192
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Scale-invariant feature transform (SIFT) based feature extraction algorithm is widely applied to extract features from images, and it is very attractive to accelerate these SIFT based algorithms on GPU. In this paper, we present several parallel computing strategies, implement and optimize the SIFT algorithm using CUDA programming model on GPU. Each stage of SIFT is analyzed in detail to choose the parallel strategy. On the basis of the elementary CUDA-SIFT and CUDA architecture, we optimize the implementation from several aspects to speedup the CUDA-SIFT. Experimental results demonstrate that our implementation after optimization is 2.5 times faster than previous optimization, and our CUDA based SIFT can run at the speed of 20 frames per second on most images with 1280x960 resolution in the test. Using 1920x1440 image to test, we have obtained a speed of 11 frames per second on average, which is about 60 times faster than the CPU implementation of SIFT. In short, our implementation obtains appropriate accuracy and higher efficiency compared to CPU implementations and other GPU implementations, which is attributed to our dedicated optimization strategies.
引用
收藏
页码:1351 / 1358
页数:8
相关论文
共 13 条
  • [1] [Anonymous], 2010, P 18 ACM INT C MULT
  • [2] [Anonymous], P INT C CENTR EUR CO
  • [3] Feng H, 2008, I S WORKL CHAR PROC, P12
  • [4] Glaskowsky P.N., 2009, NVIDIA's Fermi : the first complete GPU computing architecture
  • [5] Harris M., 2007, SC07: High Performance Computing With CUDA
  • [6] Lowe D. G., 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision, P1150, DOI 10.1109/ICCV.1999.790410
  • [7] Distinctive image features from scale-invariant keypoints
    Lowe, DG
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 60 (02) : 91 - 110
  • [8] Mikolajczyk K., 2007, COLLABORATIVE WORK V
  • [9] NVIDIA C., 2011, NVIDIA CUDA PROGR GU
  • [10] Feature tracking and matching in video using programmable graphics hardware
    Sinha, Sudipta N.
    Frahm, Jan-Michael
    Pollefeys, Marc
    Genc, Yakup
    [J]. MACHINE VISION AND APPLICATIONS, 2011, 22 (01) : 207 - 217