A Real-time GPU Implementation of the SIFT Algorithm for Large-Scale Video Analysis Tasks

被引:13
作者
Fassold, Hannes [1 ]
Rosner, Jakub [2 ]
机构
[1] JOANNEUM RES, DIGITAL Inst Informat & Commun Technol, A-8010 Graz, Austria
[2] Silesian Tech Univ, PhD Fac Data Min, PL-44100 Gliwice, Poland
来源
REAL-TIME IMAGE AND VIDEO PROCESSING 2015 | 2015年 / 9400卷
关键词
SIFT descriptor; real-time processing; GPU; CUDA;
D O I
10.1117/12.2083201
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The SIFT algorithm is one of the most popular feature extraction methods and therefore widely used in all sort of video analysis tasks like instance search and duplicate/ near-duplicate detection. We present an efficient GPU implementation of the SIFT descriptor extraction algorithm using CUDA. The major steps of the algorithm are presented and for each step we describe how to efficiently parallelize it massively, how to take advantage of the unique capabilities of the GPU like shared memory / texture memory and how to avoid or minimize common GPU performance pitfalls. We compare the GPU implementation with the reference CPU implementation in terms of runtime and quality and achieve a speedup factor of approximately 3 - 5 for SD and 5 - 6 for Full HD video with respect to a multi-threaded CPU implementation, allowing us to run the SIFT descriptor extraction algorithm in real-time on SD video. Furthermore, quality tests show that the GPU implementation gives the same quality as the reference CPU implementation from the HessSIFT library. We further describe the benefits of GPU-accelerated SIFT descriptor calculation for video analysis applications such as near-duplicate video detection.
引用
收藏
页数:8
相关论文
共 14 条
[1]  
Adams A., 2011, COMPUTING SINGULAR V
[2]  
[Anonymous], 2006, WORKSH EDG COMP US N
[3]  
[Anonymous], 2009, BMVC
[4]  
Bailer W., 2010, Proceedings 2010 Conference on Visual Media Production (CVMP 2010). 7th European Conference on Visual Media Production, P17, DOI 10.1109/CVMP.2010.10
[5]   SURF: Speeded up robust features [J].
Bay, Herbert ;
Tuytelaars, Tinne ;
Van Gool, Luc .
COMPUTER VISION - ECCV 2006 , PT 1, PROCEEDINGS, 2006, 3951 :404-417
[6]   Scalable Learning for Object Detection with GPU Hardware [J].
Coates, Adam ;
Baumstarck, Paul ;
Le, Quoc ;
Ng, Andrew Y. .
2009 IEEE-RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, 2009, :4287-+
[7]  
HESS R., 2010, P INT C MULTIMEDIA, P1493, DOI DOI 10.1145/1873951.1874256
[8]  
Heymann S., 2007, WSCG 07
[9]  
Jégou H, 2010, PROC CVPR IEEE, P3304, DOI 10.1109/CVPR.2010.5540039
[10]  
Leutenegger S, 2011, IEEE I CONF COMP VIS, P2548, DOI 10.1109/ICCV.2011.6126542