GPU Acceleration of Object Detection on Video Stream using CUDA

被引:0
作者
Dawwd, Shefa A. [1 ]
Salim, Ula T. [1 ]
机构
[1] Univ Mosul, Dept Comp Engn, Coll Engn, Mosul, Iraq
来源
2013 FIRST INTERNATIONAL SCIENTIFIC CONFERENCE ON ELECTRICAL, COMMUNICATION, COMPUTER, POWER, AND CONTROL ENGINEERING (ICECCPCE'13) | 2013年
关键词
CUDA; GPU; Object detection; PSAD; SAD; sliding window; template matching; video stream;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Object detection is one of the important applications of the computer vision, image and video processing. However the best accuracy and fast invariant detecting function under changing object states such as (position, scale, illumination and noise) is a central aspect problem of the object detection in the video frames and images that cannot be realized by using sequential processing with a single core General Purpose Central Processing Unit (GPCPU). In this paper, to solve these problems and speed up the highly intensive calculation required,, a simple and an efficient template matching algorithm of object detection is proposed. It is based on using sliding window across the video frame with applying two similarity measures as a window function: Sum of Absolute Difference (SAD) and pyramid downscale multi-resolution Sum of Absolute Difference that is called (PSAD). The implementation operation is achieved by using Graphic Processing Unit (GPU) that is based on using parallel processing technique, Data Level Parallelism (DLP), and single instruction multiple data (SIMD) operations with Compute Unified Device Architecture (CUDA). For both SAD and PSAD a comparable speedup of 161x and 97x is achieved respectively for an image size of 768x567 using MATLAB environment.
引用
收藏
页码:198 / 203
页数:6
相关论文
共 12 条
[1]  
Alsaade Fawaz, 2012, RES J INFORM TECHNOL, V4, P204, DOI [10.3923/rjit.2012.204.211, DOI 10.3923/RJIT.2012.204.211]
[2]  
[Anonymous], 2007, NVIDIA CUDA PROGR GU
[3]  
CHITU M., 2010, OPEN SOURCE SCI J, V2
[4]  
Choudhary B.K., 2012, J. Inf. Syst. Commun, V3, P269
[5]  
Dawoud N., 2012, P INT MULTICONFERENC, V1
[6]  
Fowers J, 2012, FPGA 12: PROCEEDINGS OF THE 2012 ACM-SIGDA INTERNATIONAL SYMPOSIUM ON FIELD PROGRAMMABLE GATE ARRAYS, P47
[7]  
Gao W., 2012, SIMPLE METHOD ACCELE
[8]  
Kyrkou C., 2010, THESIS, P116
[9]  
Mehta S, 2010, HIPC C
[10]  
Moore N., 2011, Proceedings of the 2011 Symposium on Application Accelerators in High-Performance Computing (SAAHPC 2011), P103, DOI 10.1109/SAAHPC.2011.11