Design and Performance Evaluation of Image Processing Algorithms on GPUs

被引:79
作者
Park, In Kyu [1 ]
Singhal, Nitin [2 ]
Lee, Man Hee [1 ]
Cho, Sungdae [2 ]
Kim, Chris W. [3 ]
机构
[1] Inha Univ, Sch Informat & Commun Engn, Inchon 402751, South Korea
[2] Samsung Elect Co Ltd, Telecommun Module Lab, Digital Media & Commun R&D Ctr, Suwon 443742, South Korea
[3] NVIDIA Corp, Seoul 135729, South Korea
关键词
GPU; CUDA; image processing; parallel implementation; GPGPU; ARCHITECTURE; COMPRESSION;
D O I
10.1109/TPDS.2010.115
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we construe key factors in design and evaluation of image processing algorithms on the massive parallel graphics processing units (GPUs) using the compute unified device architecture (CUDA) programming model. A set of metrics, customized for image processing, is proposed to quantitatively evaluate algorithm characteristics. In addition, we show that a range of image processing algorithms map readily to CUDA using multiview stereo matching, linear feature extraction, JPEG2000 image encoding, and nonphotorealistic rendering (NPR) as our example applications. The algorithms are carefully selected from major domains of image processing, so they inherently contain a variety of subalgorithms with diverse characteristics when implemented on the GPU. Performance is evaluated in terms of execution time and is compared to the fastest host-only version implemented using OpenMP. It is shown that the observed speedup varies extensively depending on the characteristics of each algorithm. Intensive analysis is conducted to show the appropriateness of the proposed metrics in predicting the effectiveness of an application for parallel implementation.
引用
收藏
页码:91 / 104
页数:14
相关论文
共 36 条
[1]  
Adams MD, 2000, 2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL II, PROCEEDINGS, P53, DOI 10.1109/ICIP.2000.899223
[2]  
Allusse Yannick., 2008, Proceedings of the 16th ACM international conference on Multimedia, P1089
[3]   A VLSI architecture for lifting-based forward and inverse wavelet transform [J].
Andra, K ;
Chakrabarti, C ;
Acharya, T .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2002, 50 (04) :966-977
[4]  
[Anonymous], P ACM SIGGRAPH 07 A
[5]  
[Anonymous], 2000, 154441 ISOIEC
[6]  
[Anonymous], COMP UN DEV ARCH CUD
[7]  
[Anonymous], P IEEE INT C IM PROC
[8]  
[Anonymous], OP COMP LANG OPENCL
[9]  
[Anonymous], EURASIP J APPL SIGNA
[10]  
[Anonymous], CUJ2K JPEG2000 ENCOD