An Optimized Parallel IDCT on Graphics Processing Units

被引:0
|
作者
Wang, Biao [1 ]
Alvarez-Mesa, Mauricio [1 ]
Chi, Chi Ching [1 ]
Juurlink, Ben [1 ]
机构
[1] Tech Univ Berlin, Berlin, Germany
关键词
IDCT; GPU; H.264; OpenCL; parallel programming;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper we present an implementation of the H.264/AVC Inverse Discrete Cosine Transform (IDCT) optimized for Graphics Processing Units (GPUs) using OpenCL. By exploiting that most of the input data of the IDCT for real videos are zero valued coefficients a new compacted data representation is created that allows for several optimizations. Experimental evaluations conducted on different GPUs show average speedups from 1.7x to 7.4x compared to an optimized single-threaded SIMD CPU version.
引用
收藏
页码:155 / 164
页数:10
相关论文
共 50 条
  • [31] Passive Radar Parallel Processing Using General-Purpose Computing on Graphics Processing Units
    Szczepankiewicz, Karolina
    Malanowski, Mateusz
    Szczepankiewicz, Michal
    INTERNATIONAL JOURNAL OF ELECTRONICS AND TELECOMMUNICATIONS, 2015, 61 (04) : 357 - 363
  • [32] The fast multipole method on parallel clusters, multicore processors, and graphics processing units
    Darve, Eric
    Cecka, Cris
    Takahashi, Toru
    COMPTES RENDUS MECANIQUE, 2011, 339 (2-3): : 185 - 193
  • [33] Parallel medical image reconstruction: from graphics processing units (GPU) to Grids
    Schellmann, Maraike
    Gorlatch, Sergei
    Meilaender, Dominik
    Koesters, Thomas
    Schaefers, Klaus
    Wuebbeling, Frank
    Burger, Martin
    JOURNAL OF SUPERCOMPUTING, 2011, 57 (02): : 151 - 160
  • [34] Parallel Computation of Trajectories Using Graphics Processing Units and Interpolated Gravity Models
    Arora, Nitin
    Vittaldev, Vivek
    Russell, Ryan P.
    JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2015, 38 (08) : 1345 - 1355
  • [35] Data-Parallel High-Precision Multiplication on Graphics Processing Units
    Isupov, Konstantin
    Kuvaev, Alexander
    Knyazkov, Vladimir
    SUPERCOMPUTING (RUSCDAYS 2019), 2019, 1129 : 15 - 25
  • [36] Massively parallel simulations of relativistic fluid dynamics on graphics processing units with CUDA
    Bazow, Dennis
    Heinz, Ulrich
    Strickland, Michael
    COMPUTER PHYSICS COMMUNICATIONS, 2018, 225 : 92 - 113
  • [37] Massively Parallel Two-Dimensional TLM Algorithm on Graphics Processing Units
    Rossi, Filippo V.
    So, Poman P. M.
    Fichtner, Nikolaus
    Russer, Peter
    2008 IEEE MTT-S INTERNATIONAL MICROWAVE SYMPOSIUM DIGEST, VOLS 1-4, 2008, : 153 - +
  • [38] Highly Parallel Decoding of Space-Time Codes on Graphics Processing Units
    Bollapalli, Kalyana C.
    Wu, Yiyue
    Gulati, Kanupriya
    Khatri, Sunil
    Calderbank, A. Robert
    2009 47TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING, VOLS 1 AND 2, 2009, : 1262 - +
  • [39] Parallel Electronic Structure Calculations Using Multiple Graphics Processing Units (GPUs)
    Hakala, Samuli
    Havu, Ville
    Enkovaara, Jussi
    Nieminen, Risto
    APPLIED PARALLEL AND SCIENTIFIC COMPUTING (PARA 2012), 2013, 7782 : 63 - 76
  • [40] Parallel unmixing of remotely sensed hyperspectral images on commodity graphics processing units
    Sanchez, Sergio
    Paz, Abel
    Martin, Gabriel
    Plaza, Antonio
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2011, 23 (13): : 1538 - 1557