An Optimized Parallel IDCT on Graphics Processing Units

Cited by: 0
Authors
Wang, Biao [1]
Alvarez-Mesa, Mauricio [1]
Chi, Chi Ching [1]
Juurlink, Ben [1]
Affiliations
[1] Tech Univ Berlin, Berlin, Germany
Source
EURO-PAR 2012: PARALLEL PROCESSING WORKSHOPS | 2013 / Vol. 7640
Keywords
IDCT; GPU; H.264; OpenCL; parallel programming
DOI
Not available
Chinese Library Classification number
TP31 [Computer software]
Subject classification codes
081202; 0835
Abstract
In this paper we present an implementation of the H.264/AVC Inverse Discrete Cosine Transform (IDCT) optimized for Graphics Processing Units (GPUs) using OpenCL. By exploiting the fact that most of the IDCT input data for real videos are zero-valued coefficients, we create a new compacted data representation that allows for several optimizations. Experimental evaluations conducted on different GPUs show average speedups from 1.7x to 7.4x compared to an optimized single-threaded SIMD CPU version.
Pages: 155-164
Page count: 10
Related papers
50 in total
  • [21] An Optimized Multicolor Point-Implicit Solver for Unstructured Grid Applications on Graphics Processing Units
    Zubair, Mohammad
    Nielsen, Eric
    Luitjens, Justin
    Hammond, Dana
    PROCEEDINGS OF 2016 6TH WORKSHOP ON IRREGULAR APPLICATIONS: ARCHITECTURE AND ALGORITHMS (IA3), 2016, : 18 - 25
  • [22] Real-time massively parallel processing of Spectral Optical Coherence Tomography data on Graphics Processing Units
    Sylwestrzak, Marcin
    Szlag, Daniel
    Szkulmowski, Maciej
    Targowski, Piotr
    OPTICAL COHERENCE TOMOGRAPHY AND COHERENCE TECHNIQUES V, 2011, 8091
  • [23] Pipelined Iterative Solvers with Kernel Fusion for Graphics Processing Units
    Rupp, Karl
    Weinbub, Josef
    Juengel, Ansgar
    Grasser, Tibor
    ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2016, 43 (02):
  • [24] Rigid body constraints realized in massively-parallel molecular dynamics on graphics processing units
    Trung Dac Nguyen
    Phillips, Carolyn L.
    Anderson, Joshua A.
    Glotzer, Sharon C.
    COMPUTER PHYSICS COMMUNICATIONS, 2011, 182 (11) : 2307 - 2313
  • [25] Optical diagnostics of a single evaporating droplet using fast parallel computing on graphics processing units
    Jakubczyk, D.
    Migacz, S.
    Derkachov, G.
    Wozniak, M.
    Archer, J.
    Kolwas, K.
    OPTO-ELECTRONICS REVIEW, 2016, 24 (03) : 108 - 116
  • [26] Fast Parallel Hough Transform Linear Features Extracting Method Based on Graphics Processing Units
    Yan, Ran
    2017 10TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI), 2017
  • [27] Graphics Processing Units for HEP trigger systems
    Ammendola, R.
    Bauce, M.
    Biagioni, A.
    Chiozzi, S.
    Ramusino, A. Cotta
    Fantechi, R.
    Fiorini, M.
    Giagu, S.
    Gianoli, A.
    Lamanna, G.
    Lonardo, A.
    Messina, A.
    Neri, I.
    Paolucci, P. S.
    Piandani, R.
    Pontisso, L.
    Rescigno, M.
    Simula, F.
    Sozzi, M.
    Vicini, P.
    NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH SECTION A-ACCELERATORS SPECTROMETERS DETECTORS AND ASSOCIATED EQUIPMENT, 2016, 824 : 307 - 310
  • [28] Multilayer Packet Classification With Graphics Processing Units
    Varvello, Matteo
    Laufer, Rafael
    Zhang, Feixiong
    Lakshman, T. V.
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2016, 24 (05) : 2760 - 2773
  • [29] Programmable and Scalable Architecture for Graphics Processing Units
    de La Lama, Carlos S.
    Jaaskelainen, Pekka
    Takala, Jarmo
    EMBEDDED COMPUTER SYSTEMS: ARCHITECTURES, MODELING, AND SIMULATION, PROCEEDINGS, 2009, 5657 : 2 - +
  • [30] Accelerating Viterbi algorithm on graphics processing units
    Hanif, Muhammad Kashif
    Zimmermann, Karl-Heinz
    COMPUTING, 2017, 99 (11) : 1105 - 1123