An Optimized Parallel IDCT on Graphics Processing Units

被引:0
|
作者
Wang, Biao [1 ]
Alvarez-Mesa, Mauricio [1 ]
Chi, Chi Ching [1 ]
Juurlink, Ben [1 ]
机构
[1] Tech Univ Berlin, Berlin, Germany
关键词
IDCT; GPU; H.264; OpenCL; parallel programming;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper we present an implementation of the H.264/AVC Inverse Discrete Cosine Transform (IDCT) optimized for Graphics Processing Units (GPUs) using OpenCL. By exploiting that most of the input data of the IDCT for real videos are zero valued coefficients a new compacted data representation is created that allows for several optimizations. Experimental evaluations conducted on different GPUs show average speedups from 1.7x to 7.4x compared to an optimized single-threaded SIMD CPU version.
引用
收藏
页码:155 / 164
页数:10
相关论文
共 50 条
  • [41] Parallel medical image reconstruction: from graphics processing units (GPU) to Grids
    Maraike Schellmann
    Sergei Gorlatch
    Dominik Meiländer
    Thomas Kösters
    Klaus Schäfers
    Frank Wübbeling
    Martin Burger
    The Journal of Supercomputing, 2011, 57 : 151 - 160
  • [42] Parallel Power Flow on Graphics Processing Units for Concurrent Evaluation of Many Networks
    Roberge, Vincent
    Tarbouchi, Mohammed
    Okou, Francis
    IEEE TRANSACTIONS ON SMART GRID, 2017, 8 (04) : 1639 - 1648
  • [43] NPGPU: Network Processing on Graphics Processing Units
    Deng, Yangdong
    Jiao, Xiaomemg
    Mu, Shuai
    Kang, Kang
    Zhu, Yuhao
    THEORETICAL AND MATHEMATICAL FOUNDATIONS OF COMPUTER SCIENCE, 2011, 164 : 313 - +
  • [44] A survey of graph processing on graphics processing units
    Ha-Nguyen Tran
    Cambria, Erik
    JOURNAL OF SUPERCOMPUTING, 2018, 74 (05): : 2086 - 2115
  • [45] A survey of graph processing on graphics processing units
    Ha-Nguyen Tran
    Erik Cambria
    The Journal of Supercomputing, 2018, 74 : 2086 - 2115
  • [46] Parallel Multigrid Preconditioning on Graphics Processing Units (GPUs) for Robust Power Grid Analysis
    Feng, Zhuo
    Zeng, Zhiyu
    PROCEEDINGS OF THE 47TH DESIGN AUTOMATION CONFERENCE, 2010, : 661 - 666
  • [47] Parallel genetic algorithms on the graphics processing units using island model and simulated annealing
    Li, Cheng-Chieh
    Lin, Chu-Hsing
    Liu, Jung-Chun
    ADVANCES IN MECHANICAL ENGINEERING, 2017, 9 (07):
  • [48] GPUDePiCt: A Parallel Implementation of a Clustering Algorithm for Computing Degenerate Primers on Graphics Processing Units
    Cickovski, Trevor
    Flor, Tiffany
    Irving-Sachs, Galen
    Novikov, Philip
    Parda, James
    Narasimhan, Giri
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2015, 12 (02) : 445 - 454
  • [49] Alinea: An Advanced Linear Algebra Library for Massively Parallel Computations on Graphics Processing Units
    Magoules, Frederic
    Ahamed, Abal-Kassim Cheik
    INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2015, 29 (03): : 284 - 310
  • [50] Efficient Parallel Lists Intersection and Index Compression Algorithms using Graphics Processing Units
    Ao, Naiyong
    Zhang, Fan
    Wu, Di
    Stones, Douglas S.
    Wang, Gang
    Liu, Xiaoguang
    Liu, Jing
    Lin, Sheng
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2011, 4 (08): : 470 - 481