An Optimized Parallel IDCT on Graphics Processing Units

被引:0
|
作者
Wang, Biao [1 ]
Alvarez-Mesa, Mauricio [1 ]
Chi, Chi Ching [1 ]
Juurlink, Ben [1 ]
机构
[1] Tech Univ Berlin, Berlin, Germany
关键词
IDCT; GPU; H.264; OpenCL; parallel programming;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper we present an implementation of the H.264/AVC Inverse Discrete Cosine Transform (IDCT) optimized for Graphics Processing Units (GPUs) using OpenCL. By exploiting that most of the input data of the IDCT for real videos are zero valued coefficients a new compacted data representation is created that allows for several optimizations. Experimental evaluations conducted on different GPUs show average speedups from 1.7x to 7.4x compared to an optimized single-threaded SIMD CPU version.
引用
收藏
页码:155 / 164
页数:10
相关论文
共 50 条
  • [1] Optimized Parallel Implementation of Gillespie's First Reaction Method on Graphics Processing Units
    Dittamo, Cristian
    Cangelosi, Davide
    2009 INTERNATIONAL CONFERENCE ON COMPUTER MODELING AND SIMULATION, PROCEEDINGS, 2009, : 156 - 161
  • [2] Parallel pattern mining on Graphics Processing Units
    Hryniow, Krzysztof
    PROCEEDINGS OF THE 2013 14TH INTERNATIONAL CARPATHIAN CONTROL CONFERENCE (ICCC), 2013, : 134 - 139
  • [3] A Parallel Algebraic Multigrid Solver on Graphics Processing Units
    Haase, Gundolf
    Liebmann, Manfred
    Douglas, Craig C.
    Plank, Gernot
    HIGH PERFORMANCE COMPUTING AND APPLICATIONS, 2010, 5938 : 38 - +
  • [4] Parallel computing on graphics processing units and heterogeneous platforms
    Bientinesi, Paolo
    Herrero, Jose R.
    Quintana-Orti, Enrique S.
    Strzodka, Robert
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2015, 27 (06): : 1525 - 1527
  • [5] Heterogeneous multicore parallel programming for graphics processing units
    Bodin, Francois
    Bihan, Stephane
    SCIENTIFIC PROGRAMMING, 2009, 17 (04) : 325 - 335
  • [6] Parallel Ant Colony Optimization on Graphics Processing Units
    Delevacq, Audrey
    Delisle, Pierre
    Gravel, Marc
    Krajecki, Michael
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2013, 73 (01) : 52 - 61
  • [7] PARALLEL INVERSION OF GEOELECTRICAL DATA WITH CLUSTER UNITS AND GRAPHICS PROCESSING UNITS
    Pieta, Anna
    Bala, Justyna
    12TH INTERNATIONAL MULTIDISCIPLINARY SCIENTIFIC GEOCONFERENCE, SGEM 2012, VOL. III, 2012, : 199 - 204
  • [8] Parallel Mining of Neuronal Spike Streams on Graphics Processing Units
    Cao, Yong
    Patnaik, Debprakash
    Ponce, Sean
    Archuleta, Jeremy
    Butler, Patrick
    Feng, Wu-chun
    Ramakrishnan, Naren
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2012, 40 (06) : 605 - 632
  • [9] Massively Parallel Expectation Maximization Using Graphics Processing Units
    Altinigneli, Muzaffer Can
    Plant, Claudia
    Boehm, Christian
    19TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'13), 2013, : 838 - 846
  • [10] Parallel UPGMA Algorithm on Graphics Processing Units Using CUDA
    Chen, Yu-Rong
    Hung, Che Lun
    Lin, Yu-Shiang
    Lin, Chun-Yuan
    Lee, Tien-Lin
    Lee, Kual-Zheng
    2012 IEEE 14TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2012 IEEE 9TH INTERNATIONAL CONFERENCE ON EMBEDDED SOFTWARE AND SYSTEMS (HPCC-ICESS), 2012, : 849 - 854