A Self-Reconfigurable Platform for Scalable DCT Computation Using Compressed Partial Bitstreams and BlockRAM Prefetching

被引:14
作者
Huang, Jian [1 ]
Lee, Jooheung [1 ]
机构
[1] Univ Cent Florida, Sch Elect Engn & Comp Sci, Orlando, FL 32816 USA
关键词
Data compression; DCT; FPGA; reconfigurable architectures; video coding;
D O I
10.1109/TCSVT.2009.2031464
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose a self-reconfigurable platform which can reconfigure the architecture of discrete cosine transform (DCT) computations during run-time using dynamic partial reconfiguration. The scalable architecture of DCT computations can compute different numbers of DCT coefficients in a zig-zag scan order to adapt to different requirements, such as power consumption, hardware resources, and performance. We propose a configuration manager, which is implemented in the embedded processor in order to adaptively control the reconfiguration of scalable DCT architecture during run-time. In addition, we use the Lempel-Ziv-Storer-Szymanski algorithm for compression of the partial bitstreams and on-chip BlockRAM as a cache to reduce latency overhead for loading the partial bitstreams from the off-chip memory for run-time reconfiguration. A hardware module is designed for parallel reconfiguration of the partial bitstreams. The experimental results show that our approach can reduce the external memory accesses by 69% and can achieve a 400 MB/s reconfiguration rate. Detailed trade-offs of power, throughput, and quality are investigated, and used as a criterion for self-reconfiguration.
引用
收藏
页码:1623 / 1632
页数:10
相关论文
共 18 条
  • [1] Self-reconfiguration on Spartan-III FPGAs with compressed partial bitstreams via a parallel configuration access port (cPCAP) core
    Bayar, Salih
    Yurdakul, Arda
    [J]. PRIME: 2008 PHD RESEARCH IN MICROELECTRONICS AND ELECTRONICS, PROCEEDINGS, 2008, : 137 - 140
  • [2] Braun L, 2008, I C FIELD PROG LOGIC, P606
  • [3] A low power 8 x 8 direct 2-D DCT chip design
    Chang, HC
    Jiu, JY
    Chen, LL
    Chen, LG
    [J]. JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2000, 26 (03): : 319 - 332
  • [4] Clausen C. A., 2007, Document - International Research Group on Wood Protection, P1
  • [5] DIPPERSTEIN M, 2008, LZSS LZ77 DISCUSSION
  • [6] Design and implementaion of a 2D-DCT architecture using coefficient distributed arithmetic
    Ghosh, S
    Venigalla, S
    Bayoumi, M
    [J]. IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI, PROCEEDINGS: NEW FRONTIERS IN VLSI DESIGN, 2005, : 162 - 166
  • [7] Göhringer D, 2008, I C FIELD PROG LOGIC, P494, DOI 10.1109/FPL.2008.4629992
  • [8] New cost-effective VLSI implementation of a 2-D discrete cosine transform and its inverse
    Gong, D
    He, Y
    Cao, ZG
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2004, 14 (04) : 405 - 415
  • [9] Khu A., 2001, XILINX FPGA CONFIGUR
  • [10] Kinane A, 2004, LECT NOTES COMPUT SC, V3254, P780