A Memory-Efficient Scalable Architecture for Lifting-Based Discrete Wavelet Transform

被引:23
作者
Hu, Yusong [1 ]
Jong, Ching Chuen [2 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Integrated Syst Res Lab, Singapore 639798, Singapore
[2] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
关键词
Discrete wavelet transform (DWT); lifting scheme; parallel 2-D DWT architecture; systolic array; VLSI ARCHITECTURE; HIGH-PERFORMANCE; 2-D DWT; SCHEME; IMPLEMENTATION; 1-D;
D O I
10.1109/TCSII.2013.2268335
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this brief, we propose a new parallel lifting-based 2-D DWT architecture with high memory efficiency and short critical path. The memory efficiency is achieved with a novel scanning method that enables tradeoff of external memory bandwidth and on-chip memory. Based on the data flow graph of the flipped lifting algorithm, processing units (PUs) are developed for maximally utilizing the inherent parallelism. With S number of PUs, the throughput can be scaled while keeping the latency constant. Compared with the best existing architecture, the proposed architecture requires less memory. For an N x N image, the proposed architecture consumes a total of only 3N + 24S words of transposition memory, temporal memory, and pipeline registers. The synthesized results in a 90-nm CMOS process show that it achieves better area-delay products than the best existing design by 32.3%, 31.5%, and 27.0% when S = 2, 4, and 8, respectively, and by 26%, 26%, and 22% when the overhead for buffering the required overlapped pixels is taken into account.
引用
收藏
页码:502 / 506
页数:5
相关论文
共 18 条