Performance analysis and architecture design for parallel EBCOT encoder of JPEG2000

被引:18
作者
Zhang, Yi-Zhen [1 ]
Xu, Chao
Wang, Wen-Tao
Chen, Liang-Bin
机构
[1] Peking Univ, State Key Lab Machine Percept, Beijing 100871, Peoples R China
[2] Telecom Paris, Signal & Image Proc Dept, F-75634 Paris 13, France
[3] Schlumberger China Ltd, Beijing 100080, Peoples R China
关键词
embedded block coding with optimized truncation (EBCOT); JPEG2000; parallel architectures; VLSI;
D O I
10.1109/TCSVT.2007.903789
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The algorithm of embedded block coding with optimization truncation (EBCOT) is one of the key techniques in JPEG2000 standard. The high-speed/performance hardware designs for EBCOT are critical to various high-resolution-applications, such as digital cameras, digital video recorders, and HDTV, etc. This paper presents a detailed performance analysis for EBCOT Tier-1 and its bit plane parallel architecture design. By analyzing the bit plane parallel context modeling, we conclude that the difference between the output rate of context modeling module and that of arithmetic coding module degrades the performance of whole EBCOT parallel encoding. We propose two improved methods; referred to as data-pairs ordering (DPO) and flexible MQ (FMQ) coder. It solves the configuration problem between the parallel context modeling module and the sequent arithmetic coding module, takes full advantage of the bit plane parallel encoding technique, and improves the coding speed and efficiency of the EBCOT encoder significantly. The design of parallel EBCOT encoder is tested on the field-programmable gate array (FPGA) platform of Altera Company. The simulation results show that it can on average encode 54 million samples at 55-MHz working frequency. It is equivalent to encoding a 9000 x 6000 image per second or encoding 720 p (1280 x 720, 4:.2:2) HDTV picture sequence nearly 30 frames per second. Compared with the conventional bit plane parallel architecture design, the proposed one can reduce execution time by 24%.
引用
收藏
页码:1336 / 1347
页数:12
相关论文
共 21 条
[1]   A high-performance JPEG2000 architecture [J].
Andra, K ;
Chakrabarti, C ;
Acharya, T .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2003, 13 (03) :209-218
[2]  
Andra K, 2001, INT CONF ACOUST SPEE, P1101, DOI 10.1109/ICASSP.2001.941112
[3]  
CHEN HH, 2002, P IEEE INT S CIRC SY, V4, P329
[4]  
Chiang JS, 2004, 2004 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL 3, PROCEEDINGS, P865
[5]  
Chiang JS, 2002, 2002 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL I, PROCEEDINGS, P773
[6]   Parallel embedded block coding architecture for JPEG 2000 [J].
Fang, HC ;
Chang, YW ;
Wang, TC ;
Lian, CJ ;
Chen, LG .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2005, 15 (09) :1086-1097
[7]  
Fang HC, 2003, PROCEEDINGS OF THE 2003 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL II, P736
[8]  
Gormish MJ, 2000, 2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL II, PROCEEDINGS, P29, DOI 10.1109/ICIP.2000.899217
[9]  
Han YJ, 2005, ASIA S PACIF DES AUT, P1284
[10]  
*ISO IEC, 2002, JTC1SC29WG1 ISOIEC