A very low bit rate video coding combined with fast adaptive block size motion estimation and nonuniform scalar quantization multiwavelet transform

Cited by: 4
Authors
Chen, JH [1]
Zhou, JL [1]
Yu, SS [1]
Xu, J [1]
Zhong, L [1]
Zheng, JH [1]
Affiliation
[1] Huazhong Univ Sci & Technol, Comp Sch Sci & Technol, Wuhan 430074, Hubei, Peoples R China
Keywords
motion estimation; video compression; scalar quantization; wavelet transform coding
DOI
10.1007/s11042-005-6852-9
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
We describe a very low bit rate video coding framework in which motion correlation between successive video frames is exploited in the multiwavelet transform domain. The H.26L standard uses several complex techniques, such as spatial prediction in intra coding, adaptive block size motion estimation, multiple previous frames for prediction in inter frames, and context-adaptive binary arithmetic coding (CABAC). Test results show that H.26L can greatly outperform MPEG-4 ASP in both PSNR and visual quality. However, H.26L encoding is slow, because fast motion search is difficult to apply in adaptive block size motion estimation and CABAC needs considerable time to generate the code list for entropy coding. In contrast, zerotree wavelet coding generates only four types of symbol, so its entropy coding can take less time than CABAC. Moreover, if an 8x8 block is selected as the basic mode, and neighboring 8x8 blocks are merged into a larger block size mode when they have the same reference frame and motion vector, then fast motion estimation becomes feasible. Accordingly, a fast motion search algorithm, a multiwavelet transform, and a novel adaptive quantization scheme are applied in the proposed coding framework. Experimental results reveal a 0.2-0.5 dB increase in coded PSNR at low bit rates over the state-of-the-art H.26L recommendation, and similar improvements over MPEG-4 at high bit rates, with a considerable improvement in subjective reconstruction quality, while simultaneously supporting a scalable representation.
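The block-merging idea mentioned in the abstract can be illustrated with a minimal sketch. The abstract only states that neighboring 8x8 blocks with the same reference frame and motion vector are merged into a larger mode; the specific merge order below (16x16, then 16x8, then 8x16), the `BlockMotion` type, and the `merge_mode` function are assumptions made for illustration and are not taken from the paper.

```python
from typing import NamedTuple, Tuple


class BlockMotion(NamedTuple):
    """Motion data of one 8x8 block (hypothetical structure)."""
    ref_frame: int        # index of the reference frame used for prediction
    mv: Tuple[int, int]   # motion vector as (dx, dy)


def merge_mode(tl: BlockMotion, tr: BlockMotion,
               bl: BlockMotion, br: BlockMotion) -> str:
    """Pick a partition mode for a 16x16 area made of four 8x8 blocks.

    Blocks are merged only when they share both the reference frame
    and the motion vector (NamedTuple equality compares both fields).
    The merge order is an assumption of this sketch.
    """
    if tl == tr == bl == br:
        return "16x16"            # all four blocks agree
    if tl == tr and bl == br:
        return "16x8"             # top pair and bottom pair each agree
    if tl == bl and tr == br:
        return "8x16"             # left pair and right pair each agree
    return "8x8"                  # no merge possible, keep the basic mode


if __name__ == "__main__":
    a = BlockMotion(ref_frame=0, mv=(1, -2))
    b = BlockMotion(ref_frame=1, mv=(1, -2))  # same vector, different reference
    print(merge_mode(a, a, a, a))
    print(merge_mode(a, a, b, b))
    print(merge_mode(a, b, b, a))
```

In this sketch, motion search runs only at the 8x8 level and larger modes fall out of a comparison step, which is one way to read the abstract's claim that the merge rule makes fast motion estimation feasible.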
Pages: 123-144
Page count: 22