State-of-the-art and trends in scalable video compression with wavelet-based approaches

被引:62
作者
Adami, Nicola [1 ]
Signoroni, Alberto [1 ]
Leonardi, Riccardo [1 ]
机构
[1] Univ Brescia, Fac Engn, Dept Elect & Automat, I-25123 Brescia, Italy
关键词
Entropy coding; motion compensated temporal filtering (MCTF); MPEG; Scalable Video Coding (SVC); spatio-temporal multiresolution representations; video coding architectures; video quality assessment; wavelets;
D O I
10.1109/TCSVT.2007.906828
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Scalable Video Coding (SVC) differs form traditional single point approaches mainly because it allows to encode in a unique bit stream several working points corresponding to different quality, picture size and frame rate. This work describes the current state-of-the-art in SVC, focusing on wavelet based motion-compensated approaches (WSVC). It reviews individual components that have been designed to address the problem over the years and how such components are typically combined to achieve meaningful WSVC architectures. Coding schemes which mainly differ from the space-time order in which the wavelet transforms operate are here compared, discussing strengths and weaknesses of the resulting implementations. An evaluation of the achievable coding performances is provided considering the reference architectures studied and developed by ISO/MPEG in its exploration on WSVC. The paper also attempts to draw a list of major differences between wavelet based solutions and the SVC standard jointly targeted by ITU and ISO/MPEG. A major emphasis is devoted to a promising WSVC solution, named STP-tool, which presents architectural similarities with respect to the SVC standard. The paper ends drawing some evolution trends for WSVC systems and giving insights on video coding applications which could benefit by a wavelet based approach.
引用
收藏
页码:1238 / 1255
页数:18
相关论文
共 86 条
[1]   A fully scalable video coder with inter-scale wavelet prediction and morphological coding [J].
Adami, N ;
Brescianini, M ;
Dalai, M ;
Leonardi, R ;
Signoroni, A .
VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2005, PTS 1-4, 2005, 5960 :535-546
[2]  
ADAMI N, 2004, 70 MPEG M PALM MALL
[3]   Complete-to-overcomplete discrete wavelet transforms: Theory and applications [J].
Andreopoulos, Y ;
Munteanu, A ;
Van der Auwera, G ;
Cornelis, JPH ;
Schelkens, P .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2005, 53 (04) :1398-1412
[4]   In-band motion compensated temporal filtering [J].
Andreopoulos, Y ;
Munteanu, A ;
Barbarien, J ;
Van der Schaar, M ;
Cornelis, J ;
Schelkens, P .
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2004, 19 (07) :653-673
[5]  
[Anonymous], 1992, Multirate Systems and Filter Banks
[6]   Motion and texture rate-allocation for prediction-based scalable motion-vector coding [J].
Barbarien, J ;
Munteanu, A ;
Verdicchio, F ;
Andreopoulos, Y ;
Cornelis, J ;
Schelkens, P .
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2005, 20 (04) :315-342
[7]  
BEERMANN M, 2005, 74 MPEG M NIC FRANC
[8]  
Bottreau V, 2001, 2001 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL II, PROCEEDINGS, P1017, DOI 10.1109/ICIP.2001.958669
[9]  
BOTTREAU V, 2004, 70 MPEG M PALM MALL
[10]   THE LAPLACIAN PYRAMID AS A COMPACT IMAGE CODE [J].
BURT, PJ ;
ADELSON, EH .
IEEE TRANSACTIONS ON COMMUNICATIONS, 1983, 31 (04) :532-540