Direct mode coding for bipredictive slices in the H.264 standard

Cited by: 57
Authors
Tourapis, AM [1]
Wu, F
Li, SP
Affiliations
[1] DoCoMo Commun Labs USA Inc, San Jose, CA 95110 USA
[2] Microsoft Res Asia, Beijing 100080, Peoples R China
Keywords
biprediction; DIRECT mode; H.264; motion compensation; MPEG-4; AVC; spatial correlation; temporal correlation; video coding;
DOI
10.1109/TCSVT.2004.837021
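Chinese Library Classification (CLC) code
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code
0808; 0809;
Abstract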
The new H.264 (MPEG-4 AVC) video coding standard achieves considerably higher coding efficiency than previous standards. This gain comes mainly from variable block sizes for motion compensation, multiple reference frames, and intra prediction, but also from better exploitation of the spatiotemporal correlation that may exist between adjacent macroblocks, through the SKIP mode in predictive (P) slices and the two DIRECT modes in bipredictive (B) slices. These modes, when signaled, can in effect represent the motion of a macroblock (MB) or block without transmitting the additional motion information required by other inter-MB types. This property also makes these modes highly compressible, especially when run-length coding strategies are used. Although SKIP mode uses the spatial correlation of motion vectors from adjacent MBs to predict its motion parameters, DIRECT mode until recently considered only the temporal correlation of adjacent pictures. In this letter, we introduce alternative methods for generating the motion information for DIRECT mode using spatial or combined spatiotemporal correlation. Temporal correlation requires that the motion and timestamp information from previous pictures be available in both the encoder and decoder; we show that our spatial-only method can reduce or eliminate such requirements while achieving similar performance. The combined methods, on the other hand, jointly exploit spatial and temporal correlation at either the MB or slice/picture level and can achieve even higher coding efficiency. Finally, improvements to the existing rate-distortion optimization for B slices within the H.264 codec are also presented, which can yield bit rate reductions of up to 16% or, equivalently, PSNR gains of more than 0.7 dB.
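To make the two kinds of correlation in the abstract concrete, the following is a minimal sketch, not the authors' method and not the normative H.264 derivation. It assumes that temporal DIRECT scales the co-located motion vector by the ratio of temporal distances, and that a spatial variant takes the component-wise median of three neighbouring motion vectors. The function names, the `tb`/`td` distance parameters, and the three-neighbour input are illustrative assumptions; the standard additionally uses fixed-point scaling, reference index selection, and other checks that are omitted here.

```python
from fractions import Fraction


def temporal_direct(mv_col, tb, td):
    """Idealized temporal DIRECT (hypothetical helper, not normative).

    mv_col: (x, y) motion vector of the co-located block.
    tb: temporal distance from the current picture to the list-0 reference.
    td: temporal distance between the two reference pictures (non-zero).
    """
    scale = Fraction(tb, td)
    # Forward vector: co-located vector scaled by the distance ratio.
    mv_l0 = tuple(round(scale * c) for c in mv_col)
    # Backward vector: forward vector minus the co-located vector.
    mv_l1 = tuple(f - c for f, c in zip(mv_l0, mv_col))
    return mv_l0, mv_l1


def spatial_direct(neighbor_mvs):
    """Simplified spatial prediction (hypothetical helper).

    neighbor_mvs: exactly three (x, y) vectors from adjacent blocks.
    Returns the component-wise median, which needs no data from
    previously decoded pictures.
    """
    xs = sorted(mv[0] for mv in neighbor_mvs)
    ys = sorted(mv[1] for mv in neighbor_mvs)
    return (xs[1], ys[1])


if __name__ == "__main__":
    print(temporal_direct((8, -4), tb=1, td=2))
    print(spatial_direct([(2, 0), (4, -2), (3, 5)]))
```

The contrast in inputs mirrors the abstract's point: the temporal helper needs the co-located vector and picture distances from earlier pictures, while the spatial helper only needs vectors from the current picture.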
Pages: 119-126
Page count: 8