Rate-distortion optimized mode selection for very low bit rate video coding and the emerging H.263 standard

被引:169
作者
Wiegand, T
Lightstone, M
Mukherjee, D
Campbell, TG
Mitra, SK
机构
[1] CHROMAT RES INC, MT VIEW, CA 94043 USA
[2] UNIV CALIF SANTA BARBARA, DEPT ELECT & COMP ENGN, SANTA BARBARA, CA 93106 USA
[3] COMPRESS LABS INC, SAN JOSE, CA 95134 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/76.488825
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper addresses the problem of encoder optimization in a macroblock-based multimode video compression system, An efficient solution is proposed in which, for a given image region, the optimum combination of macroblock modes and the associated mode parameters are jointly selected so as to minimize the over;dl distortion for a given bit-rate budget, Conditions for optimizing the encoder operation are derived within a rate-constrained product code framework using a Lagrangian formulation, The instantaneous rate of the encoder is controlled by a single Lagrange multiplier that makes the method amenable to mobile wireless networks with time-varying capacity, When rate and distortion dependencies are introduced between adjacent blocks (as is the case when the motion vectors are differentially encoded and/or overlapped block motion compensation is employed), the ensuing encoder complexity is surmounted using dynamic programming. Due to the generic nature of the algorithm, it can be successfully applied to the problem of encoder control in numerous video coding standards, including H.261, MPEG-1, and MPEG-2, Moreover, the strategy is especially relevant for very low bit rate coding over wireless communication channels where the low dimensionality of the images associated with these bit rates makes real-time implementation very feasible, Accordingly, in this paper, the method is successfully applied to the emerging H.263 video coding standard with excellent results at rates as low as 8.0 Kb per second. Direct comparisons with the H.263 test model, TMN5, demonstrate that gains in peak signal-to-noise ratios (PSNR) are achievable over a wide range of rates.
引用
收藏
页码:182 / 190
页数:9
相关论文
共 21 条
[1]  
[Anonymous], ADAPTIVE FILTER THEO, DOI DOI 10.1109/ISCAS.2017.8050871
[2]   A STABLE FEEDBACK-CONTROL OF THE BUFFER STATE USING THE CONTROLLED LAGRANGE MULTIPLIER METHOD [J].
CHOI, JH ;
PARK, DC .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 1994, 3 (05) :546-558
[3]  
Dennis, 1996, NUMERICAL METHODS UN
[5]   VITERBI ALGORITHM [J].
FORNEY, GD .
PROCEEDINGS OF THE IEEE, 1973, 61 (03) :268-278
[6]  
*ISOIEC, 1993, 111722 ISOIEC 2
[7]  
*ITUT, 1993, H261 ITUT
[8]  
*ITUT, 1995, H263 ITUT
[9]  
*ITUT, 1994, H262SOIEC138182 ITUT
[10]  
LEE J, 1994, P ICIP, V2, P962