A CU-Level Rate and Distortion Estimation Scheme for RDO of Hardware-Friendly HEVC Encoders Using Low-Complexity Integer DCTs

被引:13
作者
Lee, Bumshik [1 ]
Kim, Munchurl [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Sch Elect Engn, Daejeon 34141, South Korea
关键词
HEVC; Rate-distortion optimization; integer DCT; Hadamard transform; rate estimation; distortion estimation; MODE DECISION; MULTIPLIERLESS APPROXIMATION; HADAMARD-TRANSFORM; COST ESTIMATION; SIZE DECISION; ALGORITHM;
D O I
10.1109/TIP.2016.2579559
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a low complexity coding unit (CU)-level rate and distortion estimation scheme is proposed for High Efficiency Video Coding (HEVC) hardware-friendly implementation where a Walsh-Hadamard transform (WHT)-based low-complexity integer discrete cosine transform (DCT) is employed for distortion estimation. Since HEVC adopts quadtree structures of coding blocks with hierarchical coding depths, it becomes more difficult to estimate accurate rate and distortion values without actually performing transform, quantization, inverse transform, de-quantization, and entropy coding. Furthermore, DCT for rate-distortion optimization (RDO) is computationally high, because it requires a number of multiplication and addition operations for various transform block sizes of 4-, 8-, 16-, and 32-orders and requires recursive computations to decide the optimal depths of CU or transform unit. Therefore, full RDO-based encoding is highly complex, especially for low-power implementation of HEVC encoders. In this paper, a rate and distortion estimation scheme is proposed in CU levels based on a low-complexity integer DCT that can be computed in terms of WHT whose coefficients are produced in prediction stages. For rate and distortion estimation in CU levels, two orthogonal matrices of 4 x 4 and 8 x 8, which are applied to WHT that are newly designed in a butterfly structure only with addition and shift operations. By applying the integer DCT based on the WHT and newly designed transforms in each CU block, the texture rate can precisely be estimated after quantization using the number of non-zero quantized coefficients and the distortion can also be precisely estimated in transform domain without de-quantization and inverse transform required. In addition, a non-texture rate estimation is proposed by using a pseudoentropy code to obtain accurate total rate estimates. The proposed rate and the distortion estimation scheme can effectively be used for HW-friendly implementation of HEVC encoders with 9.8% loss over HEVC full RDO, which much less than 20.3% and 30.2% loss of a conventional approach and Hadamard-only scheme, respectively.
引用
收藏
页码:3787 / 3800
页数:14
相关论文
共 37 条
[1]   A Novel Fast CU Encoding Scheme Based on Spatiotemporal Encoding Parameters for HEVC Inter Coding [J].
Ahn, Sangsoo ;
Lee, Bumshik ;
Kim, Munchurl .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2015, 25 (03) :422-435
[2]  
[Anonymous], 1975, ORTHOGONAL TRANSFORM
[3]  
Bjontegaard G, 2001, 13 VCEG M AUST TX US
[4]  
Bjontegaard G., 2002, 3 JVT M FAIRF VA US
[5]   DEVELOPMENT OF INTEGER COSINE TRANSFORMS BY THE PRINCIPLE OF DYADIC SYMMETRY [J].
CHAM, WK .
IEE PROCEEDINGS-I COMMUNICATIONS SPEECH AND VISION, 1989, 136 (04) :276-282
[6]  
CHEN WH, 1977, IEEE T COMMUN, V25, P1004, DOI 10.1109/TCOM.1977.1093941
[7]   Multiplierless approximation of transforms with adder constraint [J].
Chen, YJ ;
Oraintara, S ;
Tran, TD ;
Amaratunga, K ;
Nguyen, TQ .
IEEE SIGNAL PROCESSING LETTERS, 2002, 9 (11) :344-347
[8]   Video compression using integer DCT [J].
Chen, YJ ;
Oraintara, S ;
Nguyen, T .
2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL II, PROCEEDINGS, 2000, :844-847
[9]   Fast CU Splitting and Pruning for Suboptimal CU Partitioning in HEVC Intra Coding [J].
Cho, Seunghyun ;
Kim, Munchurl .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2013, 23 (09) :1555-1564
[10]   A unified rate-distortion analysis framework for transform coding [J].
He, ZH ;
Mitra, SK .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2001, 11 (12) :1221-1236