2-D Order-16 Integer Transforms for HD Video Coding

被引:30
作者
Dong, Jie [1 ]
Ngan, King Ngi [1 ]
Fong, Chi-Keung [1 ]
Cham, Wai-Kuen [1 ]
机构
[1] Chinese Univ Hong Kong, Dept Elect Engn, Hong Kong, Hong Kong, Peoples R China
关键词
AVS; H.264; HDTV; ICT; order-16; transform; VBT;
D O I
10.1109/TCSVT.2009.2026792
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, the spatial properties of high-definition (HD) videos are investigated based on a large set of HD video sequences. Compared with lower resolution videos, the prediction errors of HD videos have higher correlation. Hence, we propose using 2-D order-16 transforms for HD video coding, which are expected to be more efficient to exploit this spatial property, and specifically propose two types of 2-D order-16 integer transforms, nonorthogonal integer cosine transform (ICT) and modified ICT. The former resembles the discrete cosine transform (DCT) and is approximately orthogonal, of which the transform error introduced by the nonorthogonality is proven to be negligible. The latter modifies the structure of the DCT matrix and is inherently orthogonal, no matter what the values of the matrix elements are. Both types allow selecting matrix elements more freely by releasing the orthogonality constraint and can provide comparable performance with that of the DCT. Each type is integrated into the audio and video coding standard (AVS) Enhanced Profile (EP) and the H.264 High Profile (HP), respectively, and used adaptively as an alternative to the 2-D order-8 transform according to local activities. At the same time, many efforts have been devoted to further reducing the complexity of the 2-D order-16 transforms and specially for the modified ICT, a fast algorithm is developed and extended to a universal approach. Experimental results show that 2-D order-16 transforms provide significant performance improvement for both AVS Enhanced Profile and H.264 High Profile, which means they can be efficient coding tools especially for HD video coding.
引用
收藏
页码:1462 / 1474
页数:13
相关论文
共 27 条
  • [1] [Anonymous], 2001, SG16Q6 ITUT VCEG
  • [2] [Anonymous], P INT C AC SPEECH SI
  • [3] AVS, 2006, Patent No. 2006200902
  • [4] *AVS VID GROUP, 2006, AVSN1318 VID GROUP
  • [5] DEVELOPMENT OF INTEGER COSINE TRANSFORMS BY THE PRINCIPLE OF DYADIC SYMMETRY
    CHAM, WK
    [J]. IEE PROCEEDINGS-I COMMUNICATIONS SPEECH AND VISION, 1989, 136 (04): : 276 - 282
  • [6] AN ORDER-16 INTEGER COSINE TRANSFORM
    CHAM, WK
    CHAN, YT
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1991, 39 (05) : 1205 - 1208
  • [7] A new approach to compatible Adaptive Block-size Transforms
    Dong, J
    Lou, J
    Zhang, CX
    Yu, L
    [J]. Visual Communications and Image Processing 2005, Pts 1-4, 2005, 5960 : 38 - 47
  • [8] COMPARISON OF NTH-ORDER DPCM ENCODER WITH LINEAR TRANSFORMATIONS AND BLOCK QUANTIZATION TECHNIQUES
    HABIBI, A
    [J]. IEEE TRANSACTIONS ON COMMUNICATION TECHNOLOGY, 1971, CO19 (06): : 948 - &
  • [9] *ITU T, 2003, H264 ITUT
  • [10] *ITU T, 1998, H262 ITUT