Visual Perception Based Intra Coding Algorithm for H.266/VVC

被引:4
作者
Tsai, Yu-Hsiang [1 ]
Lu, Chen-Rung [1 ]
Chen, Mei-Juan [1 ]
Hsieh, Meng-Chun [1 ]
Yang, Chieh-Ming [1 ]
Yeh, Chia-Hung [2 ,3 ]
机构
[1] Natl Dong Hwa Univ, Dept Elect Engn, Hualien, Taiwan
[2] Natl Taiwan Normal Univ, Dept Elect Engn, Taipei 106308, Taiwan
[3] Natl Sun Yat sen Univ, Dept Elect Engn, Kaohsiung, Taiwan
关键词
H.266; versatile video coding; multi-type tree; coding tool; visual perception; machine learning; intra coding; CU PARTITION;
D O I
10.3390/electronics12092079
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The latest international video coding standard, H.266/Versatile Video Coding (VVC), supports high-definition videos, with resolutions from 4 K to 8 K or even larger. It offers a higher compression ratio than its predecessor, H.265/High Efficiency Video Coding (HEVC). In addition to the quadtree partition structure of H.265/HEVC, the nested multi-type tree (MTT) structure of H.266/VVC provides more diverse splits through binary and ternary trees. It also includes many new coding tools, which tremendously increases the encoding complexity. This paper proposes a fast intra coding algorithm for H.266/VVC based on visual perception analysis. The algorithm applies the factor of average background luminance for just-noticeable-distortion to identify the visually distinguishable (VD) pixels within a coding unit (CU). We propose calculating the variances of the numbers of VD pixels in various MTT splits of a CU. Intra sub-partitions and matrix weighted intra prediction are turned off conditionally based on the variance of the four variances for MTT splits and a thresholding criterion. The fast horizontal/vertical splitting decisions for binary and ternary trees are proposed by utilizing random forest classifiers of machine learning techniques, which use the information of VD pixels and the quantization parameter. Experimental results show that the proposed algorithm achieves around 47.26% encoding time reduction with a Bjontegaard Delta Bitrate (BDBR) of 1.535% on average under the All Intra configuration. Overall, this algorithm can significantly speed up H.266/VVC intra coding and outperform previous studies.
引用
收藏
页数:13
相关论文
共 38 条
  • [1] Bjontegaard G., 2001, P VCEG M ITU T SG16, P2
  • [2] Bossen F., 2019, JVETN010
  • [3] Bradski G, 2000, DR DOBBS J, V25, P120
  • [4] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [5] Bross B., 2018, document JVET-L0283
  • [6] Overview of the Versatile Video Coding (VVC) Standard and its Applications
    Bross, Benjamin
    Wang, Ye-Kui
    Ye, Yan
    Liu, Shan
    Chen, Jianle
    Sullivan, Gary J.
    Ohm, Jens-Rainer
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (10) : 3736 - 3764
  • [7] Developments in International Video Coding Standardization After AVC, With an Overview of Versatile Video Coding (VVC)
    Bross, Benjamin
    Chen, Jianle
    Ohm, Jens-Rainer
    Sullivan, Gary J.
    Wang, Ye-Kui
    [J]. PROCEEDINGS OF THE IEEE, 2021, 109 (09) : 1463 - 1493
  • [8] Chen J, 2020, JVET S2002 TELECONFE
  • [9] Efficient Partition Decision Based on Visual Perception and Machine Learning for H.266/Versatile Video Coding
    Chen, Mei-Juan
    Lee, Cheng-An
    Tsai, Yu-Hsiang
    Yang, Chieh-Ming
    Yeh, Chia-Hung
    Kau, Lih-Jen
    Chang, Chuan-Yu
    [J]. IEEE ACCESS, 2022, 10 : 42127 - 42136
  • [10] A perceptually tuned subband image coder based on the measure of just-noticeable-distortion profile
    Chou, CH
    Li, YC
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1995, 5 (06) : 467 - 476