Efficient Partition Decision Based on Visual Perception and Machine Learning for H.266/Versatile Video Coding

被引:14
作者
Chen, Mei-Juan [1 ]
Lee, Cheng-An [1 ]
Tsai, Yu-Hsiang [1 ]
Yang, Chieh-Ming [1 ]
Yeh, Chia-Hung [2 ,3 ]
Kau, Lih-Jen [4 ]
Chang, Chuan-Yu [5 ]
机构
[1] Natl Dong Hwa Univ, Dept Elect Engn, Hualien 974301, Taiwan
[2] Natl Taiwan Normal Univ, Dept Elect Engn, Taipei 106010, Taiwan
[3] Natl Sun Yat Sen Univ, Dept Elect Engn, Kaohsiung 804201, Taiwan
[4] Natl Taipei Univ Technol, Dept Elect Engn, Taipei 106344, Taiwan
[5] Natl Yunlin Univ Sci & Technol, Dept Comp Sci & Informat Engn, Touliu 640301, Yunlin, Taiwan
关键词
Encoding; Random forests; Video coding; Streaming media; Copper; Visual perception; Licenses; H266; versatile video coding; intra coding; visual perception; just noticeable difference; machine learning; random forest; CU; PREDICTION; STRATEGY; VVC;
D O I
10.1109/ACCESS.2022.3168155
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
H.266/Versatile Video Coding (VVC) is the latest international video coding standard to encode ultra-high-definition video effectively. The quadtree with nested multi-type tree (QT-MTT) structure provides various sizes of coding tree partitioning and allows the nested binary tree (BT) split and ternary tree (TT) split at each QT level. Furthermore, numerous advanced coding tools are equipped in the H.266/VVC encoder. However, the encoding time increases tremendously. Previous researches regarding the fast coding algorithm of H.266/VVC seldom mention perceptual redundancy. This paper utilizes the human vision model of just noticeable difference to extract the visually distinguishable pixels that may affect the visual perception. We observe that the distributions acquired by the horizontal and vertical projections of visually distinguishable pixels within the coding unit are related to their corresponding MTT splitting modes. Therefore, the distributions representing the perceptual information of human vision are used to be the input features of machine learning. Fast MTT decision determined by the random forest models of machine learning is proposed to quickly select the partition for intra coding. Experimental results demonstrate that the proposed method can effectively accelerate intra coding process while maintaining good bitrate and video quality based on the properties of the visual perception. The proposed algorithm provides better performance than the previous work.
引用
收藏
页码:42127 / 42136
页数:10
相关论文
共 35 条
  • [1] Tunable VVC Frame Partitioning Based on Lightweight Machine Learning
    Amestoy, Thomas
    Mercat, Alexandre
    Hamidouche, Wassim
    Menard, Daniel
    Bergeron, Cyril
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (29) : 1313 - 1328
  • [2] Bjontegaard G., 2001, P 13 VCEG M AUST TX
  • [3] Bossen F., 2020, JVET-T2010
  • [4] Breiman L., 2001, Machine Learning, V45, P5
  • [5] Breiman Leo, 2017, Classi~cation and regression trees
  • [6] Bross B, 2020, JVETQ2001
  • [7] Developments in International Video Coding Standardization After AVC, With an Overview of Versatile Video Coding (VVC)
    Bross, Benjamin
    Chen, Jianle
    Ohm, Jens-Rainer
    Sullivan, Gary J.
    Wang, Ye-Kui
    [J]. PROCEEDINGS OF THE IEEE, 2021, 109 (09) : 1463 - 1493
  • [8] Fast 3D-HEVC Depth Intra Coding Based on Boundary Continuity
    Chen, Mei-Juan
    Lin, Jie-Ru
    Hsu, Yu-Chih
    Ciou, Yi-Sheng
    Yeh, Chia-Hung
    Lin, Min-Hui
    Kau, Lih-Jen
    Chang, Chuan-Yu
    [J]. IEEE ACCESS, 2021, 9 : 79588 - 79599
  • [9] Efficient CU and PU Decision Based on Motion Information for Interprediction of HEVC
    Chen, Mei-Juan
    Wu, Yu-De
    Yeh, Chia-Hung
    Lin, Kao-Min
    Lin, Shinfeng D.
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2018, 14 (11) : 4735 - 4745
  • [10] Error concealment of lost motion vectors with overlapped motion compensation
    Chen, MJ
    Chen, LG
    Weng, RM
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1997, 7 (03) : 560 - 563