CNN-LNN Based Fast CU Partitioning Decision for VVC 3D Video Depth Map Intra Coding

被引:2
作者
Wang, Fengqin [1 ]
Wang, Zhiying [1 ]
Zhang, Qiuwen [1 ]
机构
[1] Zhengzhou Univ Light Ind, Coll Comp & Commun Engn, Zhengzhou 450002, Peoples R China
基金
中国国家自然科学基金;
关键词
VVC 3D video; depth map coding; CU early prediction; CNN-LNN;
D O I
10.1109/ACCESS.2023.3305266
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Currently, the coding efficacy of the cutting-edge video coding standard H.266/VVC surpasses that of 3D-HEVC (3D-High Efficiency Video Coding), but the existing VVC (Versatile Video Coding) low-complexity coding algorithm is mainly optimized for 2D video coding and cannot fully utilize the characteristics of the depth map itself. Based on this, we propose a fast decision algorithm employing the CNN (Convolutional Neural Network)-LNN (Lightweight Neural Network) model to diminish the intricacy of depth map intra coding in VVC 3D video. The algorithm treats the CU partitioning process in depth map coding as a two-stage process, first adding a non-local block and spatial pyramid pooling to the CNN model, enabling the proposed CNN model to skip the flat regions in the depth map and perform adaptive partitioning prediction of CUs in the edge regions; then, the LNN model is used to make early decision on TT (Ternary Tree) partition for CUs that need to be partitioned, and skip decisions for CUs that do not need to be partitioned by TT, so as to reduce some unnecessary RDO calculations. Experimental results illustrate that the algorithm achieves a notable reduction in encoding time amounting to 43.23% on average, with a negligible impact on the increase of BDBR.
引用
收藏
页码:87420 / 87429
页数:10
相关论文
共 36 条
  • [1] Improved intra-subpartition coding mode for versatile video coding
    Akbulut, Orhan
    Konyar, Mehmet Zeki
    [J]. SIGNAL IMAGE AND VIDEO PROCESSING, 2022, 16 (05) : 1363 - 1368
  • [2] Fast intra-coding unit partition decision in H.266/FVC based on deep learning
    Amna, Maraoui
    Imen, Werda
    Ezahra, Sayadi Fatma
    Mohamed, Atri
    [J]. JOURNAL OF REAL-TIME IMAGE PROCESSING, 2020, 17 (06) : 1971 - 1981
  • [3] Bakkouri S, 2020, 2020 INT C INT SYST, DOI [10.1109/iscv49265.2020.9204037, DOI 10.1109/ISCV49265.2020.9204037]
  • [4] MPEG Immersive Video Coding Standard
    Boyce, Jill M.
    Dore, Renaud
    Dziembowski, Adrian
    Fleureau, Julien
    Jung, Joel
    Kroon, Bart
    Salahieh, Basel
    Vadakital, Vinod Kumar Malamal
    Yu, Lu
    [J]. PROCEEDINGS OF THE IEEE, 2021, 109 (09) : 1521 - 1536
  • [5] Fast 3D-HEVC Depth Intra Coding Based on Boundary Continuity
    Chen, Mei-Juan
    Lin, Jie-Ru
    Hsu, Yu-Chih
    Ciou, Yi-Sheng
    Yeh, Chia-Hung
    Lin, Min-Hui
    Kau, Lih-Jen
    Chang, Chuan-Yu
    [J]. IEEE ACCESS, 2021, 9 : 79588 - 79599
  • [6] Subjective and Objective Quality Assessment of Compressed 4K UHD Videos for Immersive Experience
    Cheon, Manri
    Lee, Jong-Seok
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (07) : 1467 - 1480
  • [7] Fast intra mode decision and fast CU size decision for depth video coding in 3D-HEVC
    Chiang, Jui-Chiu
    Peng, Kuan-Kai
    Wu, Chao-Chun
    Deng, Chih-You
    Lie, Wen-Nung
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2019, 71 : 13 - 23
  • [8] A Computation Complexity Reduction of the Size Decision Algorithm in 3D-HEVC Depth Map Intracoding
    Hamout, Hamza
    Elyousfi, Abderrahmane
    [J]. ADVANCES IN MULTIMEDIA, 2022, 2022
  • [9] Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (09) : 1904 - 1916
  • [10] A VVC Proposal With Quaternary Tree Plus Binary-Ternary Tree Coding Block Structure and Advanced Coding Techniques
    Huang, Yu-Wen
    Hsu, Chih-Wei
    Chen, Ching-Yeh
    Chuang, Tzu-Der
    Hsiang, Shih-Ta
    Chen, Chun-Chia
    Chiang, Man-Shu
    Lai, Chen-Yen
    Tsai, Chia-Ming
    Su, Yu-Chi
    Lin, Zhi-Yi
    Hsiao, Yu-Ling
    Chubach, Olena
    Lin, Yu-Cheng
    Lei, Shaw-Min
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (05) : 1311 - 1325