Low Complexity Coding Unit Decision for Video-Based Point Cloud Compression

被引:5
作者
Gao, Wei [1 ,2 ]
Yuan, Hang [1 ,2 ]
Li, Ge [3 ]
Li, Zhu [4 ]
Yuan, Hui [5 ]
机构
[1] Peking Univ, Sch Elect & Comp Engn, Shenzhen 518055, Peoples R China
[2] Peng Cheng Lab, Shenzhen 518066, Peoples R China
[3] Peking Univ, Sch Elect & Engn, Shenzhen 518055, Peoples R China
[4] Univ Missouri Kansas City, Dept Comp Sci & Elect Engn, Kansas City, MO 64110 USA
[5] Shandong Univ, Sch Control Sci & Engn, Jinan 250100, Peoples R China
关键词
V-PCC; CU partition; machine learning; cross-projection information; rate-distortion-oriented; RATE-DISTORTION OPTIMIZATION; CU SIZE DECISION; PARTITION; INTRAMODE; MPEG;
D O I
10.1109/TIP.2023.3337637
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With growing demand for point cloud coding, Video-based Point Cloud Compression (V-PCC) is released for dynamic point clouds, relying on mature 2D video coding techniques. However, the huge computational complexity of 2D video codec is inherited by V-PCC, thereby resulting in a notably time-consuming encoding process for the projection videos. To accelerate the compression, this paper proposes a low complexity coding unit decision algorithm for V-PCC intra coding. First, the 2D sequences (occupancy, geometry, and attribute sequences) are projected from same 3D point could frames in V-PCC. By exploring the strong correlations among them, the cross-projection information is creatively proposed for improving the predication performance of CU partition. Second, considering the disparate coding losses generated by incorrect partitioning decisions of different CUs, we develop a rate-distortion-oriented learning approach aimed at increasing the decision accuracy of the CUs, severely affecting coding performance. Third, to accommodate the particular coding architecture of V-PCC intra configuration, we further devise an overall framework, including targeted feature extraction and partitioning decision for intra and inter coding of geometry and attribute sequences. The final experimental results strongly demonstrate the effectiveness of our proposed algorithm. The time consumption of total projection sequence compression can be reduced by 57.80%, while the coding losses on Geom.BD-TotalRate (D1 and D2) and Attr.BD-TotalRate Luma component are only 0.08%, 0.33%, and 0.16%, respectively, which can be negligible. To the best of our knowledge, the proposed algorithm achieves state-of-the-art performance for accelerating the projection sequence compression in V-PCC All-Intra configuration.
引用
收藏
页码:149 / 162
页数:14
相关论文
共 45 条
  • [1] Dynamic Point Cloud Texture Video Compression using the Edge Position Difference Oriented Motion Model
    Ahmmed, Ashek
    Paul, Manoranjan
    Pickering, Mark
    [J]. 2021 DATA COMPRESSION CONFERENCE (DCC 2021), 2021, : 335 - 335
  • [2] Video-Based Point Cloud Compression Artifact Removal
    Akhtar, Anique
    Gao, Wen
    Li, Li
    Li, Zhu
    Jia, Wei
    Liu, Shan
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 2866 - 2876
  • [3] [Anonymous], 2020, Standard ISO/IEC JTC1/SC29/WG11 MPEG2020/N19324
  • [4] [Anonymous], 2019, Software Description of AVS Point Cloud Coding Exploration Model (PCEM v0.1)
  • [5] Bjontegaard G., 2001, SG16VCEGM33 ITU
  • [6] Compression of Sparse and Dense Dynamic Point Clouds-Methods and Standards
    Cao, Chao
    Preda, Marius
    Zakharchenko, Vladyslav
    Jang, Euee S.
    Zaharia, Titus
    [J]. PROCEEDINGS OF THE IEEE, 2021, 109 (09) : 1537 - 1558
  • [7] Chen J, 2016, P IEEE INT C MULT EX, P1
  • [8] XGBoost: A Scalable Tree Boosting System
    Chen, Tianqi
    Guestrin, Carlos
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 785 - 794
  • [9] Fast Mode and Partition Decision Using Machine Learning for Intra-Frame Coding in HEVC Screen Content Coding Extension
    Duanmu, Fanyi
    Ma, Zhan
    Wang, Yao
    [J]. IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2016, 6 (04) : 517 - 531
  • [10] fraunhofer, HEVC Test Model Version 16.20 Screen Content Model Version 8.8 HM-16.20+SCM-8.8