Inter-Frame Compression for Dynamic Point Cloud Geometry Coding

被引:5
作者
Akhtar, Anique [1 ]
Li, Zhu [2 ]
van der Auwera, Geert [1 ]
机构
[1] Qualcomm Technol Inc, San Diego 92121, CA USA
[2] Univ Missouri, Dept Comp Sci & Elect Engn, Kansas City, MO 64110 USA
关键词
Point cloud; compression; PCC; deep learning; neural network;
D O I
10.1109/TIP.2023.3343096
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Efficient point cloud compression is essential for applications like virtual and mixed reality, autonomous driving, and cultural heritage. This paper proposes a deep learning-based inter-frame encoding scheme for dynamic point cloud geometry compression. We propose a lossy geometry compression scheme that predicts the latent representation of the current frame using the previous frame by employing a novel feature space inter-prediction network. The proposed network utilizes sparse convolutions with hierarchical multiscale 3D feature learning to encode the current frame using the previous frame. The proposed method introduces a novel predictor network for motion compensation in the feature domain to map the latent representation of the previous frame to the coordinates of the current frame to predict the current frame's feature embedding. The framework transmits the residual of the predicted features and the actual features by compressing them using a learned probabilistic factorized entropy model. At the receiver, the decoder hierarchically reconstructs the current frame by progressively rescaling the feature embedding. The proposed framework is compared to the state-of-the-art Video-based Point Cloud Compression (V-PCC) and Geometry-based Point Cloud Compression (G-PCC) schemes standardized by the Moving Picture Experts Group (MPEG). The proposed method achieves more than 88% BD-Rate (Bjontegaard Delta Rate) reduction against G-PCCv20 Octree, more than 56% BD-Rate savings against G-PCCv20 Trisoup, more than 62% BD-Rate reduction against V-PCC intra-frame encoding mode, and more than 52% BD-Rate savings against V-PCC P-frame-based inter-frame encoding mode using HEVC. These significant performance gains are cross-checked and verified in the MPEG working group.
引用
收藏
页码:584 / 594
页数:11
相关论文
共 41 条
  • [1] [Anonymous], 2021, MPEG-PCC-TMC13: Geometry Based Point Cloud Compression G-PCC
  • [2] [Anonymous], 2022, MPEG-PCC-TMC2: Video Based Point Cloud Compression VPCC
  • [3] Ball‚ J, 2018, Arxiv, DOI arXiv:1802.01436
  • [4] Biswas S., 2020, ADV NEUR IN, V33
  • [5] Bjontegaard G., 2001, VCEG-M33
  • [6] End-to-End Optimized ROI Image Compression
    Cai, Chunlei
    Chen, Li
    Zhang, Xiaoyun
    Gao, Zhiyong
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 3442 - 3457
  • [7] Compression of Sparse and Dense Dynamic Point Clouds-Methods and Standards
    Cao, Chao
    Preda, Marius
    Zakharchenko, Vladyslav
    Jang, Euee S.
    Zaharia, Titus
    [J]. PROCEEDINGS OF THE IEEE, 2021, 109 (09) : 1537 - 1558
  • [8] 4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks
    Choy, Christopher
    Gwak, JunYoung
    Savarese, Silvio
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3070 - 3079
  • [9] LEARNING-BASED LOSSLESS POINT CLOUD GEOMETRY CODING USING SPARSE TENSORS
    Dat Thanh Nguyen
    Kaup, Andre
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 2341 - 2345
  • [10] Motion-Compensated Compression of Dynamic Voxelized Point Clouds
    de Queiroz, Ricardo L.
    Chou, Philip A.
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (08) : 3886 - 3895