Inter-Frame Compression for Dynamic Point Cloud Geometry Coding

Cited by: 5

Authors:

Akhtar, Anique [1]

Li, Zhu [2]

van der Auwera, Geert [1]

Affiliations:

[1] Qualcomm Technol Inc, San Diego, CA 92121 USA

[2] Univ Missouri, Dept Comp Sci & Elect Engn, Kansas City, MO 64110 USA

Source:

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2024 / Vol. 33

Keywords:

Point cloud; compression; PCC; deep learning; neural network;

DOI:

10.1109/TIP.2023.3343096

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Efficient point cloud compression is essential for applications like virtual and mixed reality, autonomous driving, and cultural heritage. This paper proposes a deep learning-based inter-frame encoding scheme for dynamic point cloud geometry compression. We propose a lossy geometry compression scheme that predicts the latent representation of the current frame using the previous frame by employing a novel feature space inter-prediction network. The proposed network utilizes sparse convolutions with hierarchical multiscale 3D feature learning to encode the current frame using the previous frame. The proposed method introduces a novel predictor network for motion compensation in the feature domain to map the latent representation of the previous frame to the coordinates of the current frame to predict the current frame's feature embedding. The framework transmits the residual of the predicted features and the actual features by compressing them using a learned probabilistic factorized entropy model. At the receiver, the decoder hierarchically reconstructs the current frame by progressively rescaling the feature embedding. The proposed framework is compared to the state-of-the-art Video-based Point Cloud Compression (V-PCC) and Geometry-based Point Cloud Compression (G-PCC) schemes standardized by the Moving Picture Experts Group (MPEG). The proposed method achieves more than 88% BD-Rate (Bjontegaard Delta Rate) reduction against G-PCCv20 Octree, more than 56% BD-Rate savings against G-PCCv20 Trisoup, more than 62% BD-Rate reduction against V-PCC intra-frame encoding mode, and more than 52% BD-Rate savings against V-PCC P-frame-based inter-frame encoding mode using HEVC. These significant performance gains are cross-checked and verified in the MPEG working group.
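The residual-coding idea in the abstract (predict the current frame's features from the previous frame, transmit only the quantized difference, and add it back at the decoder) can be illustrated with a minimal sketch. This is not the paper's network: `predict_features` is a hypothetical stand-in that simply reuses the previous frame's features, both frames are assumed to share the same coordinates, and the learned entropy model is omitted entirely.

```python
# Hypothetical sketch of residual coding in a feature (latent) domain.
# Assumption: plain Python lists stand in for sparse feature tensors.

def predict_features(prev_features):
    """Stand-in predictor: reuse the previous frame's features unchanged.
    The paper uses a learned sparse-convolution predictor instead."""
    return list(prev_features)

def encode_residual(curr_features, predicted):
    """Encoder side: quantize the prediction residual to integers (the lossy step)."""
    return [round(c - p) for c, p in zip(curr_features, predicted)]

def decode_features(predicted, quantized_residual):
    """Decoder side: add the received residual back onto the same prediction."""
    return [p + r for p, r in zip(predicted, quantized_residual)]

prev_features = [1.0, 2.5, -0.75, 4.0]   # features of the previous frame
curr_features = [1.2, 3.9, -0.5, 2.1]    # features of the current frame

predicted = predict_features(prev_features)
residual_q = encode_residual(curr_features, predicted)   # what would be entropy-coded
reconstructed = decode_features(predicted, residual_q)

# Rounding bounds the per-element reconstruction error by 0.5.
max_error = max(abs(c - r) for c, r in zip(curr_features, reconstructed))
print(residual_q, reconstructed, max_error)
```

The better the predictor, the closer the residual is to zero, and the fewer bits an entropy coder would need to transmit it; that is the motivation for predicting in the feature domain.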

Citation

Pages: 584-594

Page count: 11

References: 41 in total

[1] [Anonymous], 2021, MPEG-PCC-TMC13: Geometry Based Point Cloud Compression G-PCC
[2] [Anonymous], 2022, MPEG-PCC-TMC2: Video Based Point Cloud Compression VPCC
[3] Ballé J, 2018, Arxiv, DOI arXiv:1802.01436
[4] Biswas S., 2020, ADV NEUR IN, V33
[5] Bjontegaard G., 2001, VCEG-M33
[6] End-to-End Optimized ROI Image Compression
Cai, Chunlei
Chen, Li
Zhang, Xiaoyun
Gao, Zhiyong
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 3442 - 3457
[7] Compression of Sparse and Dense Dynamic Point Clouds-Methods and Standards
Cao, Chao
Preda, Marius
Zakharchenko, Vladyslav
Jang, Euee S.
Zaharia, Titus
[J]. PROCEEDINGS OF THE IEEE, 2021, 109 (09) : 1537 - 1558
[8] 4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks
Choy, Christopher
Gwak, JunYoung
Savarese, Silvio
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3070 - 3079
[9] LEARNING-BASED LOSSLESS POINT CLOUD GEOMETRY CODING USING SPARSE TENSORS
Dat Thanh Nguyen
Kaup, Andre
[J]. 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 2341 - 2345
[10] Motion-Compensated Compression of Dynamic Voxelized Point Clouds
de Queiroz, Ricardo L.
Chou, Philip A.
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (08) : 3886 - 3895
