Real-Time LiDAR Point Cloud Compression Using Bi-Directional Prediction and Range-Adaptive Floating-Point Coding

被引：8

作者：

Zhao, Lili ^{[1
]}

Ma, Kai-Kuang ^{[2
]}

Lin, Xuhu ^{[1
]}

Wang, Wenyi ^{[1
]}

Chen, Jianwen ^{[1
]}

机构：

[1] Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Peoples R China

[2] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore

来源：

IEEE TRANSACTIONS ON BROADCASTING | 2022年 / 68卷 / 03期

关键词：

Point cloud compression; Laser radar; Three-dimensional displays; Encoding; Image coding; Transform coding; Sensors; LiDAR; point cloud compression; frame prediction; floating-point coding; ARCHITECTURE; BROADCAST; BAND;

D O I：

10.1109/TBC.2022.3162406

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Due to the large amount of data involved in the three-dimensional (3D) LiDAR point clouds, point cloud compression (PCC) becomes indispensable to many real-time applications. In autonomous driving of connected vehicles for example, point clouds are constantly acquired along the time and subjected to be compressed. Among the existing PCC methods, very few of them have effectively removed the temporal redundancy inherited in the point clouds. To address this issue, a novel lossy LiDAR PCC system is proposed in this paper, which consists of the inter-frame coding and the intra-frame coding. For the former, a deep-learning approach is proposed to conduct bi-directional frame prediction using an asymmetric residual module and 3D space-time convolutions; the proposed network is called the bi-directional prediction network (BPNet). For the latter, a novel range-adaptive floating-point coding (RAFC) algorithm is proposed for encoding the reference frames and the B-frame prediction residuals in the 32-bit floating-point precision. Since the pixel-value distribution of these two types of data are quite different, various encoding modes are designed for providing adaptive selection. Extensive simulation experiments have been conducted using multiple point cloud datasets, and the results clearly show that our proposed PCC system consistently outperforms the state-of-the-art MPEG G-PCC in terms of data fidelity and localization, while delivering real-time performance.

引用

页码：620 / 635

页数：16

共 56 条

[1] Point Cloud Geometry Prediction Across Spatial Scale using Deep Learning [J].

Akhtar, Anique ;

Gao, Wen ;

Zhang, Xiang ;

Li, Li ;

Li, Zhu ;

Liu, Shan .

2020 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2020, :70-73

[2]

[Anonymous], 2020, JTC1SC29WE11 ISOIEC

[3]

[Anonymous], 2011, Point cloud library

[4]

[Anonymous], 2021, Standard ISO/IEC JTC1/SC29/WG7

[5]

[Anonymous], 2019, FPZIP VERSION 1 3 0

[6]

[Anonymous], 2020, Standard ISO/IEC JTC1/SC29/WG11 N19084

[7] Depth-Aware Video Frame Interpolation [J].

Bao, Wenbo ;

Lai, Wei-Sheng ;

Ma, Chao ;

Zhang, Xiaoyun ;

Gao, Zhiyong ;

Yang, Ming-Hsuan .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3698-3707

[8]

BZip2, BZIP2 COMPR PROGR

[9]

Chandraker Manmohan, 2020, ARXIV201208512

[10]

Chen XYL, 2019, IEEE INT C INT ROBOT, P4530, DOI 10.1109/IROS40897.2019.8967704

← 1 2 3 4 5 6 →