3DAC: Learning Attribute Compression for Point Clouds

Cited by: 11

Authors:

Fang, Guangchi [1]

Hu, Qingyong [2]

Wang, Hanyun [3]

Xu, Yiling [4]

Guo, Yulan [1,5]

Affiliations:

[1] Sun Yat-sen University, Shenzhen Campus, Guangzhou, People's Republic of China

[2] University of Oxford, Oxford, England

[3] Information Engineering University, Zhengzhou, People's Republic of China

[4] Shanghai Jiao Tong University, Shanghai, People's Republic of China

[5] National University of Defense Technology, Changsha, People's Republic of China

Source:

2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2022) | 2022

Funding:

National Natural Science Foundation of China

DOI:

10.1109/CVPR52688.2022.01440

Chinese Library Classification (CLC):

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

We study the problem of attribute compression for large-scale unstructured 3D point clouds. Through an in-depth exploration of the relationships between different encoding steps and different attribute channels, we introduce a deep compression network, termed 3DAC, to explicitly compress the attributes of 3D point clouds and reduce storage usage. Specifically, point cloud attributes such as color and reflectance are first converted to transform coefficients. We then propose a deep entropy model that estimates the probabilities of these coefficients by considering information hidden in the attribute transforms and in previously encoded attributes. Finally, the estimated probabilities are used to further compress the transform coefficients into a final attribute bitstream. Extensive experiments on both indoor and outdoor large-scale open point cloud datasets, including ScanNet and SemanticKITTI, demonstrate the superior compression rates and reconstruction quality of the proposed method.
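The three stages named in the abstract (transform, probability estimation, entropy coding) can be illustrated with a toy sketch. This is not the 3DAC method: a single pairwise Haar-style step stands in for the attribute transform, a frequency table stands in for the deep entropy model, and all function names and sample values are hypothetical. It only shows why better probability estimates shorten the bitstream, using the ideal code length of -log2 p bits per symbol.

```python
import math
from collections import Counter


def haar_step(values):
    """One level of an orthonormal Haar transform over consecutive pairs.

    Assumption: stands in for the attribute transform; the record does not
    say which transform the paper uses.
    """
    s = math.sqrt(2.0)
    pairs = list(zip(values[0::2], values[1::2]))
    low = [(a + b) / s for a, b in pairs]    # smooth (average-like) part
    high = [(a - b) / s for a, b in pairs]   # detail coefficients
    return low, high


def quantize(coeffs, step):
    """Uniform scalar quantization to integer symbols."""
    return [round(c / step) for c in coeffs]


def uniform_model(symbols):
    """Baseline: every symbol in the alphabet is equally likely."""
    alphabet = set(symbols)
    return {s: 1.0 / len(alphabet) for s in alphabet}


def fitted_model(symbols):
    """Stand-in for a learned entropy model: empirical frequencies."""
    counts = Counter(symbols)
    return {s: c / len(symbols) for s, c in counts.items()}


def ideal_bits(symbols, prob):
    """Ideal entropy-coded length: sum of -log2 p(symbol)."""
    return sum(-math.log2(prob[s]) for s in symbols)


# Hypothetical reflectance values for eight neighbouring points.
reflectance = [100, 102, 101, 99, 150, 151, 149, 152]

low, high = haar_step(reflectance)
symbols = quantize(high, step=1.0)

bits_uniform = ideal_bits(symbols, uniform_model(symbols))
bits_fitted = ideal_bits(symbols, fitted_model(symbols))

print(f"symbols: {symbols}")
print(f"uniform model: {bits_uniform:.2f} bits")
print(f"fitted model:  {bits_fitted:.2f} bits")
```

A real codec would pass the estimated probabilities to an arithmetic or range coder, whose output length approaches this ideal bit count; the sketch stops at the estimate.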

Pages: 14799-14808

Page count: 10

