3DAC: Learning Attribute Compression for Point Clouds

被引:11
作者
Fang, Guangchi [1 ]
Hu, Qingyong [2 ]
Wang, Hanyun [3 ]
Xu, Yiling [4 ]
Guo, Yulan [1 ,5 ]
机构
[1] Sun Yat Sen Univ, Shenzhen Campus, Guangzhou, Peoples R China
[2] Univ Oxford, Oxford, England
[3] Informat Engn Univ, Zhengzhou, Peoples R China
[4] Shanghai Jiao Tong Univ, Shanghai, Peoples R China
[5] Natl Univ Def Technol, Changsha, Peoples R China
来源
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) | 2022年
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR52688.2022.01440
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study the problem of attribute compression for large-scale unstructured 3D point clouds. Through an in-depth exploration of the relationships between different encoding steps and different attribute channels, we introduce a deep compression network, termed 3DAC, to explicitly compress the attributes of 3D point clouds and reduce storage usage in this paper. Specifically, the point cloud attributes such as color and reflectance are firstly converted to transform coefficients. We then propose a deep entropy model to model the probabilities of these coefficients by considering information hidden in attribute transforms and previous encoded attributes. Finally, the estimated probabilities are used to further compress these transform coefficients to a final attributes bitstream. Extensive experiments conducted on both indoor and outdoor large-scale open point cloud datasets, including ScanNet and SemanticKITTI, demonstrated the superior compression rates and reconstruction quality of the proposed method.
引用
收藏
页码:14799 / 14808
页数:10
相关论文
共 56 条
  • [1] [Anonymous], 2004, H. 264 and MPEG-4 video compression: video coding for nextgeneration multimedia
  • [2] Bai Yuanchao, 2021, CVPR, P11946
  • [3] Balle J., 2018, P INT C LEARN REPR, P1
  • [4] Balle Johannes, 2017, 5 INT C LEARN REPR
  • [5] SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences
    Behley, Jens
    Garbade, Martin
    Milioto, Andres
    Quenzel, Jan
    Behnke, Sven
    Stachniss, Cyrill
    Gall, Juergen
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9296 - 9306
  • [6] Bellard F., 2015, Bpg Image Format
  • [7] Biswas Sourav, 2020, NEURIPS, V33
  • [8] Brock A., 2016, Generative and discriminative voxel modeling with convolutional neural networks
  • [9] 4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks
    Choy, Christopher
    Gwak, JunYoung
    Savarese, Silvio
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3070 - 3079
  • [10] ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes
    Dai, Angela
    Chang, Angel X.
    Savva, Manolis
    Halber, Maciej
    Funkhouser, Thomas
    Niessner, Matthias
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2432 - 2443