TransPCGC: Point Cloud Geometry Compression Based on Transformers

被引:0
作者
Lu, Shiyu [1 ]
Yang, Huamin [1 ]
Han, Cheng [1 ]
机构
[1] Changchun Univ Sci & Technol, Sch Comp Sci & Technol, Changchun 130022, Peoples R China
基金
国家重点研发计划;
关键词
point cloud geometry compression; transformers; convolution;
D O I
10.3390/a16100484
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Due to the often substantial size of the real-world point cloud data, efficient transmission and storage have become critical concerns. Point cloud compression plays a decisive role in addressing these challenges. Recognizing the importance of capturing global information within point cloud data for effective compression, many existing point cloud compression methods overlook this crucial aspect. To tackle this oversight, we propose an innovative end-to-end point cloud compression method designed to extract both global and local information. Our method includes a novel Transformer module to extract rich features from the point cloud. Utilization of a pooling operation that requires no learnable parameters as a token mixer for computing long-distance dependencies ensures global feature extraction while significantly reducing both computations and parameters. Furthermore, we employ convolutional layers for feature extraction. These layers not only preserve the spatial structure of the point cloud, but also offer the advantage of parameter independence from the input point cloud size, resulting in a substantial reduction in parameters. Our experimental results demonstrate the effectiveness of the proposed TransPCGC network. It achieves average Bjontegaard Delta Rate (BD-Rate) gains of 85.79% and 80.24% compared to Geometry-based Point Cloud Compression (G-PCC). Additionally, in comparison to the Learned-PCGC network, our approach attains an average BD-Rate gain of 18.26% and 13.83%. Moreover, it is accompanied by a 16% reduction in encoding and decoding time, along with a 50% reduction in model size.
引用
收藏
页数:14
相关论文
共 65 条
  • [11] Dynamic Point Cloud Compression Based on Projections, Surface Reconstruction and Video Compression
    Dumic, Emil
    Bjelopera, Anamaria
    Nuechter, Andreas
    [J]. SENSORS, 2022, 22 (01)
  • [12] Priority-based encoding of triangle mesh connectivity for a known geometry
    Dvorak, Jan
    Kacerekova, Zuzana
    Vanccek, Petr
    Vasa, Libor
    [J]. COMPUTER GRAPHICS FORUM, 2023, 42 (01) : 60 - 71
  • [13] Point-cloud based 3D object detection and classification methods for self-driving applications: A survey and taxonomy
    Fernandes, Duarte
    Silva, Antonio
    Nevoa, Rafael
    Simoes, Claudia
    Gonzalez, Dibet
    Guevara, Miguel
    Novais, Paulo
    Monteiro, Joao
    Melo-Pinto, Pedro
    [J]. INFORMATION FUSION, 2021, 68 : 161 - 191
  • [14] LFT-Net: Local Feature Transformer Network for Point Clouds Analysis
    Gao, Yongbin
    Liu, Xuebing
    Li, Jun
    Fang, Zhijun
    Jiang, Xiaoyan
    Huq, Kazi Mohammed Saidul
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (02) : 2158 - 2168
  • [15] github, 2017, Google Draco 3D Data Compression
  • [16] Adaptive Deep Learning-Based Point Cloud Geometry Coding
    Guarda, Andre F. R.
    Rodrigues, Nuno M. M.
    Pereira, Fernando
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2021, 15 (02) : 415 - 430
  • [17] OctSqueeze: Octree-Structured Entropy Model for LiDAR Compression
    Huang, Lila
    Wang, Shenlong
    Wong, Kelvin
    Liu, Jerry
    Urtasun, Raquel
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 1310 - 1320
  • [18] 3D Point Cloud Geometry Compression on Deep Learning
    Huang, Tianxin
    Liu, Yong
    [J]. PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 890 - 898
  • [19] Stratified Transformer for 3D Point Cloud Segmentation
    Lai, Xin
    Liu, Jianhui
    Jiang, Li
    Wang, Liwei
    Zhao, Hengshuang
    Liu, Shu
    Qi, Xiaojuan
    Jia, Jiaya
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8490 - 8499
  • [20] Lee-Thorp J, 2022, NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, P4296