TransPCGC: Point Cloud Geometry Compression Based on Transformers

被引：0

作者：

Lu, Shiyu ^{[1
]}

Yang, Huamin ^{[1
]}

Han, Cheng ^{[1
]}

机构：

[1] Changchun Univ Sci & Technol, Sch Comp Sci & Technol, Changchun 130022, Peoples R China

来源：

ALGORITHMS | 2023年 / 16卷 / 10期

基金：

国家重点研发计划;

关键词：

point cloud geometry compression; transformers; convolution;

D O I：

10.3390/a16100484

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Due to the often substantial size of the real-world point cloud data, efficient transmission and storage have become critical concerns. Point cloud compression plays a decisive role in addressing these challenges. Recognizing the importance of capturing global information within point cloud data for effective compression, many existing point cloud compression methods overlook this crucial aspect. To tackle this oversight, we propose an innovative end-to-end point cloud compression method designed to extract both global and local information. Our method includes a novel Transformer module to extract rich features from the point cloud. Utilization of a pooling operation that requires no learnable parameters as a token mixer for computing long-distance dependencies ensures global feature extraction while significantly reducing both computations and parameters. Furthermore, we employ convolutional layers for feature extraction. These layers not only preserve the spatial structure of the point cloud, but also offer the advantage of parameter independence from the input point cloud size, resulting in a substantial reduction in parameters. Our experimental results demonstrate the effectiveness of the proposed TransPCGC network. It achieves average Bjontegaard Delta Rate (BD-Rate) gains of 85.79% and 80.24% compared to Geometry-based Point Cloud Compression (G-PCC). Additionally, in comparison to the Learned-PCGC network, our approach attains an average BD-Rate gain of 18.26% and 13.83%. Moreover, it is accompanied by a 16% reduction in encoding and decoding time, along with a 50% reduction in model size.

引用

页数：14

共 65 条

[1] [Anonymous], 2021, ISO/IEC 23090-5:2021
[2] [Anonymous], 2023, ISO/IEC 23090-9:2023
[3] 3D Point Cloud Compression with Recurrent Neural Network and Image Compression Methods
Beemelmanns, Till
Tao, Yuchen
Lampe, Bastian
Reiher, Lennart
van Kempen, Raphael
Woopen, Timo
Eckstein, Lutz
[J]. 2022 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2022, : 345 - 351
[4] Biswas S., 2020, ADV NEUR IN, V33
[5] Brown TB, 2020, ADV NEUR IN, V33
[6] D'Eon E., 2017, ISO/IEC JTC1/SC29 Jt, V7, P11
[7] Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis
Dai, Angela
Qi, Charles Ruizhongtai
Niessner, Matthias
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6545 - 6554
[8] LEARNING-BASED LOSSLESS COMPRESSION OF 3D POINT CLOUD GEOMETRY
Dat Thanh Nguyen
Quach, Maurice
Valenzise, Giuseppe
Duhamel, Pierre
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4220 - 4224
[9] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[10] Dosovitskiy A, 2021, Arxiv, DOI [arXiv:2010.11929, DOI 10.48550/ARXIV.2010.11929, 10.48550/arXiv.2010.11929]

← 1 2 3 4 5 6 7 →