Explorations on 3D point clouds coding using transformers and patches

被引:1
作者
Marques, Miguel [1 ]
Cruz, Luis A. da Silva [2 ]
机构
[1] Univ Coimbra, Dept Elect & Comp Engn, Coimbra, Portugal
[2] Inst Telecomunicacoes, Coimbra, Portugal
来源
2022 10TH EUROPEAN WORKSHOP ON VISUAL INFORMATION PROCESSING (EUVIP) | 2022年
关键词
Point cloud; compression; deep learning;
D O I
10.1109/EUVIP53989.2022.9922700
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As 3D point clouds become more common as a representation of 3D visual content, the need to efficiently compress these data grows ever stronger. Research has shown that deep learning based approaches to point cloud coding see an increase in performance when compared with competing encoders like those developed by MPEG. This article examines and evaluates the use of the Transformer architectures and patch-based inputs combined with well developed deep learning static point cloud compression solutions described in the literature. To that end, we propose four new deep learning encoders. The obtained results show an improvement over an octree based encoder proposed by MPEG and the baseline PCC Geo v2 codec in terms of a geometry fidelity metric. The article also presents an ablation study conducted to analyze the impact of several encoder related parameters and structures that can guide future research in deep learning point cloud compression.
引用
收藏
页数:6
相关论文
共 17 条
[1]  
[Anonymous], MPEG POINT CLOUD COM
[2]  
[Anonymous], 2015, TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems
[3]  
[Anonymous], MPEG POINT CLOUD DAT
[4]  
Bjontegaard G., 2001, ITU T SG16 Q 6
[5]  
dEon E., 2017, ISO/ IEC JTC1/ SC29 JointWG11/ WG1 (MPEG/ JPEG) input document WG11M40059 / WG1M74006, 7, 8
[6]  
Dosovitskiy A, 2021, Arxiv, DOI [arXiv:2010.11929, DOI 10.48550/ARXIV.2010.11929]
[7]   Adaptive Deep Learning-Based Point Cloud Geometry Coding [J].
Guarda, Andre F. R. ;
Rodrigues, Nuno M. M. ;
Pereira, Fernando .
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2021, 15 (02) :415-430
[8]   PCT: Point cloud transformer [J].
Guo, Meng-Hao ;
Cai, Jun-Xiong ;
Liu, Zheng-Ning ;
Mu, Tai-Jiang ;
Martin, Ralph R. ;
Hu, Shi-Min .
COMPUTATIONAL VISUAL MEDIA, 2021, 7 (02) :187-199
[9]   Focal Loss for Dense Object Detection [J].
Lin, Tsung-Yi ;
Goyal, Priya ;
Girshick, Ross ;
He, Kaiming ;
Dollar, Piotr .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2999-3007
[10]   Improved Deep Point Cloud Geometry Compression [J].
Quach, Maurice ;
Valenzise, Giuseppe ;
Dufaux, Frederic .
2020 IEEE 22ND INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2020,