Explorations on 3D point clouds coding using transformers and patches

被引：1

作者：

Marques, Miguel ^{[1
]}

Cruz, Luis A. da Silva ^{[2
]}

机构：

[1] Univ Coimbra, Dept Elect & Comp Engn, Coimbra, Portugal

[2] Inst Telecomunicacoes, Coimbra, Portugal

来源：

2022 10TH EUROPEAN WORKSHOP ON VISUAL INFORMATION PROCESSING (EUVIP) | 2022年

关键词：

Point cloud; compression; deep learning;

D O I：

10.1109/EUVIP53989.2022.9922700

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

As 3D point clouds become more common as a representation of 3D visual content, the need to efficiently compress these data grows ever stronger. Research has shown that deep learning based approaches to point cloud coding see an increase in performance when compared with competing encoders like those developed by MPEG. This article examines and evaluates the use of the Transformer architectures and patch-based inputs combined with well developed deep learning static point cloud compression solutions described in the literature. To that end, we propose four new deep learning encoders. The obtained results show an improvement over an octree based encoder proposed by MPEG and the baseline PCC Geo v2 codec in terms of a geometry fidelity metric. The article also presents an ablation study conducted to analyze the impact of several encoder related parameters and structures that can guide future research in deep learning point cloud compression.

引用

页数：6

共 17 条

[1]

[Anonymous], MPEG POINT CLOUD COM

[2]

[Anonymous], 2015, TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems

[3]

[Anonymous], MPEG POINT CLOUD DAT

[4]

Bjontegaard G., 2001, ITU T SG16 Q 6

[5]

dEon E., 2017, ISO/ IEC JTC1/ SC29 JointWG11/ WG1 (MPEG/ JPEG) input document WG11M40059 / WG1M74006, 7, 8

[6]

Dosovitskiy A, 2021, Arxiv, DOI [arXiv:2010.11929, DOI 10.48550/ARXIV.2010.11929]

[7] Adaptive Deep Learning-Based Point Cloud Geometry Coding [J].

Guarda, Andre F. R. ;

Rodrigues, Nuno M. M. ;

Pereira, Fernando .

IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2021, 15 (02) :415-430

[8] PCT: Point cloud transformer [J].

Guo, Meng-Hao ;

Cai, Jun-Xiong ;

Liu, Zheng-Ning ;

Mu, Tai-Jiang ;

Martin, Ralph R. ;

Hu, Shi-Min .

COMPUTATIONAL VISUAL MEDIA, 2021, 7 (02) :187-199

[9] Focal Loss for Dense Object Detection [J].

Lin, Tsung-Yi ;

Goyal, Priya ;

Girshick, Ross ;

He, Kaiming ;

Dollar, Piotr .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2999-3007

[10] Improved Deep Point Cloud Geometry Compression [J].

Quach, Maurice ;

Valenzise, Giuseppe ;

Dufaux, Frederic .

2020 IEEE 22ND INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2020,

← 1 2 →