Graph-Based Compression of Dynamic 3D Point Cloud Sequences

被引：198

作者：

Thanou, Dorina ^{[1
]}

Chou, Philip A. ^{[2
]}

Frossard, Pascal ^{[1
]}

机构：

[1] Ecole Polytech Fed Lausanne, Signal Proc Lab LTS4, CH-1015 Lausanne, Switzerland

[2] Microsoft Res, Redmond, WA 98052 USA

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2016年 / 25卷 / 04期

关键词：

3D sequences; voxels; graph-based features; spectral graph wavelets; motion compensation; MESH COMPRESSION; REAL-TIME; SIGNALS;

D O I：

10.1109/TIP.2016.2529506

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper addresses the problem of compression of 3D point cloud sequences that are characterized by moving 3D positions and color attributes. As temporally successive point cloud frames share some similarities, motion estimation is key to effective compression of these sequences. It, however, remains a challenging problem as the point cloud frames have varying numbers of points without explicit correspondence information. We represent the time-varying geometry of these sequences with a set of graphs, and consider 3D positions and color attributes of the point clouds as signals on the vertices of the graphs. We then cast motion estimation as a feature-matching problem between successive graphs. The motion is estimated on a sparse set of representative vertices using new spectral graph wavelet descriptors. A dense motion field is eventually interpolated by solving a graph-based regularization problem. The estimated motion is finally used for removing the temporal redundancy in the predictive coding of the 3D positions and the color characteristics of the point cloud sequences. Experimental results demonstrate that our method is able to accurately estimate the motion between consecutive frames. Moreover, motion estimation is shown to bring a significant improvement in terms of the overall compression performance of the sequence. To the best of our knowledge, this is the first paper that exploits both the spatial correlation inside each frame (through the graph) and the temporal correlation between the frames (through the motion estimation) to compress the color and the geometry of 3D point cloud sequences in an efficient way.

引用

页码：1765 / 1778

页数：14

共 55 条

[51]

Zhang C, 2014, IEEE IMAGE PROC, P2066, DOI 10.1109/ICIP.2014.7025414

[52] Analyzing the Optimality of Predictive Transform Coding Using Graph-Based Models [J].

Zhang, Cha ;

Florencio, Dinei .

IEEE SIGNAL PROCESSING LETTERS, 2013, 20 (01) :106-109

[53]

Zhou DY, 2004, ADV NEUR IN, V16, P321

[54]

Zhu X., 2003, P 12 INT C MACH LEAR, P912, DOI DOI 10.1109/18.850663

[55]

Zhu XF, 2012, INT CONF ACOUST SPEE, P3921, DOI 10.1109/ICASSP.2012.6288775

← 1 2 3 4 5 6 →