Efficient Projected Frame Padding for Video-Based Point Cloud Compression

被引:39
作者
Li, Li [1 ]
Li, Zhu [2 ]
Liu, Shan [3 ]
Li, Houqiang [1 ]
机构
[1] Univ Sci & Technol China, CAS Key Lab Technol Geospatial Informat Proc & Ap, Hefei 230027, Peoples R China
[2] Univ Missouri Kansas City, Dept Comp Sci & Elect Engn, Kansas City, MO 64110 USA
[3] Tencent Amer, 661 Bryant St, Palo Alto, CA 94301 USA
关键词
Three-dimensional displays; Geometry; Software; Two dimensional displays; Color; Software algorithms; Transform coding; Frame padding; high efficiency video coding; occupancy map; point cloud compression; video-based point cloud compression;
D O I
10.1109/TMM.2020.3016894
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The state-of-the-art 2D-based dynamic point cloud (DPC) compression algorithm is the video-based point cloud compression (V-PCC) developed by the Moving Pictures Experts Group (MPEG). It first projects the DPC patch by patch from 3D to 2D and organizes the projected patches into a video. The video is then efficiently compressed by High Efficiency Video Coding. However, there are many unoccupied pixels that may have a significant influence on the coding efficiency. These unoccupied pixels are currently padded using either the average of 4-neighbors for the geometry or the push-pull algorithm for the color attribute. While these algorithms are simple, the unoccupied pixels are not handled in the most efficient way. In this paper, we divide the unoccupied pixels into two groups: those that should be occupied and those that should not be occupied according to the occupancy map. We then design padding algorithms tailored to each group to improve the rate-distortion performance of the V-PCC reference software, for both the geometry and the color attribute. The first group is the unoccupied pixels that should be occupied according to the block-based occupancy map. We attempt to pad those pixels using the real points in the original DPC to improve the quality of the reconstructed DPC. Additionally, we attempt to maintain the smoothness of each block so as not to negatively influence the video compression efficiency. The second group is the unoccupied pixels that were correctly identified as unoccupied according to the block-based occupancy map. These pixels are useless for improving the reconstructed quality of the DPC. Therefore, we attempt to minimize the bit cost of these pixels without considering their reconstruction qualities. The bit cost is determined by the residue of these pixels obtained by subtracting the prediction pixels from the original pixels. Therefore, we propose padding the residue using the average residue of the occupied pixels in order to minimize the bit cost. The proposed algorithms are implemented in the V-PCC and the corresponding HEVC reference software. The experimental results show the proposed algorithms can bring significant bitrate savings compared with the V-PCC.
引用
收藏
页码:2806 / 2819
页数:14
相关论文
共 33 条
[1]  
[Anonymous], 2016, document ISO/IEC JTC1/SC29/WG1 N16331
[2]  
[Anonymous], 2019, HM1618SCM87
[3]  
[Anonymous], 2019, Point Cloud Compression Category 2 Reference SoftwareTMC2
[4]   MULTIDIMENSIONAL BINARY SEARCH TREES USED FOR ASSOCIATIVE SEARCHING [J].
BENTLEY, JL .
COMMUNICATIONS OF THE ACM, 1975, 18 (09) :509-517
[5]  
Bjontegaard Gisle, 2001, Calculation of average PSNR differ
[6]  
Bruder G, 2014, 2014 IEEE SYMPOSIUM ON 3D USER INTERFACES (3DUI), P161, DOI 10.1109/3DUI.2014.6798870
[7]  
Budagavi M., 2017, Document ISO/IEC JTC1/SC29/WG11 m41808
[8]  
Champel M.-L., 2017, P SPIE OPT ENG APPL, V10396
[9]   Multi-View 3D Object Detection Network for Autonomous Driving [J].
Chen, Xiaozhi ;
Ma, Huimin ;
Wan, Ji ;
Li, Bo ;
Xia, Tian .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6526-6534
[10]   Motion-Compensated Compression of Dynamic Voxelized Point Clouds [J].
de Queiroz, Ricardo L. ;
Chou, Philip A. .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (08) :3886-3895