Kinect-Like Depth Data Compression

被引:38
作者
Fu, Jingjing [1 ]
Miao, Dan [2 ]
Yu, Weiren [3 ]
Wang, Shiqi [4 ]
Lu, Yan [1 ]
Li, Shipeng [1 ]
机构
[1] Microsoft Res Asia, Media Comp Grp, Beijing, Peoples R China
[2] Univ Sci & Technol China, Dept Elect Engn & Informat Sci, Hefei 230026, Peoples R China
[3] Beihang Univ, Sch Comp Sci & Engn, Beijing, Peoples R China
[4] Peking Univ, Inst Digital Media, Beijing 100871, Peoples R China
关键词
2D+T prediction; denoising; depth volume; Kinect-like depth; lossy compression; padding;
D O I
10.1109/TMM.2013.2247584
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Unlike traditional RGB video, Kinect-like depth is characterized by its large variation range and instability. As a result, traditional video compression algorithms cannot be directly applied to Kinect-like depth compression with respect to coding efficiency. In this paper, we propose a lossy Kinect-like depth compression framework based on the existing codecs, aiming to enhance the coding efficiency while preserving the depth features for further applications. In the proposed framework, the Kinect-like depth is reformed first by divisive normalized bilateral filter (DNBL) to suppress the depth noises caused by disparity normalization, and then block-level depth padding is implemented for invalid depth region compensation in collaboration with mask coding to eliminate the sharp variation caused by depth measurement failures. Before the traditional video coding, the inter-frame correlation of reformed depth is explored by proposed 2D+T prediction, in which depth volume is developed to simulate 3D volume to generate pseudo 3D prediction reference for depth uniqueness detection. The unique depth region, called active region is fed into the video encoder for traditional intra and inter prediction with residual coding, while the inactive region is skipped during depth coding. The experimental results demonstrate that our compression scheme can save 55%-85% in terms of bit cost and reduce coding complexity by 20%-65% in comparison with the traditional video compression algorithms. The visual quality of the 3D reconstruction is also improved after employing our compression scheme.
引用
收藏
页码:1340 / 1352
页数:13
相关论文
共 32 条
[1]  
Advanced Video Coding (AVC), 2004, 1449610 ISOIEC JVT
[2]  
Bovik A.C., 2000, HDB IMAGE VIDEO PROC
[3]   Depth map compression for real-time view-based rendering [J].
Chai, BB ;
Sethuraman, S ;
Sawhney, HS ;
Hatrack, P .
PATTERN RECOGNITION LETTERS, 2004, 25 (07) :755-766
[4]  
Curless B., 1996, Computer Graphics Proceedings. SIGGRAPH '96, P303, DOI 10.1145/237170.237269
[5]   Depth-image-based rendering (DIBR), compression and transmission for a new approach on 3D-TV [J].
Fehn, C .
STEREOSCOPIC DISPLAYS AND VIRTUAL REALITY SYSTEMS XI, 2004, 5291 :93-104
[6]   Real-time 3D shape measurement with digital stripe projection by texas instruments micromirror devices DMD™ [J].
Frankowski, G ;
Chen, M ;
Huth, T .
THREE-DIMENSIONAL IMAGE CAPTURE AND APPLICATIONS III, 2000, 3958 :90-105
[7]  
Fu JJ, 2012, IEEE INT SYMP CIRC S, P512, DOI 10.1109/ISCAS.2012.6272078
[8]  
Generic Coding of Moving Pictures and Associated Audio (MPEG-2), 1995, 11138182 ISOIEC JTC
[9]  
Gokturk S. B., 2004, P CVPR
[10]  
Grewatsch S., 2004, P 49 SPIES ANN M