Random access prediction structures for light field video coding with MV-HEVC

Cited by: 10
Authors
Avramelos, Vasileios [1]
De Praeter, Johan [1]
Van Wallendael, Glenn [1]
Lambert, Peter [1]
Affiliations
[1] Univ Ghent, Dept Elect & Informat Syst, IMEC, IDLab, Technol Pk Zwijnaarde 122, B-9052 Ghent, Belgium
Keywords
Light field video coding; Multi-view video coding; MV-HEVC; Prediction structures; Random access; Virtual reality; Free navigation; MULTIVIEW VIDEO; IMAGE
DOI
10.1007/s11042-019-08605-x
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
Computational imaging and light field technology promise to deliver the six degrees of freedom required for natural scenes in virtual reality. Existing extensions of standardized video coding formats, such as multi-view coding and multi-view plus depth, are currently the most conventional light field video coding solutions. The latest multi-view coding format, a direct extension of the high efficiency video coding (HEVC) standard, is called multi-view HEVC (MV-HEVC). MV-HEVC treats each light field view as a separate video sequence and uses syntax elements similar to those of standard HEVC to exploit redundancies between neighboring views. To achieve this, inter-view and temporal prediction schemes are deployed with the aim of finding the best trade-off between coding performance and reconstruction quality. The number of possible prediction structures is unlimited, and many have been proposed in the literature. Although some of them are efficient in terms of compression ratio, they complicate random access because of their dependencies on previously decoded pixels or frames. Random access is an important feature in video delivery and a crucial requirement in multi-view video coding. In this work, we propose and compare different prediction structures for coding light field video using MV-HEVC, with a focus on both compression efficiency and random accessibility. Experiments on three different short-baseline light field video sequences show the trade-off between bit-rate and distortion, as well as the average number of views/frames that must be decoded to display any frame at any time instant. The findings of this work indicate the most appropriate prediction structure depending on the available bandwidth and the required degree of random access.
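The random-access measure mentioned in the abstract (the average number of views/frames that must be decoded to display a given frame) can be illustrated with a small sketch. This is not the authors' code or one of the structures evaluated in the paper: the dependency-map representation, the function names, and the two toy structures (`chain_structure`, `central_structure`) are assumptions made purely for illustration, restricted to a single time instant.

```python
from statistics import mean


def decode_cost(deps, target):
    """Count the frames that must be decoded to display `target` (itself included).

    `deps` maps each frame, identified as (view, time), to the list of frames
    it is predicted from. This representation is a hypothetical one.
    """
    seen = set()
    stack = [target]
    while stack:
        node = stack.pop()
        if node in seen:
            continue
        seen.add(node)
        stack.extend(deps.get(node, ()))
    return len(seen)


def average_decode_cost(deps):
    """Average decode cost over every frame in the structure."""
    return mean(decode_cost(deps, node) for node in deps)


def chain_structure(num_views):
    """Toy structure: view 0 is intra-coded, view v is predicted from view v-1."""
    return {(v, 0): ([(v - 1, 0)] if v > 0 else []) for v in range(num_views)}


def central_structure(num_views):
    """Toy structure: the central view is intra-coded, all others reference it."""
    c = num_views // 2
    return {(v, 0): ([] if v == c else [(c, 0)]) for v in range(num_views)}


if __name__ == "__main__":
    print("chain:  ", average_decode_cost(chain_structure(5)))
    print("central:", average_decode_cost(central_structure(5)))
```

In this toy setting the chained structure needs more decoded views on average than the centrally referenced one, which mirrors the tension between deep prediction dependencies and random accessibility described above; it says nothing about the bit-rate side of the trade-off.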
Pages: 12847-12867
Number of pages: 21