Viewport-adaptive 360-degree video coding

被引:2
作者
Hu, Qiang [1 ]
Zhou, Jun [2 ]
Zhang, Xiaoyun [2 ]
Shi, Zhiru [1 ]
Gao, Zhiyong [2 ]
机构
[1] ShanghaiTech Univ, Sch Informat Sci & Technol, Shanghai, Peoples R China
[2] Shanghai Jiao Tong Univ, Inst Image Commun & Network Engn, Dept Elect Engn, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
360-degree video; Viewport prediction; Rate-distortion optimization (RDO); Lagrange multiplier; Video coding; RATE CONTROL ALGORITHM; NETWORK;
D O I
10.1007/s11042-019-08390-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
360-degree videos contain an omnidirectional view with ultra-high resolution, which will lead to the bandwidth-hungry issue in virtual reality (VR) applications. However, only a part of a 360-degree video is displayed on the head-mounted displays (HMDs). Thus, we propose a viewport-adaptive 360-degree video coding approach based on a novel viewport prediction strategy. Specifically, we firstly introduce a novel viewport prediction model based on deep 3-dimensional convolutional neural networks. In this model, a video saliency encoder and a trajectory encoder are trained to extract the features of video content and the history view path. With the outputs of the two encoders, a video prior analysis network is trained to adaptively determine the best fusion weight to generate the final feature. Moreover, benefiting from the viewport prediction model, a viewport-adaptive rate-distortion optimization (RDO) method is presented to decrease the bitrate and ensure an immersive experience. In addition, we also consider the scaling factor of the area from rectangular plane to spherical surface. Therefore, the Lagrange multiplier and quantization parameter are adaptively adjusted based on the weight of each coding tree unit. The experiments have demonstrated that the proposed RDO method gains considerably better RD performance than the traditional RDO method.
引用
收藏
页码:12205 / 12226
页数:22
相关论文
共 65 条
  • [1] [Anonymous], P ACM MULT
  • [2] [Anonymous], JVETC0050
  • [3] [Anonymous], IEEE T EMERGING TOPI
  • [4] [Anonymous], 2011, LOW BIT RATE ROI BAS
  • [5] [Anonymous], 2016, Document JVET-D1030
  • [6] [Anonymous], J REAL TIME IMAGE PR
  • [7] [Anonymous], 2018, IEEE Trans Circuits Syst Video Technol, DOI DOI 10.1109/TCSVT.2018.2860797
  • [8] [Anonymous], JVETC0021
  • [9] [Anonymous], 116M39532 MPEG ISOIE
  • [10] [Anonymous], JCTVCL1003