On the Optimal Encoding Ladder of Tiled 360° Videos for Head-Mounted Virtual Reality

被引:17
作者
Fan, Ching-Ling [1 ]
Yen, Shou-Cheng [1 ]
Huang, Chun-Ying [2 ]
Hsu, Cheng-Hsin [1 ]
机构
[1] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu 30010, Taiwan
[2] Natl Chiao Tung Univ, Dept Comp Sci, Hsinchu 30010, Taiwan
关键词
Videos; Encoding; Optimization; Servers; Bandwidth; Streaming media; Bit rate; 360° videos; encoding ladders; encoder configurations; video streaming; adaptive streaming; virtual reality; augmented reality; mixed reality; extended reality; optimization;
D O I
10.1109/TCSVT.2020.3007288
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Dynamic Adaptive Streaming over HTTP (DASH) has been widely used by several popular streaming services, such as YouTube, Netflix, and Facebook. Adopting DASH requires to pre-determine a set of encoding configurations, called encoding ladder, to generate a set of representations stored on the streaming server. These representations are adaptively requested by clients according to their network conditions during streaming sessions. In this article, we aim to solve the optimal laddering problem that determines the optimal encoding ladder to maximize the client viewing quality. In particular, we consider video models, viewing probability, and client distribution to formulate the mathematical problem. We use a divide-and-conquer approach to decompose the problem into two subproblems: (i) per-class optimization for clients with different bandwidths and (ii) global optimization to maximize the overall viewing quality under the storage limit of the streaming server. We propose two algorithms for each of the per-class optimization and global optimization problems. Analytical analysis and real experiments are conducted to evaluate the performance of our proposed algorithms, compared to other state-of-the-art algorithms. Based on the results, we recommend a combination of the proposed algorithms to solve the optimal laddering problem. The evaluation results show the merits of our recommended algorithms, which: (i) outperform the state-of-the-art algorithms by up to 52.17 and 26.35 in Viewport Video Multi-Method Assessment Fusion (V-VMAF) in per-class optimization, (ii) outperform the state-of-the-art algorithms by up to 43.14 in V-VMAF for optimal laddering in global optimization, (iii) achieve good scalability under different storage limits and number of bandwidth classes, and (iv) run faster than the state-of-the-art algorithms.
引用
收藏
页码:1632 / 1647
页数:16
相关论文
共 66 条
[1]   Interactive Omnidirectional Video Delivery: A Bandwidth-Effective Approach [J].
Alface, Patrice Rondao ;
Macq, Jean-Francois ;
Verzijp, Nico .
BELL LABS TECHNICAL JOURNAL, 2012, 16 (04) :135-147
[2]  
[Anonymous], 2018, DOCUMENT DASH IF INT
[3]  
[Anonymous], 2012, 2300912019 ISOIEC
[4]  
[Anonymous], 2014, CONSTRAINED OPTIMIZA
[5]  
[Anonymous], 2017, CISCO VISUAL NETWORK
[6]   A Survey on Bitrate Adaptation Schemes for Streaming Media Over HTTP [J].
Bentaleb, Abdelhak ;
Taani, Bayan ;
Begen, Ali C. ;
Timmerer, Christian ;
Zimmermann, Roger .
IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2019, 21 (01) :562-585
[7]  
Boyd Stephen, 2004, CONVEX OPTIMIZATION
[8]  
Chakareski J., 2018, 2018 IEEE INT C COMM, P1, DOI DOI 10.1109/ICC.2018.8422859
[9]   SSIM-optimal linear image restoration [J].
Channappayya, Sumohana S. ;
Bovik, Alan C. ;
Caramanis, Constantine ;
Heath, Robert W., Jr. .
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, :765-768
[10]  
Cisco Systems, 2020, 2020 GLOB NETW TREND