360SRL: A SEQUENTIAL REINFORCEMENT LEARNING APPROACH FOR ABR TILE-BASED 360 VIDEO STREAMING

被引：24

作者：

Fu, Jun ^{[1
]}

Chen, Xiaoming ^{[1
]}

Zhang, Zhizheng ^{[1
]}

Wu, Shilin ^{[1
]}

Chen, Zhibo ^{[1
]}

机构：

[1] Univ Sci & Technol China, CAS Key Lab Technol Geospatial Informat Proc & Ap, Hefei 230027, Anhui, Peoples R China

来源：

2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME) | 2019年

关键词：

adaptive bitrate decision; tile-based streaming; sequential reinforcement learning; DASH;

D O I：

10.1109/ICME.2019.00058

中图分类号：

TP31 [计算机软件];

学科分类号：

081202 ; 0835 ;

摘要：

Tile-based 360-degree video (360 video) streaming, employed with adaptive bitrate (ABR) algorithms, is a promising approach to offer high video quality of experience (QoE) within limited network bandwidth. Existing ABR algorithms, however, fail to achieve optimal performance in real-world fluctuated network conditions as they heavily rely on unbiased bandwidth predictions. Recently, reinforcement learning (RL) has shown promising potential in generating better ABR algorithms in 2D video streaming. However, unlike existed work in 2D video streaming, directly applying RL in the tile-based 360 video streaming is infeasible due to the resulting exponential decision space. To overcome these limitations, we propose in this paper 360SRL, an improved ABR algorithm employing Sequential RL (360SRL). Firstly, we reduce the decision space of 360SRL from exponential to linear by introducing a sequential ABR decision structure, thus making it feasible to be employed with RL. Secondly, instead of relying on accurate bandwidth predictions, 360SRL learns to make ABR decisions solely through observations of the resulting QoE performance of past decisions. Finally, we compare 360SRL to state-of-the-art ABR algorithms using trace-driven experiments. The experiment results demonstrate that 360SRL outperforms state-of-the-art algorithms with around 12% improvement in average QoE.

引用

页码：290 / 295

页数：6

共 23 条

[1]

[Anonymous], P 8 ACM MULT SYST C

[2]

[Anonymous], ADAPTIVE SYSTEMS GRO

[3]

[Anonymous], 2017, P ACM NOSSDAV 2017

[4]

[Anonymous], 2017, ARXIV170505035

[5]

[Anonymous], 2017, PYTORCH

[6]

Ban Y., 2018, 2018 IEEE INT C MULT

[7]

Ban Yixuan, 2017, VISUAL COMMUN-US, P1

[8] D-DASH: A Deep Q-Learning Framework for DASH Video Streaming [J].

Gadaleta, Matteo ;

Chiariotti, Federico ;

Rossi, Michele ;

Zanella, Andrea .

IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2017, 3 (04) :703-718

[9]

Hosseini M, 2016, IEEE INT SYM MULTIM, P107, DOI [10.1109/ISM.2016.0028, 10.1109/ISM.2016.45]

[10] A Buffer-Based Approach to Rate Adaptation: Evidence from a Large Video Streaming Service [J].

Huang, Te-Yuan ;

Johari, Ramesh ;

McKeown, Nick ;

Trunnell, Matthew ;

Watson, Mark .

ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2014, 44 (04) :187-198

← 1 2 3 →