QARC: Video Quality Aware Rate Control for Real-Time Video Streaming based on Deep Reinforcement Learning

Cited by: 82

Authors:

Huang, Tianchi ^{[1,3]}

Zhang, Rui-Xiao ^{[1]}

Zhou, Chao ^{[2]}

Sun, Lifeng ^{[1]}

Affiliations:

[1] Tsinghua University, Department of Computer Science and Technology, Beijing, People's Republic of China

[2] Beijing Kuaishou Technology Co., Ltd., Beijing, People's Republic of China

[3] Guizhou University, Department of Computer Science and Technology, Guiyang, Guizhou, People's Republic of China

Source:

PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18) | 2018

Funding:

National Natural Science Foundation of China

DOI:

10.1145/3240508.3240545

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods]

Discipline classification code:

081202

Abstract:

Real-time video streaming is now one of the main applications in all network environments. Because throughput fluctuates under varying network conditions, choosing a proper bitrate adaptively has become an emerging and interesting issue. Most proposed rate control methods tackle this problem by aiming for high video bitrates rather than high video quality. However, we observe a trade-off between sending bitrate and video quality, which motivates us to seek a balance between the two. In this paper, we propose QARC (video Quality Aware Rate Control), a rate control algorithm that aims to obtain higher perceptual video quality with a possibly lower sending rate and transmission latency. Starting from scratch, QARC uses a deep reinforcement learning (DRL) algorithm to train a neural network that selects future bitrates based on previously observed network status and past video frames. To overcome the "state explosion problem", we design a neural network that predicts future perceptual video quality as a vector, which replaces the raw picture in the DRL's inputs. We evaluate QARC via trace-driven simulation, where it outperforms existing approaches, improving average video quality by 18%-25% and reducing average latency by 23%-45%. Meanwhile, comparing QARC with an offline optimal high-bitrate method under various network conditions, we find that QARC also yields solid results.

Pages: 1208-1216

Page count: 9
