Throughput Prediction-Enhanced RL for Low-Delay Video Application

被引:2
作者
Liu, Yong [1 ]
Zhang, Chaokun [1 ]
Du, Jingshun [1 ]
Qiu, Tie [1 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin, Peoples R China
来源
2022 18TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING, MSN | 2022年
基金
国家重点研发计划; 中国博士后科学基金;
关键词
low-delay video application; adaptive bitrate; reinforcement learning; smoothness;
D O I
10.1109/MSN57253.2022.00119
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Maximizing user quality of experience (QoE) is the ultimate goal of video players, and adaptive bitrate (ABR) is recognized as one of the most effective solutions. Approaches employing reinforcement learning (RL) have performed well as hybrid ABR algorithms, due to the ability to learn autonomously. However, throughput, which plays a crucial role in low-delay video streaming, is difficult to predict simply in mobile and wireless networks, and the inaccurately predicted throughput can lead to the wrong selection of bitrates. Worse, the general RL approaches are prone to frequent bitrate switching due to bandwidth fluctuation. These obstacles make the RL-based ABR approach unable to truly reflect the user QoE. We propose TP-RL, an application that makes ongoing decisions to maximize user QoE. To realize this, TP-RL adopts three ideas: (i) It takes the RL neural network as the main body of decision-making, which will inherit the advantages of RL and improve on this basis; (ii) Explore Mogrifier LSTM for throughput prediction, and replace the throughput part in the state space of the original RL neural network with a prediction module; (iii) The decided bitrate is further processed to achieve better smoothness when the bandwidth fluctuates. The performance of TP-RL is evaluated in different experimental environments, and experiments show that it can improve
引用
收藏
页码:728 / 735
页数:8
相关论文
共 16 条
[1]   Data-Driven Bandwidth Prediction Models and Automated Model Selection for Low Latency [J].
Bentaleb, Abdelhak ;
Begen, Ali C. ;
Harous, Saad ;
Zimmermann, Roger .
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 :2588-2601
[2]   The Relationship Among Video Quality, Screen Resolution, and Bit Rate [J].
Cermak, G. ;
Pinson, M. ;
Wolf, S. .
IEEE TRANSACTIONS ON BROADCASTING, 2011, 57 (02) :258-262
[3]  
F. C. C. (FCC) Fcc, 2021, US
[4]   A Buffer-Based Approach to Rate Adaptation: Evidence from a Large Video Streaming Service [J].
Huang, Te-Yuan ;
Johari, Ramesh ;
McKeown, Nick ;
Trunnell, Matthew ;
Watson, Mark .
ACM SIGCOMM COMPUTER COMMUNICATION REVIEW, 2014, 44 (04) :187-198
[5]   Probe and Adapt: Rate Adaptation for HTTP Video Streaming At Scale [J].
Li, Zhi ;
Zhu, Xiaoqing ;
Gahm, Joshua ;
Pan, Rong ;
Hu, Hao ;
Begen, Ali C. ;
Oran, David .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2014, 32 (04) :719-733
[6]   Neural Adaptive Video Streaming with Pensieve [J].
Mao, Hongzi ;
Netravali, Ravi ;
Alizadeh, Mohammad .
SIGCOMM '17: PROCEEDINGS OF THE 2017 CONFERENCE OF THE ACM SPECIAL INTEREST GROUP ON DATA COMMUNICATION, 2017, :197-210
[7]  
Melis G., 2020, 8 INT C LEARNING REP
[8]  
sandvine, 2021, US
[9]   HotDASH: Hotspot Aware Adaptive Video Streaming using Deep Reinforcement Learning [J].
Sengupta, Satadal ;
Ganguly, Niloy ;
Chakraborty, Sandip ;
De, Pradipta .
2018 IEEE 26TH INTERNATIONAL CONFERENCE ON NETWORK PROTOCOLS (ICNP), 2018, :165-175
[10]   DTP: Deadline-aware Transport Protocol [J].
Shi, Hang ;
Cui, Yong ;
Qian, Feng ;
Hu, Yuming .
PROCEEDINGS OF THE 2019 ASIA-PACIFIC WORKSHOP ON NETWORKING (APNET '19), 2019, :1-7