Towards Network Dynamics: Adaptive Buffer Management with Deep Reinforcement Learning

被引:0
作者
Zhu, Jing [1 ]
Wang, Dan [1 ]
Qin, Shuxin [1 ]
Tao, Gaofeng [1 ,2 ]
Gui, Hongxin [2 ]
Li, Fang [3 ]
Ou, Liang [4 ]
机构
[1] Purple Mt Labs, Nanjing, Peoples R China
[2] Shandong Future Network Res Inst, Jinan, Peoples R China
[3] China Acad Informat & Commun Technol, Beijing, Peoples R China
[4] China Telecom Res Inst, Guangzhou, Peoples R China
来源
2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022) | 2022年
关键词
Delay jitter-sensitive applications; Quality of; service; Network dynamics; Adaptive buffer management; Deep reinforcement learning; VIDEO;
D O I
10.1109/GLOBECOM48099.2022.10001602
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The prosperity of cloud computing and 5G/B5G is bringing a wide range of delay jitter-sensitive applications (e.g., professional audio/video streaming and industrial automation) to large-scale IP networks. Although various network-side techniques have been proposed to guarantee the quality of service (QoS), the client-side technique by introducing a receive buffer should never be neglected from the applications' perspective. In this paper, we revisit the buffer management problem to address the disadvantages of state-of-the-art studies, which assumed network characteristics known a prior with simplified or inaccurate network models and failed to adapt to network dynamics. Specifically, we propose adaptive buffer management with deep reinforcement learning, i.e., DRL-ABM. We first define a tradeoff value to measure the buffer management performance in terms of the start-up delay, underflow frequency and packet losses. Then we formulate the DRL model and design deep neural networks (DNNs) based on the advantage actor critic (A2C) algorithm. To evaluate the performance of DRL-ABM, we perform extensive simulations. Simulation results show that DRLABM can achieve better buffer management performance, i.e., reducing the tradeoff value by at least 20% when compared with the benchmarks TBM and ABM. Moreover, DRL-ABM reduces packet losses to approximately 0, indicating that a smaller receive buffer is sufficient if managed with DRL-ABM.
引用
收藏
页码:4935 / 4940
页数:6
相关论文
共 16 条
[1]  
Chen M., 2018, INTERNET ENG TASK FO
[2]  
Dua A, 2007, GLOB TELECOMM CONF, P5226
[3]  
F. C. Commission, 2021, RAW DAT MEAS BROADB
[4]   Buffer-Aware Streaming in Small-Scale Wireless Networks: A Deep Reinforcement Learning Approach [J].
Guo, Yashuang ;
Yu, F. Richard ;
An, Jianping ;
Yang, Kai ;
He, Ying ;
Leung, Victor C. M. .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (07) :6891-6902
[5]   Joint routing and scheduling for large-scale deterministic IP networks [J].
Krolikowski, Jonatan ;
Martin, Sebastien ;
Medagliani, Paolo ;
Leguay, Jeremie ;
Chen, Shuang ;
Chang, Xiaodong ;
Geng, Xuesong .
COMPUTER COMMUNICATIONS, 2021, 165 :33-42
[6]   Effect of Delay and Buffering on Jitter-Free Streaming Over Random VBR Channels [J].
Liang, Guanfeng ;
Liang, Ben .
IEEE TRANSACTIONS ON MULTIMEDIA, 2008, 10 (06) :1128-1141
[7]   Balancing interruption frequency and buffering penalties in VBR video streaming [J].
Liang, Guanfeng ;
Liang, Ben .
INFOCOM 2007, VOLS 1-5, 2007, :1406-+
[8]   Highly-Efficient and Automatic Spectrum Inspection Based on AutoEncoder and Semi-Supervised Learning for Anomaly Detection in EONs [J].
Liu, Siqi ;
Kong, Jiawei ;
Pan, Xiaoqin ;
Li, Deyun ;
Zhu, Zuqing .
JOURNAL OF LIGHTWAVE TECHNOLOGY, 2021, 39 (05) :1243-1254
[9]   Impact of Network Dynamics on User's Video Quality: Analytical Framework and QoS Provision [J].
Luan, Tom H. ;
Cai, Lin X. ;
Shen, Xuemin .
IEEE TRANSACTIONS ON MULTIMEDIA, 2010, 12 (01) :64-78
[10]  
MNIH V, 2016, P ICML, V48