Towards Network Dynamics: Adaptive Buffer Management with Deep Reinforcement Learning

被引：0

作者：

Zhu, Jing ^{[1
]}

Wang, Dan ^{[1
]}

Qin, Shuxin ^{[1
]}

Tao, Gaofeng ^{[1
,2
]}

Gui, Hongxin ^{[2
]}

Li, Fang ^{[3
]}

Ou, Liang ^{[4
]}

机构：

[1] Purple Mt Labs, Nanjing, Peoples R China

[2] Shandong Future Network Res Inst, Jinan, Peoples R China

[3] China Acad Informat & Commun Technol, Beijing, Peoples R China

[4] China Telecom Res Inst, Guangzhou, Peoples R China

来源：

2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022) | 2022年

关键词：

Delay jitter-sensitive applications; Quality of; service; Network dynamics; Adaptive buffer management; Deep reinforcement learning; VIDEO;

D O I：

10.1109/GLOBECOM48099.2022.10001602

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The prosperity of cloud computing and 5G/B5G is bringing a wide range of delay jitter-sensitive applications (e.g., professional audio/video streaming and industrial automation) to large-scale IP networks. Although various network-side techniques have been proposed to guarantee the quality of service (QoS), the client-side technique by introducing a receive buffer should never be neglected from the applications' perspective. In this paper, we revisit the buffer management problem to address the disadvantages of state-of-the-art studies, which assumed network characteristics known a prior with simplified or inaccurate network models and failed to adapt to network dynamics. Specifically, we propose adaptive buffer management with deep reinforcement learning, i.e., DRL-ABM. We first define a tradeoff value to measure the buffer management performance in terms of the start-up delay, underflow frequency and packet losses. Then we formulate the DRL model and design deep neural networks (DNNs) based on the advantage actor critic (A2C) algorithm. To evaluate the performance of DRL-ABM, we perform extensive simulations. Simulation results show that DRLABM can achieve better buffer management performance, i.e., reducing the tradeoff value by at least 20% when compared with the benchmarks TBM and ABM. Moreover, DRL-ABM reduces packet losses to approximately 0, indicating that a smaller receive buffer is sufficient if managed with DRL-ABM.

引用

页码：4935 / 4940

页数：6

共 16 条

[1]

Chen M., 2018, INTERNET ENG TASK FO

[2]

Dua A, 2007, GLOB TELECOMM CONF, P5226

[3]

F. C. Commission, 2021, RAW DAT MEAS BROADB

[4] Buffer-Aware Streaming in Small-Scale Wireless Networks: A Deep Reinforcement Learning Approach [J].