RAN Information-assisted TCP Congestion Control via DRL with Reward Redistribution

被引：38

作者：

Chen, Minghao ^{[1
]}

Li, Rongpeng ^{[1
]}

Zhao, Zhifeng ^{[1
,2
]}

Zhang, Honggang ^{[1
]}

机构：

[1] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China

[2] Zhejiang Lab, Hangzhou, Peoples R China

来源：

2021 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS) | 2021年

基金：

中国国家自然科学基金;

关键词：

Deep reinforcement learning; congestion control; radio access network; reward redistribution; delayed feedback;

D O I：

10.1109/ICCWorkshops50388.2021.9473523

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, we aim to propose a novel transmission control protocol (TCP) congestion control method from a cross-layer-based perspective and present a deep reinforcement learning (DRL)-driven method called DRL-3R (DRL for congestion control with Radio access network information and Reward Redistribution) so as to learn the TCP congestion control policy in a superior manner. In particular, we incorporate the RAN information to timely grasp the dynamics of RAN. Meanwhile, we relax the implicit assumption [i.e., the feedback of one specific data-sending rate controlling action returns at a round trip time (RTT) after the action is applied] in previous researches by redistributing the rewards and evaluating the merits of actions more accurately. Experiment results show that besides maintaining a reasonable fairness, DRL-3R significantly outperforms classical congestion control methods (e.g., TCP Reno, Westwood, Cubic, and BBR) on network utility by achieving a relatively high throughput while reducing delay in various network environments.

引用

页数：7

共 16 条

[1]

Arjona-Medina J. A., 2019, ADV NEURAL INFORM PR, V1, P6

[2] BBR: Congestion-Based Congestion Control [J].

Cardwell, Neal ;

Cheng, Yuchung ;

Gunn, C. Stephen ;

Yeganeh, Soheil Hassas ;

Jacobson, Van .

COMMUNICATIONS OF THE ACM, 2017, 60 (02) :58-66

[3]

Claudio C., 2001, ANN INT C MOBILE COM

[4]

Jacobson V., 1990, INTERNET ENG TASK FO

[5]

Jain R., 1991, The Art of Computer Systems Performance Analysis: TechModeling. niques for Experimental Design, Measurement, Simulation

[6] SmartCC: A Reinforcement Learning Approach for Multipath TCP Congestion Control in Heterogeneous Networks [J].

Li, Wenzhong ;

Zhang, Han ;

Gao, Shaohua ;

Xue, Chaojing ;

Wang, Xiaoliang ;

Lu, Sanglu .

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2019, 37 (11) :2621-2633

[7]

Lillicrap T. P., 2016, P 4 INT C LEARN REPR, P1

[8] CQIC: Revisiting Cross-Layer Congestion Control for Cellular Networks [J].

Lu, Feng ;

Du, Hao ;

Jain, Ankur ;

Voelker, Geoffrey M. ;

Snoeren, Alex C. ;

Terzis, Andreas .

16TH INTERNATIONAL WORKSHOP ON MOBILE COMPUTING SYSTEMS AND APPLICATIONS (HOTMOBILE' 15), 2015, :45-50

[9]

Sangtae Ha, 2008, Operating Systems Review, V42, P64, DOI 10.1145/1400097.1400105

[10]

Sutton RS, 2018, ADAPT COMPUT MACH LE, P1

← 1 2 →