RAN Information-assisted TCP Congestion Control via DRL with Reward Redistribution

被引:38
作者
Chen, Minghao [1 ]
Li, Rongpeng [1 ]
Zhao, Zhifeng [1 ,2 ]
Zhang, Honggang [1 ]
机构
[1] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China
[2] Zhejiang Lab, Hangzhou, Peoples R China
来源
2021 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS) | 2021年
基金
中国国家自然科学基金;
关键词
Deep reinforcement learning; congestion control; radio access network; reward redistribution; delayed feedback;
D O I
10.1109/ICCWorkshops50388.2021.9473523
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we aim to propose a novel transmission control protocol (TCP) congestion control method from a cross-layer-based perspective and present a deep reinforcement learning (DRL)-driven method called DRL-3R (DRL for congestion control with Radio access network information and Reward Redistribution) so as to learn the TCP congestion control policy in a superior manner. In particular, we incorporate the RAN information to timely grasp the dynamics of RAN. Meanwhile, we relax the implicit assumption [i.e., the feedback of one specific data-sending rate controlling action returns at a round trip time (RTT) after the action is applied] in previous researches by redistributing the rewards and evaluating the merits of actions more accurately. Experiment results show that besides maintaining a reasonable fairness, DRL-3R significantly outperforms classical congestion control methods (e.g., TCP Reno, Westwood, Cubic, and BBR) on network utility by achieving a relatively high throughput while reducing delay in various network environments.
引用
收藏
页数:7
相关论文
共 16 条
[1]  
Arjona-Medina J. A., 2019, ADV NEURAL INFORM PR, V1, P6
[2]   BBR: Congestion-Based Congestion Control [J].
Cardwell, Neal ;
Cheng, Yuchung ;
Gunn, C. Stephen ;
Yeganeh, Soheil Hassas ;
Jacobson, Van .
COMMUNICATIONS OF THE ACM, 2017, 60 (02) :58-66
[3]  
Claudio C., 2001, ANN INT C MOBILE COM
[4]  
Jacobson V., 1990, INTERNET ENG TASK FO
[5]  
Jain R., 1991, The Art of Computer Systems Performance Analysis: TechModeling. niques for Experimental Design, Measurement, Simulation
[6]   SmartCC: A Reinforcement Learning Approach for Multipath TCP Congestion Control in Heterogeneous Networks [J].
Li, Wenzhong ;
Zhang, Han ;
Gao, Shaohua ;
Xue, Chaojing ;
Wang, Xiaoliang ;
Lu, Sanglu .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2019, 37 (11) :2621-2633
[7]  
Lillicrap T. P., 2016, P 4 INT C LEARN REPR, P1
[8]   CQIC: Revisiting Cross-Layer Congestion Control for Cellular Networks [J].
Lu, Feng ;
Du, Hao ;
Jain, Ankur ;
Voelker, Geoffrey M. ;
Snoeren, Alex C. ;
Terzis, Andreas .
16TH INTERNATIONAL WORKSHOP ON MOBILE COMPUTING SYSTEMS AND APPLICATIONS (HOTMOBILE' 15), 2015, :45-50
[9]  
Sangtae Ha, 2008, Operating Systems Review, V42, P64, DOI 10.1145/1400097.1400105
[10]  
Sutton RS, 2018, ADAPT COMPUT MACH LE, P1