RAN Information-assisted TCP Congestion Control via DRL with Reward Redistribution

被引:40
作者
Chen, Minghao [1 ]
Li, Rongpeng [1 ]
Zhao, Zhifeng [1 ,2 ]
Zhang, Honggang [1 ]
机构
[1] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China
[2] Zhejiang Lab, Hangzhou, Peoples R China
来源
2021 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS) | 2021年
基金
中国国家自然科学基金;
关键词
Deep reinforcement learning; congestion control; radio access network; reward redistribution; delayed feedback;
D O I
10.1109/ICCWorkshops50388.2021.9473523
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we aim to propose a novel transmission control protocol (TCP) congestion control method from a cross-layer-based perspective and present a deep reinforcement learning (DRL)-driven method called DRL-3R (DRL for congestion control with Radio access network information and Reward Redistribution) so as to learn the TCP congestion control policy in a superior manner. In particular, we incorporate the RAN information to timely grasp the dynamics of RAN. Meanwhile, we relax the implicit assumption [i.e., the feedback of one specific data-sending rate controlling action returns at a round trip time (RTT) after the action is applied] in previous researches by redistributing the rewards and evaluating the merits of actions more accurately. Experiment results show that besides maintaining a reasonable fairness, DRL-3R significantly outperforms classical congestion control methods (e.g., TCP Reno, Westwood, Cubic, and BBR) on network utility by achieving a relatively high throughput while reducing delay in various network environments.
引用
收藏
页数:7
相关论文
共 16 条
  • [1] [Anonymous], 2016, INT C LEARN REPR
  • [2] Arjona-Medina J. A., 2019, ADV NEURAL INFORM PR, V1, P6
  • [3] BBR: Congestion-Based Congestion Control
    Cardwell, Neal
    Cheng, Yuchung
    Gunn, C. Stephen
    Yeganeh, Soheil Hassas
    Jacobson, Van
    [J]. COMMUNICATIONS OF THE ACM, 2017, 60 (02) : 58 - 66
  • [4] Claudio C., 2001, ANN INT C MOBILE COM
  • [5] Jacobson V., 1990, INTERNET ENG TASK FO
  • [6] Jain R., 1991, ART COMPUTER SYSTEMS, P1685
  • [7] SmartCC: A Reinforcement Learning Approach for Multipath TCP Congestion Control in Heterogeneous Networks
    Li, Wenzhong
    Zhang, Han
    Gao, Shaohua
    Xue, Chaojing
    Wang, Xiaoliang
    Lu, Sanglu
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2019, 37 (11) : 2621 - 2633
  • [8] CQIC: Revisiting Cross-Layer Congestion Control for Cellular Networks
    Lu, Feng
    Du, Hao
    Jain, Ankur
    Voelker, Geoffrey M.
    Snoeren, Alex C.
    Terzis, Andreas
    [J]. 16TH INTERNATIONAL WORKSHOP ON MOBILE COMPUTING SYSTEMS AND APPLICATIONS (HOTMOBILE' 15), 2015, : 45 - 50
  • [9] Sangtae Ha, 2008, Operating Systems Review, V42, P64, DOI 10.1145/1400097.1400105
  • [10] Sutton RS, 2018, ADAPT COMPUT MACH LE, P1