Network routing optimization approach based on deep reinforcement learning

被引:0
|
作者
Meng L. [1 ,2 ]
Guo B. [1 ,2 ]
Yang W. [1 ,2 ]
Zhang X. [1 ,2 ]
Zhao Z. [1 ,2 ]
Huang S. [1 ,2 ]
机构
[1] School of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing
[2] State Key Laboratory of Information Photonics and Optical Communication, Beijing
关键词
Deep deterministec policy gradient (DDPG) algorithm; Deep reinforcement learning; Routing optimization;
D O I
10.12305/j.issn.1001-506X.2022.07.28
中图分类号
学科分类号
摘要
Aiming at the routing optimization problem of different network loads under the same network topology, based on the deep reinforcement learning method, two optimization methods for routing distribution based on the current network traffic state are proposed. Through the iterative interaction between the network simulation system and the deep reinforcement learning model, continuous training and optimization of network routing for the distribution of traffic relationships are realized. Improvements have been made in using the deep deterministec policy gradient (DDPG) algorithm to solve the routing optimization problem, making this optimization method more suitable for solving the problem of network routing optimization. At the same time, a brand-new link weight construction strategy is designed, which uses network traffic to construct input state elements for the neural network. Through the preprocessing of the original data, the learning efficiency of the neural network is strengthened, and the stability of the training model is greatly improved. And for the continuous action space of the high-latitude large-scale network, the action space is discretized, which effectively reduces the complexity of the action space and speeds up the model convergence. Experimental results show that the proposed optimization method can adapt to changing traffic and link status, enhance the stability of model training and improve network performance. © 2022, Editorial Office of Systems Engineering and Electronics. All right reserved.
引用
收藏
页码:2311 / 2318
页数:7
相关论文
共 22 条
  • [1] BOUZIDI H, OUTTAGARTS A, LANGAR R., Deep reinforcement learning application for network latency management in software defined networks, Proc.of the IEEE GLOBECOM, (2019)
  • [2] DIEGO K, FERNANDO M R., Paulo estevesverissimo, Christian Esteverothenberg, siamakazodolmolky, andsteveuhlig.software-defined networking, Proceedings of the IEEE, 103, 1, pp. 14-76, (2015)
  • [3] OUYANG Y, WANG L L, YANG A D., Next decade of telecommunications artificial intelligence, Telecommunications Science, 37, 3, pp. 1-36, (2021)
  • [4] ALEXANDER C, MOULI C, SAILESH K D, A SDN framework for distributed network analytics, Proc.of the IFIP/IEEE International Symposium on Integrated Network Management, pp. 9-17, (2015)
  • [5] WANG F Y, ZHANG J J, ZHENG X, Et al., Where does alpha go go: from church-turning thesis to alpha go thesis and beyond, Acta Automatica Sinica, 3, 2, pp. 113-120, (2016)
  • [6] CLARK D D, PARTRIDGE C, RAMMING J C, Et al., A knowledge plane for the internet, Proc.of the Conference on Applications, Technologies, Architectures, and Protocols for Computer Communications, pp. 3-10, (2003)
  • [7] BOUTABA R, SALAHUDDIN M A, LIMAM N, Et al., A comprehensive survey on machine learning for networking: evolution, applications and research opportunities, Journal of Internet Services and Applications, (2018)
  • [8] SENDRA S, REGO A, LLORET J, Et al., Including artificial intelligence in a routing protocol using software defined networks, Proc.of the IEEE International Conference on Communications Workshops, pp. 670-674, (2017)
  • [9] STAMPA G, ARIAS M, SANCHEZ-CHARLES D, Et al., A deep-reinforcement learning approach for software-defined networking routing optimization
  • [10] LILLICRAP T P, HUNT J J, PRITZEL A, Et al., Continuous control with deep reinforcement learning