PRISMA: A Packet Routing Simulator for Multi-Agent Reinforcement Learning

被引:0
作者
Alliche, Redha A. [1 ]
Barros, Tiago Da Silva [1 ]
Aparicio-Pardo, Ramon [1 ]
Sassatelli, Lucile [2 ]
机构
[1] Univ Cote dAzur, INRIA, CNRS, I3S, Nice, France
[2] Univ Cote dAzur, Inst Univ France, CNRS, I3S, Nice, France
来源
2022 IFIP NETWORKING CONFERENCE (IFIP NETWORKING) | 2022年
关键词
ns-3; Multi-Agent; Packet Routing; Reinforcement Learning; Network Simulation; ML tool;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present PRISMA: Packet Routing Simulator for Multi-Agent Reinforcement Learning. To the best of our knowledge, this is the first tool specifically conceived to develop and test Reinforcement Learning (RL) algorithms for the Distributed Packet Routing (DPR) problem. In this problem, where a communication node selects the outgoing port to forward a packet using local information, distance-vector routing protocol (e.g., RIP) are traditionally applied. However, when network status changes very dynamically, is uncertain, or is partially hidden (e.g., wireless ad hoc networks or wired multi-domain networks), RL is an alternate solution to discover routing policies better fitted to these cases. Unfortunately, no RL tools have been developed to tackle the DPR problem, forcing the researchers to implement their own simplified RL simulation environments, complicating reproducibility and reducing realism. To overcome these issues, we present PRISMA, which offers to the community a standardized framework where: (i) communication process is realistically modelled (thanks to ns3); (ii) distributed nature is explicitly considered (nodes are implemented as separated threads); (iii) and, RL proposals can be easily developed (thanks to a modular code design and real-time training visualization interfaces) and fairly compared them.
引用
收藏
页数:6
相关论文
共 25 条
  • [1] [Anonymous], ZERO MQ OPEN SOURCE
  • [2] [Anonymous], OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
  • [3] [Anonymous], TensorBoard | TensorFlow
  • [4] [Anonymous], GYM GYM TOOLK CREAT
  • [5] [Anonymous], TENSORFLOW END TO EN
  • [6] [Anonymous], PRISMA TOOL OPEN MAR
  • [7] Learning Deep Architectures for AI
    Bengio, Yoshua
    [J]. FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 2 (01): : 1 - 127
  • [8] Boyan J. A., 1994, Advances in Neural Information Processing Systems, V6, P671
  • [9] Multiagent Meta-Reinforcement Learning for Adaptive Multipath Routing Optimization
    Chen, Long
    Hu, Bin
    Guan, Zhi-Hong
    Zhao, Lian
    Shen, Xuemin
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (10) : 5374 - 5386
  • [10] Demonstration of distributed collaborative learning with end-to-end QoT estimation in multi-domain elastic optical networks
    Chen, Xiaoliang
    Li, Baojia
    Proietti, Roberto
    Liu, Che-Yu
    Zhu, Zuqing
    Ben Yoo, S. J.
    [J]. OPTICS EXPRESS, 2019, 27 (24) : 35700 - 35709