PRISMA: A Packet Routing Simulator for Multi-Agent Reinforcement Learning

Cited by: 0

Authors:

Alliche, Redha A. [1]

Barros, Tiago Da Silva [1]

Aparicio-Pardo, Ramon [1]

Sassatelli, Lucile [2]

Affiliations:

[1] Univ Côte d'Azur, INRIA, CNRS, I3S, Nice, France

[2] Univ Côte d'Azur, Inst Univ France, CNRS, I3S, Nice, France

Source:

2022 IFIP NETWORKING CONFERENCE (IFIP NETWORKING) | 2022

Keywords:

ns-3; Multi-Agent; Packet Routing; Reinforcement Learning; Network Simulation; ML tool

DOI:

Not available

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology]

Discipline classification code:

0812

Abstract:

In this paper, we present PRISMA: Packet Routing Simulator for Multi-Agent Reinforcement Learning. To the best of our knowledge, this is the first tool specifically conceived to develop and test Reinforcement Learning (RL) algorithms for the Distributed Packet Routing (DPR) problem. In this problem, where a communication node selects the outgoing port to forward a packet using only local information, distance-vector routing protocols (e.g., RIP) are traditionally applied. However, when the network status changes very dynamically, is uncertain, or is partially hidden (e.g., in wireless ad hoc networks or wired multi-domain networks), RL is an alternative solution for discovering routing policies better fitted to these cases. Unfortunately, no RL tools have been developed to tackle the DPR problem, forcing researchers to implement their own simplified RL simulation environments, which complicates reproducibility and reduces realism. To overcome these issues, we present PRISMA, which offers the community a standardized framework where: (i) the communication process is realistically modelled (thanks to ns-3); (ii) the distributed nature of the problem is explicitly considered (nodes are implemented as separate threads); and (iii) RL proposals can be easily developed (thanks to a modular code design and real-time training visualization interfaces) and fairly compared.
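To illustrate the DPR setting described in the abstract, here is a minimal sketch of a node that chooses an outgoing port from purely local estimates, in the style of tabular Q-routing. It is not PRISMA's code or API: the class name `QRoutingNode`, its methods, and the `learning_rate` parameter are hypothetical names introduced only for this example.

```python
from collections import defaultdict


class QRoutingNode:
    """Hypothetical node (not part of PRISMA) that routes using local estimates only."""

    def __init__(self, node_id, neighbors, learning_rate=0.5):
        self.node_id = node_id
        self.neighbors = list(neighbors)
        self.learning_rate = learning_rate
        # q[destination][neighbor] = estimated delivery time via that neighbor.
        # Estimates start at 0.0, so unexplored neighbors look attractive.
        self.q = defaultdict(lambda: {n: 0.0 for n in self.neighbors})

    def best_estimate(self, destination):
        """Smallest estimated delivery time from this node to the destination."""
        if destination == self.node_id:
            return 0.0
        return min(self.q[destination].values())

    def select_port(self, destination):
        """Greedy choice: the neighbor with the lowest estimated delivery time."""
        estimates = self.q[destination]
        return min(self.neighbors, key=lambda n: estimates[n])

    def update(self, destination, neighbor, hop_delay, neighbor_estimate):
        """Move the estimate toward the observed hop delay plus the
        neighbor's own best estimate (the only remote information used)."""
        target = hop_delay + neighbor_estimate
        old = self.q[destination][neighbor]
        self.q[destination][neighbor] = old + self.learning_rate * (target - old)


node = QRoutingNode("A", ["B", "C"])
# Forwarding to D via B took 4.0 time units, and B reports 2.0 remaining.
node.update("D", "B", hop_delay=4.0, neighbor_estimate=2.0)
next_hop = node.select_port("D")
```

In this sketch each node only needs its own table and the single estimate a neighbor sends back, which mirrors the "local information" constraint of the DPR problem. A deep RL variant would replace the table with a learned function approximator.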

Pages: 6
