Multi-agent DQN with sample-efficient updates for large inter-slice orchestration problems

被引：0

作者：

Doanis, Pavlos ^{[1
]}

Spyropoulos, Thrasyvoulos ^{[2
]}

机构：

[1] EURECOM, Biot, France

[2] Tech Univ Crete, Iraklion, Greece

来源：

2024 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS, ICNC | 2024年

关键词：

Slice orchestration; Beyond 5G Networks; Reinforcement Learning; Deep-Q Network;

D O I：

10.1109/CNC59896.2024.10555923

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Data-driven network slicing has been recently explored as a major driver for beyond 5G networks. Nevertheless, we are still a long way before such solutions are practically applicable in real problems. Reinforcement learning based solutions, addressing the problem of dynamically placing virtual network function chains on top of a physical topology, have to deal with astronomically high action spaces (especially in in multi-VNF, multi-domain, and multi-slice setups). Moreover, their training is not particularly data-efficient, which can pose shortcomings, given the scarce(r) availability of cellular network related data. Multi-agent DQN can reduce the action space complexity by many orders of magnitude compared to standard DQN. Nevertheless, these algorithms are data-hungry and convergence can still be slow. To this end, in this work we introduce two additional mechanisms on top of (multi-agent) DQN to speed up training. These mechanisms intelligently decide how to store to, and how to pick from the experience replay buffer, in order to achieve more efficient parameter updates (faster learning). The convergence speed gains of the proposed scheme are validated using real traffic data.

引用

页码：772 / 777

页数：6

共 16 条

[1] Network Slicing and Softwarization: A Survey on Principles, Enabling Technologies, and Solutions [J].

Afolabi, Ibrahim ;

Taleb, Tarik ;

Samdanis, Konstantinos ;

Ksentini, Adlen ;

Flinck, Hannu .

IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2018, 20 (03) :2429-2453

[2]

[Anonymous], 2003, MobiCom

[3]

Bega D, 2020, IEEE INFOCOM SER, P794, DOI 10.1109/INFOCOM41043.2020.9155299

[4]

Bega D, 2019, IEEE INFOCOM SER, P280, DOI [10.1109/INFOCOM.2019.8737488, 10.1109/infocom.2019.8737488]

[5]

Bertsekas D. P., 2019, REINFORCEMENT LEARNI

[6] Scalable end-to-end slice embedding and reconfiguration based on independent DQN agents [J].

Doanis, Pavlos ;

Giannakas, Theodoros ;

Spyropoulos, Thrasyvoulos .

2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, :3429-3434

[7]

Harchol-Balter M., 2013, Performance Modeling and Design of Computer Systems: Queueing Theory in Action

[8] How Should I Slice My Network? A Multi-Service Empirical Evaluation of Resource Sharing Efficiency [J].

Marquez, Cristina ;

Gramaglia, Marco ;

Fiore, Marco ;

Banchs, Albert ;

Costa-Perez, Xavier .

MOBICOM'18: PROCEEDINGS OF THE 24TH ANNUAL INTERNATIONAL CONFERENCE ON MOBILE COMPUTING AND NETWORKING, 2018, :191-206

[9] Human-level control through deep reinforcement learning [J].

Mnih, Volodymyr ;

Kavukcuoglu, Koray ;

Silver, David ;

Rusu, Andrei A. ;

Veness, Joel ;

Bellemare, Marc G. ;

Graves, Alex ;

Riedmiller, Martin ;

Fidjeland, Andreas K. ;

Ostrovski, Georg ;

Petersen, Stig ;

Beattie, Charles ;

Sadik, Amir ;

Antonoglou, Ioannis ;

King, Helen ;

Kumaran, Dharshan ;

Wierstra, Daan ;

Legg, Shane ;

Hassabis, Demis .

NATURE, 2015, 518 (7540) :529-533

[10] A Deep Reinforcement Learning Approach for VNF Forwarding Graph Embedding [J].

Pham Tran Anh Quang ;

Hadjadj-Aoul, Yassine ;

Outtagarts, Abdelkader .

IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2019, 16 (04) :1318-1331

← 1 2 →