Multi-Agent Deep Reinforcement Learning-Based Algorithm For Fast Generalization On Routing Problems

被引:0
作者
Barbahan, Ibraheem [1 ]
Baikalov, Vladimir [1 ]
Vyatkin, Valeriy [2 ]
Filchenkov, Andrey [1 ]
机构
[1] ITMO Univ, 49 Kronverksky Pr, St Petersburg 197101, Russia
[2] Aalto Univ, Espoo 02150, Finland
来源
10TH INTERNATIONAL YOUNG SCIENTISTS CONFERENCE IN COMPUTATIONAL SCIENCE (YSC2021) | 2021年 / 193卷
基金
俄罗斯科学基金会;
关键词
DQN-Routing; Distributed routing problems; Fast generalization; Multi-agent deep reinforcement learning; Routing problem; DQN; TIME;
D O I
10.1016/j.procs.2021.10.023
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We propose a fast generalization method for DQN-Routing, an algorithm based on multi -agent deep reinforcement learning that suffers from generalization problem when introduced to new topologies even if it was trained on a similar topology. The proposed method is based on the wisdom of crowds and allows the distributed routing algorithm, DQN-Routing, to generalize better to new topologies that were not seen before during training. The proposed method also aims to decrease the solution search time as the original DQN-Routing algorithm takes a long time to converge, and to increase the overall performance by minimizing the mean delivery time and total power consumption and the number of collisions. The experimental evaluation of our method proved that is capable to generalize to new topologies and outperform the DQN-Routing algorithm. (C) 2021 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY-NC -ND license (haps:llcreativecommons.orgin Isesiby-ric-nd14.0) Peer-review under responsibility of the scientific committee of the 10th International Young Scientists Conference on Computational Science
引用
收藏
页码:228 / 238
页数:11
相关论文
共 17 条
[1]   Application of reinforcement learning to routing in distributed wireless networks: a review [J].
Al-Rawi, Hasan A. A. ;
Ng, Ming Ann ;
Yau, Kok-Lim Alvin .
ARTIFICIAL INTELLIGENCE REVIEW, 2015, 43 (03) :381-416
[2]  
[Anonymous], 2019, ARXIV191007421
[3]  
Carr T, 2018, ARXIV PREPRINT ARXIV
[4]  
Cobbe K, 2019, PR MACH LEARN RES, V97
[5]   The Immune System Computes the State of the Body: Crowd Wisdom, Machine Learning, and Immune Cell Reference Repertoires Help Manage Inflammation [J].
Cohen, Irun R. ;
Efroni, Sol .
FRONTIERS IN IMMUNOLOGY, 2019, 10
[6]  
Duraipandian M., 2019, J TRENDS COMPUT SCI, V01, P25, DOI DOI 10.36548/JTCSST.2019.1.003
[7]   A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients [J].
Grondman, Ivo ;
Busoniu, Lucian ;
Lopes, Gabriel A. D. ;
Babuska, Robert .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 42 (06) :1291-1307
[8]   A survey on position-based routing for vehicular ad hoc networks [J].
Liu, Jianqi ;
Wan, Jiafu ;
Wang, Qinruo ;
Deng, Pan ;
Zhou, Keliang ;
Qiao, Yupeng .
TELECOMMUNICATION SYSTEMS, 2016, 62 (01) :15-30
[9]   Multi-agent deep learning for simultaneous optimization for time and energy in distributed routing system [J].
Mukhutdinov, Dmitry ;
Filchenkov, Andrey ;
Shalyto, Anatoly ;
Vyatkin, Valeriy .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 94 :587-600
[10]   TIDE: Time-relevant deep reinforcement learning for routing optimization [J].
Sun, Penghao ;
Hu, Yuxiang ;
Lan, Julong ;
Tian, Le ;
Chen, Min .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 99 :401-409