Planning spatial networks with Monte Carlo tree search

被引:5
|
作者
Darvariu, Victor-Alexandru [1 ,2 ]
Hailes, Stephen [1 ]
Musolesi, Mirco [1 ,2 ,3 ]
机构
[1] UCL, London, England
[2] Alan Turing Inst, London, England
[3] Univ Bologna, Bologna, Italy
来源
PROCEEDINGS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES | 2023年 / 479卷 / 2269期
基金
英国工程与自然科学研究理事会;
关键词
planning; spatial networks; Monte Carlo tree search; ROBUSTNESS; GO;
D O I
10.1098/rspa.2022.0383
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
We tackle the problem of goal-directed graph construction: given a starting graph, finding a set of edges whose addition maximally improves a global objective function. This problem emerges in many transportation and infrastructure networks that are of critical importance to society. We identify two significant shortcomings of present reinforcement learning methods: their exclusive focus on topology to the detriment of spatial characteristics (which are known to influence the growth and density of links), as well as the rapid growth in the action spaces and costs of model training. Our formulation as a deterministic Markov decision process allows us to adopt the Monte Carlo tree search framework, an artificial intelligence decision-time planning method. We propose improvements over the standard upper confidence bounds for trees (UCT) algorithm for this family of problems that addresses their single-agent nature, the trade-off between the cost of edges and their contribution to the objective, and an action space linear in the number of nodes. Our approach yields substantial improvements over UCT for increasing the efficiency and attack resilience of synthetic networks and real-world Internet backbone and metro systems, while using a wall clock time budget similar to other search-based algorithms. We also demonstrate that our approach scales to significantly larger networks than previous reinforcement learning methods, since it does not require training a model.
引用
收藏
页数:21
相关论文
共 50 条
  • [41] Epistemic Multi-agent Planning Using Monte-Carlo Tree Search
    Reifsteck, Daniel
    Engesser, Thorsten
    Mattmueller, Robert
    Nebel, Bernhard
    ADVANCES IN ARTIFICIAL INTELLIGENCE, KI 2019, 2019, 11793 : 277 - 289
  • [42] Optimizing quantum annealing schedules with Monte Carlo tree search enhanced with neural networks
    Chen, Yu-Qin
    Chen, Yu
    Lee, Chee-Kong
    Zhang, Shengyu
    Hsieh, Chang-Yu
    NATURE MACHINE INTELLIGENCE, 2022, 4 (03) : 269 - 278
  • [43] AlphaTruss: Monte Carlo Tree Search for Optimal Truss Layout Design
    Luo, Ruifeng
    Wang, Yifan
    Xiao, Weifang
    Zhao, Xianzhong
    BUILDINGS, 2022, 12 (05)
  • [44] Monte Carlo tree search control scheme for multibody dynamics applications
    Tang, Yixuan
    Orzechowski, Grzegorz
    Prokop, Ales
    Mikkola, Aki
    NONLINEAR DYNAMICS, 2024, 112 (10) : 8363 - 8391
  • [45] Monte Carlo tree search for dynamic shortest-path interdiction
    Bochkarev, Alexey A.
    Smith, J. Cole
    NETWORKS, 2024, 84 (04) : 398 - 419
  • [46] Single-player Monte-Carlo tree search for SameGame
    Schadd, Maarten P. D.
    Winands, Mark H. M.
    Tak, Mandy J. W.
    Uiterwijk, Jos W. H. M.
    KNOWLEDGE-BASED SYSTEMS, 2012, 34 : 3 - 11
  • [47] A Timetable Rescheduling Approach for Railway based on Monte Carlo Tree Search
    Wang, Rongsheng
    Zhou, Min
    Li, Yidong
    Zhang, Qi
    Dong, Hairong
    2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019, : 3738 - 3743
  • [48] Enhancing Monte Carlo Tree Search for Playing Hearthstone
    Choe, Jean Seong Bjorn
    Kim, Jong-Kook
    2019 IEEE CONFERENCE ON GAMES (COG), 2019,
  • [49] Monte Carlo Tree Search for Bayesian Reinforcement Learning
    Vien, Ngo Anh
    Ertel, Wolfgang
    2012 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2012), VOL 1, 2012, : 138 - 143
  • [50] PARALLEL MACHINE SCHEDULING WITH MONTE CARLO TREE SEARCH
    Agardi, Anita
    Nehez, Karoly
    ACTA POLYTECHNICA, 2021, 61 (02) : 307 - 312