Planning spatial networks with Monte Carlo tree search

被引:5
|
作者
Darvariu, Victor-Alexandru [1 ,2 ]
Hailes, Stephen [1 ]
Musolesi, Mirco [1 ,2 ,3 ]
机构
[1] UCL, London, England
[2] Alan Turing Inst, London, England
[3] Univ Bologna, Bologna, Italy
来源
PROCEEDINGS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES | 2023年 / 479卷 / 2269期
基金
英国工程与自然科学研究理事会;
关键词
planning; spatial networks; Monte Carlo tree search; ROBUSTNESS; GO;
D O I
10.1098/rspa.2022.0383
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
We tackle the problem of goal-directed graph construction: given a starting graph, finding a set of edges whose addition maximally improves a global objective function. This problem emerges in many transportation and infrastructure networks that are of critical importance to society. We identify two significant shortcomings of present reinforcement learning methods: their exclusive focus on topology to the detriment of spatial characteristics (which are known to influence the growth and density of links), as well as the rapid growth in the action spaces and costs of model training. Our formulation as a deterministic Markov decision process allows us to adopt the Monte Carlo tree search framework, an artificial intelligence decision-time planning method. We propose improvements over the standard upper confidence bounds for trees (UCT) algorithm for this family of problems that addresses their single-agent nature, the trade-off between the cost of edges and their contribution to the objective, and an action space linear in the number of nodes. Our approach yields substantial improvements over UCT for increasing the efficiency and attack resilience of synthetic networks and real-world Internet backbone and metro systems, while using a wall clock time budget similar to other search-based algorithms. We also demonstrate that our approach scales to significantly larger networks than previous reinforcement learning methods, since it does not require training a model.
引用
收藏
页数:21
相关论文
共 50 条
  • [31] Monte Carlo Tree Search for Love Letter
    Omarov, Tamirlan
    Aslam, Hamna
    Brown, Joseph Alexander
    Reading, Elizabeth
    19TH INTERNATIONAL CONFERENCE ON INTELLIGENT GAMES AND SIMULATION (GAME-ON(R) 2018), 2018, : 10 - 15
  • [32] Monte Carlo Tree Search with Boltzmann Exploration
    Painter, Michael
    Baioumy, Mohamed
    Hawes, Nick
    Lacerda, Bruno
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [33] Classification of Monte Carlo Tree Search Variants
    McGuinness, Cameron
    2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 357 - 363
  • [34] Quantum Circuit Transformation: A Monte Carlo Tree Search Framework
    Zhou, Xiangzhen
    Feng, Yuan
    Li, Sanjiang
    ACM TRANSACTIONS ON DESIGN AUTOMATION OF ELECTRONIC SYSTEMS, 2022, 27 (06)
  • [35] A Monte Carlo Tree Search Framework for Quantum Circuit Transformation
    Zhou, Xiangzhen
    Feng, Yuan
    Li, Sanjiang
    2020 IEEE/ACM INTERNATIONAL CONFERENCE ON COMPUTER AIDED-DESIGN (ICCAD), 2020,
  • [36] Reinforcement learning for active distribution network planning based on Monte Carlo tree search
    Zhang, Xi
    Hua, Weiqi
    Liu, Youbo
    Duan, Jiajun
    Tang, Zhiyuan
    Liu, Junyong
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2022, 138
  • [37] Bayesian Optimization for Backpropagation in Monte-Carlo Tree Search
    Lim, Nengli
    Li, Yueqin
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT II, 2021, 12892 : 209 - 221
  • [38] Accelerating Cooperative Planning for Automated Vehicles with Learned Heuristics and Monte Carlo Tree Search
    Kurzer, Karl
    Fechner, Marcus
    Zoellner, J. Marius
    2020 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2020, : 1726 - 1733
  • [39] A self-learning Monte Carlo tree search algorithm for robot path planning
    Li, Wei
    Liu, Yi
    Ma, Yan
    Xu, Kang
    Qiu, Jiang
    Gan, Zhongxue
    FRONTIERS IN NEUROROBOTICS, 2023, 17
  • [40] Time Management for Monte Carlo Tree Search
    Baier, Hendrik
    Winands, Mark H. M.
    IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2016, 8 (03) : 301 - 314