Planning spatial networks with Monte Carlo tree search

被引:5
|
作者
Darvariu, Victor-Alexandru [1 ,2 ]
Hailes, Stephen [1 ]
Musolesi, Mirco [1 ,2 ,3 ]
机构
[1] UCL, London, England
[2] Alan Turing Inst, London, England
[3] Univ Bologna, Bologna, Italy
来源
PROCEEDINGS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES | 2023年 / 479卷 / 2269期
基金
英国工程与自然科学研究理事会;
关键词
planning; spatial networks; Monte Carlo tree search; ROBUSTNESS; GO;
D O I
10.1098/rspa.2022.0383
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
We tackle the problem of goal-directed graph construction: given a starting graph, finding a set of edges whose addition maximally improves a global objective function. This problem emerges in many transportation and infrastructure networks that are of critical importance to society. We identify two significant shortcomings of present reinforcement learning methods: their exclusive focus on topology to the detriment of spatial characteristics (which are known to influence the growth and density of links), as well as the rapid growth in the action spaces and costs of model training. Our formulation as a deterministic Markov decision process allows us to adopt the Monte Carlo tree search framework, an artificial intelligence decision-time planning method. We propose improvements over the standard upper confidence bounds for trees (UCT) algorithm for this family of problems that addresses their single-agent nature, the trade-off between the cost of edges and their contribution to the objective, and an action space linear in the number of nodes. Our approach yields substantial improvements over UCT for increasing the efficiency and attack resilience of synthetic networks and real-world Internet backbone and metro systems, while using a wall clock time budget similar to other search-based algorithms. We also demonstrate that our approach scales to significantly larger networks than previous reinforcement learning methods, since it does not require training a model.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Online model adaptation in Monte Carlo tree search planning
    Zuccotto, Maddalena
    Fusa, Edoardo
    Castellini, Alberto
    Farinelli, Alessandro
    OPTIMIZATION AND ENGINEERING, 2024,
  • [2] Hierarchical Monte Carlo Tree Search for Latent Skill Planning
    Pei, Yue
    2023 2ND ASIA CONFERENCE ON ALGORITHMS, COMPUTING AND MACHINE LEARNING, CACML 2023, 2023, : 6 - 12
  • [3] Monte Carlo Tree Search for Network Planning for Next Generation Mobile Communication Networks
    Shen, Linzhi
    Wang, Shaowei
    2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
  • [4] The hierarchical task network planning method based on Monte Carlo Tree Search
    Shao, Tianhao
    Zhang, Hongjun
    Cheng, Kai
    Zhang, Ke
    Bie, Lin
    KNOWLEDGE-BASED SYSTEMS, 2021, 225
  • [5] Monte Carlo Tree Search With Reinforcement Learning for Motion Planning
    Weingertner, Philippe
    Ho, Minnie
    Timofeev, Andrey
    Aubert, Sebastien
    Pita-Gil, Guillermo
    2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,
  • [6] Efficient Object Manipulation Planning with Monte Carlo Tree Search
    Zhu, Huaijiang
    Meduri, Avadesh
    Righetti, Ludovic
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 10628 - 10635
  • [7] Improving Monte Carlo Tree Search with Artificial Neural Networks without Heuristics
    Cotarelo, Alba
    Garcia-Diaz, Vicente
    Nunez-Valdez, Edward Rolando
    Gonzalez Garcia, Cristian
    Gomez, Alberto
    Chun-Wei Lin, Jerry
    APPLIED SCIENCES-BASEL, 2021, 11 (05): : 1 - 18
  • [8] Nonasymptotic Analysis of Monte Carlo Tree Search
    Shah, Devavrat
    Xie, Qiaomin
    Xu, Zhi
    OPERATIONS RESEARCH, 2022, 70 (06) : 3234 - 3260
  • [9] Text Matching with Monte Carlo Tree Search
    He, Yixuan
    Tao, Shuchang
    Xu, Jun
    Guo, Jiafeng
    Lan, YanYan
    Cheng, Xueqi
    INFORMATION RETRIEVAL, CCIR 2018, 2018, 11168 : 41 - 52
  • [10] Retrosynthetic planning with experience-guided Monte Carlo tree search
    Hong, Siqi
    Zhuo, Hankz Hankui
    Jin, Kebing
    Shao, Guang
    Zhou, Zhanwen
    COMMUNICATIONS CHEMISTRY, 2023, 6 (01)