MetroZero: Deep Reinforcement Learning and Monte Carlo Tree Search for Optimized Metro Network Expansion

被引:0
作者
Alkilane, Khaled [1 ]
Lee, Der-Horng [1 ]
机构
[1] Zhejiang Univ, Zhejiang Univ Univ Illinois Urbana Champaign Inst, Haining 314400, Peoples R China
关键词
Metro networks; network expansion; deep reinforcement learning; graph neural networks; transport network design; GENETIC ALGORITHM; DESIGN; SHOGI; CHESS; GO;
D O I
10.1109/TITS.2024.3490501
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Metro networks necessitate continuous expansion, either extending existing lines or constructing new ones. Optimizing this process, however, presents multifaceted challenges due to complex spatial and demographic relationships, dynamic travel patterns, and a vast solution space with non-linearities and multiple objectives. Existing approaches often fall short, either relying heavily on subjective expert knowledge or limiting their scope to isolated corridors. This paper introduces MetroZero, a deep reinforcement learning (DRL) framework designed to overcome these limitations. We formulate the optimization as a Markov Decision Process (MDP) and leverage a Monte Carlo Tree Search (MCTS) algorithm guided by an actor-critic agent. This powerful combination identifies the optimal sequence of expansion stations within budgetary constraints. To effectively learn network representations, we develop a multiplex graph encoder powered by attentive message passing. A graph attention network (GAT) and a feasibility mask are employed to prioritize high-potential expansion locations and navigate the search space. Inspired by AlphaZero, we train MetroZero through simulated self-play expansion games. Extensive experiments on real-world datasets from Beijing and Changsha demonstrate MetroZero's effectiveness and superiority. In a complex expansion scenario, it achieves remarkable improvements of 19.6% and 20.4% over the second-best model. Further experiments across varied urban contexts underscore MetorZero's scalability and adaptability.
引用
收藏
页码:810 / 823
页数:14
相关论文
共 39 条
[1]   Data-Driven Transit Network Design at Scale [J].
Bertsimas, Dimitris ;
Ng, Yee Sian ;
Yan, Julia .
OPERATIONS RESEARCH, 2021, 69 (04) :1118-1133
[2]   A faster algorithm for betweenness centrality [J].
Brandes, U .
JOURNAL OF MATHEMATICAL SOCIOLOGY, 2001, 25 (02) :163-177
[3]   A Survey of Monte Carlo Tree Search Methods [J].
Browne, Cameron B. ;
Powley, Edward ;
Whitehouse, Daniel ;
Lucas, Simon M. ;
Cowling, Peter I. ;
Rohlfshagen, Philipp ;
Tavener, Stephen ;
Perez, Diego ;
Samothrakis, Spyridon ;
Colton, Simon .
IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2012, 4 (01) :1-43
[4]   A heuristic for the location of a rapid transit line [J].
Bruno, G ;
Gendreau, M ;
Laporte, G .
COMPUTERS & OPERATIONS RESEARCH, 2002, 29 (01) :1-12
[5]   Robust optimization models for integrated train stop planning and timetabling with passenger demand uncertainty [J].
Cacchiani, Valentina ;
Qi, Jianguo ;
Yang, Lixing .
TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 2020, 136 :1-29
[6]   Evaluating the value of new metro lines using route diversity measures: The case of Hong Kong's Mass Transit Railway system [J].
Chan, Ho-Yin ;
Chen, Anthony ;
Li, Guoyuan ;
Xu, Xiangdong ;
Lam, William .
JOURNAL OF TRANSPORT GEOGRAPHY, 2021, 91
[7]  
Chen DL, 2020, AAAI CONF ARTIF INTE, V34, P3438
[8]   Planning spatial networks with Monte Carlo tree search [J].
Darvariu, Victor-Alexandru ;
Hailes, Stephen ;
Musolesi, Mirco .
PROCEEDINGS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 2023, 479 (2269)
[9]  
Dijkstra E.W., 1959, NUMENSCHE MATH, V1, P269, DOI [DOI 10.1145/3544585.3544600, 10.1007/BF01386390, DOI 10.1007/BF01386390]
[10]   Searching for spin glass ground states through deep reinforcement learning [J].
Fan, Changjun ;
Shen, Mutian ;
Nussinov, Zohar ;
Liu, Zhong ;
Sun, Yizhou ;
Liu, Yang-Yu .
NATURE COMMUNICATIONS, 2023, 14 (01)