MetroZero: Deep Reinforcement Learning and Monte Carlo Tree Search for Optimized Metro Network Expansion

Citations: 0

Authors:

Alkilane, Khaled [1]

Lee, Der-Horng [1]

Affiliations:

[1] Zhejiang Univ, Zhejiang Univ Univ Illinois Urbana Champaign Inst, Haining 314400, Peoples R China

Source:

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2025 / Vol. 26 / No. 01

Keywords:

Metro networks; network expansion; deep reinforcement learning; graph neural networks; transport network design; GENETIC ALGORITHM; DESIGN; SHOGI; CHESS; GO;

DOI:

10.1109/TITS.2024.3490501

Chinese Library Classification (CLC) number:

TU [Building Science];

Discipline classification code:

0813 ;

Abstract:

Metro networks necessitate continuous expansion, either extending existing lines or constructing new ones. Optimizing this process, however, presents multifaceted challenges due to complex spatial and demographic relationships, dynamic travel patterns, and a vast solution space with non-linearities and multiple objectives. Existing approaches often fall short, either relying heavily on subjective expert knowledge or limiting their scope to isolated corridors. This paper introduces MetroZero, a deep reinforcement learning (DRL) framework designed to overcome these limitations. We formulate the optimization as a Markov Decision Process (MDP) and leverage a Monte Carlo Tree Search (MCTS) algorithm guided by an actor-critic agent. This powerful combination identifies the optimal sequence of expansion stations within budgetary constraints. To effectively learn network representations, we develop a multiplex graph encoder powered by attentive message passing. A graph attention network (GAT) and a feasibility mask are employed to prioritize high-potential expansion locations and navigate the search space. Inspired by AlphaZero, we train MetroZero through simulated self-play expansion games. Extensive experiments on real-world datasets from Beijing and Changsha demonstrate MetroZero's effectiveness and superiority. In a complex expansion scenario, it achieves remarkable improvements of 19.6% and 20.4% over the second-best model. Further experiments across varied urban contexts underscore MetroZero's scalability and adaptability.
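
As a reading aid, the following minimal Python sketch illustrates two ideas named in the abstract: a feasibility mask applied to policy scores, and tree-search selection guided by policy priors and value estimates. It is not the authors' implementation. The function names (`masked_softmax`, `puct_select`), the exploration constant `c_puct`, and the PUCT-style selection rule (borrowed from AlphaZero-style search) are assumptions made for illustration only.

```python
import math


def masked_softmax(logits, feasible):
    """Softmax restricted to feasible candidates (hypothetical helper).

    Infeasible candidate stations receive probability 0, so the search
    never expands them.
    """
    if not any(feasible):
        raise ValueError("no feasible candidate station")
    # Subtract the largest feasible logit for numerical stability.
    m = max(l for l, ok in zip(logits, feasible) if ok)
    exps = [math.exp(l - m) if ok else 0.0 for l, ok in zip(logits, feasible)]
    total = sum(exps)
    return [e / total for e in exps]


def puct_select(priors, q_values, visits, c_puct=1.5):
    """Pick the child maximizing Q + U (assumed PUCT-style rule).

    U grows with the policy prior and shrinks as a child is visited more,
    trading off exploitation (Q) against exploration (U).
    """
    total = sum(visits)
    best, best_score = None, -math.inf
    for i, p in enumerate(priors):
        if p == 0.0:  # masked out as infeasible
            continue
        # sqrt(total + 1) keeps the bonus non-zero before any visit.
        u = c_puct * p * math.sqrt(total + 1) / (1 + visits[i])
        score = q_values[i] + u
        if score > best_score:
            best, best_score = i, score
    return best
```

In this sketch, the priors would come from the actor head and the `q_values` from backed-up critic estimates; how MetroZero actually combines them is described in the paper itself, not here.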

Pages: 810-823

Page count: 14
