Soft Actor-Critic Deep Reinforcement Learning for Train Timetable Collaborative Optimization of Large-Scale Urban Rail Transit Network Under Dynamic Demand

被引：0

作者：

Wen, Longhui ^{[1
]}

Hu, Liyang ^{[2
]}

Zhou, Wei ^{[1
]}

Ren, Gang ^{[2
]}

Zhang, Ning ^{[1
]}

机构：

[1] Southeast Univ, Intelligent Transportat Syst Res Ctr, Nanjing 211189, Peoples R China

[2] Southeast Univ, Jiangsu Prov Collaborat Innovat Ctr Modern Urban T, Sch Transportat, Jiangsu Key Lab Urban ITS, Nanjing 211189, Peoples R China

来源：

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2025年

基金：

中国国家自然科学基金;

关键词：

Rails; Schedules; Collaboration; Real-time systems; Dynamic scheduling; Optimal scheduling; Artificial intelligence; Time-varying systems; Synchronization; Urban rail transit; deep reinforcement learning; train timetable collaborative optimization; soft actor-critic; TIME-DEPENDENT DEMAND; METRO SYSTEM; MODEL; SYNCHRONIZATION; COORDINATION; ALGORITHM;

D O I：

10.1109/TITS.2025.3525538

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

To address the collaborative issue in large-scale urban rail transit (URT) network operations, this paper proposes an adaptive real-time control framework based on the Soft Actor-Critic (SAC) deep reinforcement learning (DRL) method, featuring flexible train scheduling capabilities. First, by analyzing dynamic passenger travel behavior (e.g., entering/exiting stations, transferring) and train operation events (e.g., dispatching, interstation running, station dwelling), the control problem is modeled as a Markov Decision Process (MDP) and an efficient URT simulation environment is constructed. Then, considering constraints such as train capacity and dispatch intervals, a train scheduling model is developed to minimize both passenger costs and operational costs. Subsequently, the real-time state of the URT system is represented by the overall number of passengers present at every platform, and train dispatch intervals on all lines are used as decision variables. A solving algorithm based on the SAC framework is developed. Finally, experimental results on a large-scale URT network comprising 10 lines demonstrate the effectiveness of the proposed framework, showing superior performance compared to other reinforcement learning algorithms and traditional heuristic optimization algorithms. The proposed approach achieves a 1.63% reduction in average passenger waiting time, equivalent to 2.09 seconds, while utilizing 49 fewer trains, representing a 2.97% decrease, compared to the second-best TD3 algorithm.

引用

页数：15

共 9 条

[1] Real-Time Optimization of Urban Rail Transit Train Scheduling via Advantage Actor-Critic Deep Reinforcement Learning
Wen, Longhui
Zhou, Wei
Liu, Jiajun
Ren, Gang
Zhang, Ning
JOURNAL OF TRANSPORTATION ENGINEERING PART A-SYSTEMS, 2024, 150 (09)
[2] Graph Soft Actor-Critic Reinforcement Learning for Large-Scale Distributed Multirobot Coordination
Hu, Yifan
Fu, Junjie
Wen, Guanghui
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, : 1 - 12
[3] Integrated optimization of train route plan and timetable with dynamic demand for the urban rail transit line
Yang, Ruixia
Han, Baoming
Zhang, Qi
Han, Zhenyu
Long, Yuxuan
TRANSPORTMETRICA B-TRANSPORT DYNAMICS, 2023, 11 (01) : 93 - 126
[4] An actor-critic deep reinforcement learning approach for metro train scheduling with rolling stock circulation under stochastic demand
Ying, Cheng-shuo
Chow, Andy H. F.
Chin, Kwai-Sang
TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 2020, 140 (140) : 210 - 235
[5] ACDRL: An actor-critic deep reinforcement learning approach for solving the energy-aimed train timetable rescheduling problem under random disturbances
Liao, Jinlin
Wu, Guilian
Chen, Hao
Ni, Shiyuan
Lin, Tingting
Tang, Lu
ENERGY REPORTS, 2022, 8 : 1350 - 1357
[6] Soft Actor-Critic Deep Reinforcement Learning with Hybrid Mixed-Integer Actions for Demand Responsive Scheduling of Energy Systems
Campos, Gustavo
El-Farra, Nael H.
Palazoglu, Ahmet
INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2022, 61 (24) : 8443 - 8461
[7] Multi-objective optimization approach for permanent magnet machine viaimproved soft actor-critic based on deep reinforcement learning
Wang, Chen
Dong, Tianyu
Chen, Lei
Zhu, Guixiang
Chen, Yihan
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 264
[8] A hierarchical deep reinforcement learning method for solving urban route planning problems under large-scale customers and real-time traffic conditions
Li, Yuanyuan
Guan, Qingfeng
Gu, Jun Feng
Jiang, Xintong
Li, Yang
INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2025, 39 (01) : 118 - 141
[9] Semi-supervised soft sensor development based on dynamic dimensionality reduction-assisted large-scale pseudo label optimization and sample-weighted quality-relevant deep learning
Jin, Huaiping
Liu, Guangkun
Qian, Bin
Wang, Bin
Yang, Biao
Chen, Xiangguang
CHEMICAL ENGINEERING SCIENCE, 2024, 298

← 1 →