Soft Actor-Critic Deep Reinforcement Learning for Train Timetable Collaborative Optimization of Large-Scale Urban Rail Transit Network Under Dynamic Demand

Authors
Wen, Longhui [1]
Hu, Liyang [2]
Zhou, Wei [1]
Ren, Gang [2]
Zhang, Ning [1]
Affiliations
[1] Southeast Univ, Intelligent Transportat Syst Res Ctr, Nanjing 211189, Peoples R China
[2] Southeast Univ, Jiangsu Prov Collaborat Innovat Ctr Modern Urban T, Sch Transportat, Jiangsu Key Lab Urban ITS, Nanjing 211189, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Rails; Schedules; Collaboration; Real-time systems; Dynamic scheduling; Optimal scheduling; Artificial intelligence; Time-varying systems; Synchronization; Urban rail transit; deep reinforcement learning; train timetable collaborative optimization; soft actor-critic; TIME-DEPENDENT DEMAND; METRO SYSTEM; MODEL; SYNCHRONIZATION; COORDINATION; ALGORITHM;
DOI
10.1109/TITS.2025.3525538
Chinese Library Classification number
TU [Building Science];
Discipline classification code
0813;
Abstract
To address the coordination problem in large-scale urban rail transit (URT) network operations, this paper proposes an adaptive real-time control framework with flexible train scheduling, based on the Soft Actor-Critic (SAC) deep reinforcement learning (DRL) method. First, by analyzing dynamic passenger travel behavior (e.g., entering/exiting stations, transferring) and train operation events (e.g., dispatching, interstation running, station dwelling), the control problem is modeled as a Markov Decision Process (MDP) and an efficient URT simulation environment is constructed. Then, subject to constraints such as train capacity and dispatch intervals, a train scheduling model is developed to minimize both passenger costs and operational costs. The real-time state of the URT system is represented by the total number of passengers present at every platform, and the train dispatch intervals on all lines are used as decision variables. A solution algorithm based on the SAC framework is developed. Finally, experiments on a large-scale URT network comprising 10 lines demonstrate the effectiveness of the proposed framework, which outperforms other reinforcement learning algorithms and traditional heuristic optimization algorithms. Compared to the second-best algorithm, TD3, the proposed approach reduces average passenger waiting time by 1.63% (2.09 seconds) while using 49 fewer trains (a 2.97% decrease).
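The MDP formulation in the abstract (state = passengers waiting at every platform, action = dispatch interval on every line, cost = passenger cost plus operating cost) can be illustrated with a small sketch. The following toy environment is hypothetical and is not the paper's simulator: the class name `ToyURTEnv`, the demand range, the cost weight `train_cost`, the headway bounds, and the simplification that every train offers its full capacity at each platform are all assumptions made for illustration. It omits transfers, running times, and dwell times, and it does not implement SAC itself; it only shows the interface an actor-critic agent with continuous actions would interact with.

```python
import math
import random


class ToyURTEnv:
    """Hypothetical toy URT environment (illustrative only, not the paper's simulator)."""

    def __init__(self, n_lines=2, platforms_per_line=3, capacity=200,
                 min_headway=2.0, max_headway=10.0, step_minutes=10.0,
                 train_cost=50.0, seed=0):
        self.n_lines = n_lines
        self.platforms_per_line = platforms_per_line
        self.capacity = capacity            # assumed passengers per train
        self.min_headway = min_headway      # assumed dispatch-interval bounds (minutes)
        self.max_headway = max_headway
        self.step_minutes = step_minutes    # length of one decision step
        self.train_cost = train_cost        # assumed operating cost per dispatched train
        self.rng = random.Random(seed)
        self.waiting = [[0] * platforms_per_line for _ in range(n_lines)]

    def _state(self):
        # State: number of passengers waiting at every platform, flattened.
        return [w for line in self.waiting for w in line]

    def reset(self):
        self.waiting = [[0] * self.platforms_per_line for _ in range(self.n_lines)]
        return self._state()

    def step(self, action):
        """action: one value in [-1, 1] per line, mapped to a dispatch interval."""
        trains = []
        for line, a in enumerate(action):
            a = max(-1.0, min(1.0, float(a)))
            # Map the bounded action to a headway that satisfies the interval constraint.
            headway = self.min_headway + (a + 1.0) / 2.0 * (self.max_headway - self.min_headway)
            n_trains = math.floor(self.step_minutes / headway)
            trains.append(n_trains)
            for p in range(self.platforms_per_line):
                # Assumed demand: 20 to 40 arrivals per platform per step.
                self.waiting[line][p] += self.rng.randint(20, 40)
                # Capacity constraint: boarding is limited by the seats dispatched.
                boarded = min(self.waiting[line][p], n_trains * self.capacity)
                self.waiting[line][p] -= boarded
        passenger_cost = sum(self._state())              # passengers left waiting
        operating_cost = self.train_cost * sum(trains)   # cost of dispatched trains
        reward = -(passenger_cost + operating_cost)      # minimize both costs
        return self._state(), reward, {"trains": trains}


env = ToyURTEnv()
state = env.reset()
# Shortest interval on line 0, longest interval on line 1.
state, reward, info = env.step([-1.0, 1.0])
print(info["trains"], reward)
```

A shorter dispatch interval clears platforms faster but dispatches more trains, so the reward captures the trade-off between passenger waiting and fleet usage that the abstract describes.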
Pages: 15