Deep Deterministic Policy Gradient for High-Speed Train Trajectory Optimization

Cited by: 28

Authors:

Ning, Lingbin [1]

Zhou, Min [1]

Hou, Zhuopu [1]

Goverde, Rob M. P. [2]

Wang, Fei-Yue [3]

Dong, Hairong [1]

Affiliations:

[1] Beijing Jiaotong Univ, State Key Lab Rail Traff Control & Safety, Beijing 100044, Peoples R China

[2] Delft Univ Technol, Dept Transport & Planning, NL-2628 CN Delft, Netherlands

[3] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China

Source:

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2022, Vol. 23, No. 08

Funding:

National Natural Science Foundation of China

Keywords:

Rail transportation; Training; Heuristic algorithms; Resistance; Optimal control; Trajectory optimization; Switches; High-speed railway; train trajectory optimization; deep deterministic policy gradient; energy efficiency; TRAFFIC MANAGEMENT; LEARNING APPROACH; MODEL; INTEGRATION; OPERATION; ALGORITHM; SYSTEM; DELAY;

DOI:

10.1109/TITS.2021.3105380

Chinese Library Classification (CLC) number:

TU [Building Science]

Discipline classification code:

0813

Abstract:

This paper proposes a novel train trajectory optimization (TTO) approach for high-speed railways. We restrict our attention to single-train operation scenarios with different scheduled or rescheduled running times, aiming to generate optimal recommended train trajectories in real time that ensure the punctuality and energy efficiency of train operation. A learning-based approach, deep deterministic policy gradient (DDPG), is designed to generate optimal train trajectories through offline training on the interaction between the agent and a trajectory simulation environment. An allocating running time and selecting operation modes (ARTSOM) algorithm is proposed to improve train punctuality and to output a series of discrete operation modes (full traction, cruising, coasting, full braking), thereby producing a feasible training set for DDPG that speeds up the training process. Numerical experiments show that DDPG can generate an optimized speed profile within seconds on a realistic railway line. In addition, the results demonstrate the generalization ability of the trained DDPG in solving TTO problems with different running times and line conditions.

Pages: 11562-11574

Number of pages: 13

50 items in total

[1] Trajectory Optimization for High-Speed Trains via a Mixed Integer Linear Programming Approach
Cao, Yuan
Zhang, Zixuan
Cheng, Fanglin
Su, Shuai
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (10) : 17666 - 17676
[2] Trajectory Optimization for High-Speed Train Operation
He Zhi-yu
Yang Zhi-jie
Lv Jing-yang
2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018 : 2065 - 2070
[3] Collaborative optimization for train scheduling and train stop planning on high-speed railways
Yang, Lixing
Qi, Jianguo
Li, Shukai
Gao, Yuan
OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 2016, 64 : 57 - 76
[4] On-Line Train Speed Profile Generation of High-Speed Railway With Energy-Saving: A Model Predictive Control Method
Zhong, Weifeng
Li, Shukai
Xu, Hongze
Zhang, Wenjing
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (05) : 4063 - 4074
[5] Multiobjective Optimization for Train Speed Trajectory in CTCS High-Speed Railway With Hybrid Evolutionary Algorithm
Wei ShangGuan
Yan, Xi-Hui
Cai, Bai-Gen
Wang, Jian
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2015, 16 (04) : 2215 - 2225
[6] Optimization Based High-Speed Railway Train Rescheduling with Speed Restriction
Wang, Li
Mo, Wenting
Qin, Yong
Dou, Fei
Jia, Limin
DISCRETE DYNAMICS IN NATURE AND SOCIETY, 2014, 2014
[7] A dynamic ensemble deep deterministic policy gradient recursive network for spatiotemporal traffic speed forecasting in an urban road network
Mi, Xiwei
Yu, Chengqing
Liu, Xinwei
Yan, Guangxi
Yu, Fuhao
Shang, Pan
DIGITAL SIGNAL PROCESSING, 2022, 129
[8] Adaptive Partial Train Speed Trajectory Optimization
Tan, Zhaoxiang
Lu, Shaofeng
Bao, Kai
Zhang, Shaoning
Wu, Chaoxian
Yang, Jie
Xue, Fei
ENERGIES, 2018, 11 (12)
[9] Deep Deterministic Policy Gradient With Prioritized Sampling for Power Control
Zhou, Shiyang
Cheng, Yufan
Lei, Xia
Duan, Huanhuan
IEEE ACCESS, 2020, 8 : 194240 - 194250
[10] A Deep Deterministic Policy Gradient Approach for Vehicle Speed Tracking Control With a Robotic Driver
Hao, Gaofeng
Fu, Zhuang
Feng, Xin
Gong, Zening
Chen, Peng
Wang, Dan
Wang, Weibin
Si, Yang
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2022, 19 (03) : 2514 - 2525