Multiagent Reinforcement Learning for Active Guidance Control of Railway Vehicles with Independently Rotating Wheels

被引:2
作者
Wei, Juyao [1 ]
Lu, Zhenggang [1 ]
Yin, Zheng [1 ]
Jing, Zhipeng [1 ]
机构
[1] Tongji Univ, Inst Rail Transit, Shanghai 201804, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 04期
关键词
active guidance control; independently rotating wheels (IRW); prioritized experience replay (PER); multiagent deep deterministic policy gradient (MADDPG); BOGIE;
D O I
10.3390/app14041677
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
This paper presents a novel data-driven multiagent reinforcement learning (MARL) controller for enhancing the running stability of independently rotating wheels (IRW) and reducing wheel-rail wear. We base our active guidance controller on the multiagent deep deterministic policy gradient (MADDPG) algorithm. In this framework, each IRW controller is treated as an independent agent, facilitating localized control of individual wheelsets and reducing the complexity of the required observations. Furthermore, we enhance the MADDPG algorithm with prioritized experience replay (PER), resulting in the PER-MADDPG algorithm, which optimizes training convergence and stability by prioritizing informative experience samples. In this paper, we compare the PER-MADDPG algorithm against existing controllers, demonstrating the superior simulation performance of the proposed algorithm, particularly in terms of self-centering capability and curve-negotiation behavior, effectively reducing the wear number. We also develop a scaled IRW vehicle for active guidance experiments. The experimental results validate the enhanced running performance of IRW vehicles using our proposed controller.
引用
收藏
页数:21
相关论文
共 31 条
[11]   Estimating the wheel lateral position of a mechatronic railway running gear with nonlinear wheel-rail geometry [J].
Keck, Alexander ;
Schwarz, Christoph ;
Meurer, Thomas ;
Heckmann, Andreas ;
Grether, Gustav .
MECHATRONICS, 2021, 73
[12]   Active control of independently-rotating wheels with gyroscopes and tachometers - simple solutions for perfect curving and high stability performance [J].
Liu, Xiaoyuan ;
Goodall, Roger ;
Iwnicki, Simon .
VEHICLE SYSTEM DYNAMICS, 2021, 59 (11) :1719-1734
[13]   Can language models be used for real-world urban-delivery route optimization? [J].
Liu, Yang ;
Wu, Fanyou ;
Liu, Zhiyuan ;
Wang, Kai ;
Wang, Feiyue ;
Qu, Xiaobo .
INNOVATION, 2023, 4 (06)
[14]  
Lowe R, 2017, ADV NEUR IN, V30
[15]   MADDPG-based joint optimization of task partitioning and computation resource allocation in mobile edge computing [J].
Lu, Kun ;
Li, Rong-Da ;
Li, Ming-Chu ;
Xu, Guo-Rui .
NEURAL COMPUTING & APPLICATIONS, 2023, 35 (22) :16559-16576
[16]   Robust active guidance control using the μ-synthesis method for a tramcar with independently rotating wheelsets [J].
Lu, Zheng-Gang ;
Yang, Zhe ;
Huang, Qi ;
Wang, Xiao-Chao .
PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART F-JOURNAL OF RAIL AND RAPID TRANSIT, 2019, 233 (01) :33-48
[17]   Integrated active control of independently rotating wheels on rail vehicles via observers [J].
Lu, Zheng-Gang ;
Sun, Xiao-Jie ;
Yang, Jun-Qi .
PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART F-JOURNAL OF RAIL AND RAPID TRANSIT, 2017, 231 (03) :295-305
[18]   Active Steering Controller for Driven Independently Rotating Wheelset Vehicles Based on Deep Reinforcement Learning [J].
Lu, Zhenggang ;
Wei, Juyao ;
Wang, Zehan .
PROCESSES, 2023, 11 (09)
[19]   Robust control for independently rotating wheelsets on a railway vehicle using practical sensors [J].
Mei, TX ;
Goodall, RM .
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2001, 9 (04) :599-607
[20]  
Perez J, 2002, VEHICLE SYST DYN, V37, P209