A Safe Reinforcement Learning driven Weights-varying Model Predictive Control for Autonomous Vehicle Motion Control

被引:0
|
作者
Zarrouki, Baha [1 ,2 ]
Spanakakis, Marios
Betz, Johannes
机构
[1] Tech Univ Munich, TUM Sch Engn & Design, Automot Technol, Munich, Germany
[2] Tech Univ Munich, TUM Sch Engn & Design, Autonomous Vehicle Syst, Munich, Germany
来源
2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024 | 2024年
关键词
MPC;
D O I
10.1109/IV55156.2024.10588747
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Determining the optimal cost function parameters of Model Predictive Control (MPC) to optimize multiple control objectives is a challenging and time-consuming task. Multi-objective Bayesian Optimization (BO) techniques solve this problem by determining a Pareto optimal parameter set for an MPC with static weights. However, a single parameter set may not deliver the most optimal closed-loop control performance when the context of the MPC operating conditions changes during its operation, urging the need to adapt the cost function weights at runtime. Deep Reinforcement Learning (RL) algorithms can automatically learn context-dependent optimal parameter sets and dynamically adapt for a Weights-varying MPC (WMPC). However, learning cost function weights from scratch in a continuous action space may lead to unsafe operating states. To solve this, we propose a novel approach limiting the RL action space within a safe learning space that we represent by a catalog of pre-optimized feasible BO Pareto-optimal weight sets. We conceive an RL agent not to learn in a continuous space but to select the most optimal discrete actions, each corresponding to a single set of Pareto optimal weights, by proactively anticipating upcoming control tasks in a context-dependent manner. This approach introduces a two-step optimization: (1) safety-critical with BO and (2) performance-driven with RL. Hence, even an untrained RL agent guarantees a safe and optimal performance. Simulation results demonstrate that an untrained RL-WMPC shows Pareto-optimal closed-loop behavior and training the RL-WMPC helps exhibit a performance beyond the Pareto-front. The code used in this research is publicly accessible as open-source software: https://github.com/bzarr/TUM-CONTROL
引用
收藏
页码:1401 / 1408
页数:8
相关论文
共 50 条
  • [21] Adaptive parameterized model predictive control based on reinforcement learning: A synthesis framework
    Sun, Dingshan
    Jamshidnejad, Anahita
    De Schutter, Bart
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 136
  • [22] A practically implementable reinforcement learning control approach by leveraging offset-free model predictive control
    Hassanpour, Hesam
    Mhaskar, Prashant
    Corbett, Brandon
    COMPUTERS & CHEMICAL ENGINEERING, 2024, 181
  • [23] A novel stable and safe model predictive control framework for autonomous rendezvous and docking with a tumbling target
    Dong, Kaikai
    Luo, Jianjun
    Limon, Daniel
    ACTA ASTRONAUTICA, 2022, 200 : 176 - 187
  • [24] RNN-based linear parameter varying adaptive model predictive control for autonomous driving
    Kebbati, Yassine
    Ait-Oufroukh, Naima
    Ichalal, Dalil
    Vigneron, Vincent
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2025, 56 (05) : 996 - 1008
  • [25] Multiple Autonomous Underwater Vehicle Formation Obstacle Avoidance Control Using Event-Triggered Model Predictive Control
    Wang, Linling
    Xu, Xiaoyan
    Han, Bing
    Zhang, Huapeng
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (10)
  • [26] Symmetry and motion primitives in model predictive control
    Flasskamp, Kathrin
    Ober-Bloebaum, Sina
    Worthmann, Karl
    MATHEMATICS OF CONTROL SIGNALS AND SYSTEMS, 2019, 31 (04) : 455 - 485
  • [27] Learning-Based Stochastic Model Predictive Control for Autonomous Driving at Uncontrolled Intersections
    Soman, Surya
    Zanon, Mario
    Bemporad, Alberto
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025, 26 (02) : 1538 - 1546
  • [28] Autonomous racing using Linear Parameter Varying-Model Predictive Control (LPV-MPC)
    Alcala, Eugenio
    Puig, Vicenc
    Quevedo, Joseba
    Rosolia, Ugo
    CONTROL ENGINEERING PRACTICE, 2020, 95
  • [29] Model Predictive Control for a Linear Parameter Varying Model of an UAV
    Cavanini, Luca
    Ippoliti, Gianluca
    Camacho, Eduardo F.
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2021, 101 (03)
  • [30] A Swarm-Based Distributed Model Predictive Control Scheme for Autonomous Vehicle Formations in Uncertain Environments
    Bono, Antonio
    Fedele, Giuseppe
    Franze, Giuseppe
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (09) : 8876 - 8886