A Safe Reinforcement Learning driven Weights-varying Model Predictive Control for Autonomous Vehicle Motion Control
Times Cited: 0
Authors:
Zarrouki, Baha [1,2]
Spanakakis, Marios
Betz, Johannes
Affiliations:
[1] Tech Univ Munich, TUM Sch Engn & Design, Automot Technol, Munich, Germany
[2] Tech Univ Munich, TUM Sch Engn & Design, Autonomous Vehicle Syst, Munich, Germany
Source:
2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024
2024
Keywords:
MPC;
DOI:
10.1109/IV55156.2024.10588747
Chinese Library Classification (CLC):
TP [Automation Technology, Computer Technology];
Discipline Classification Code:
0812;
Abstract:
Determining the optimal cost function parameters of Model Predictive Control (MPC) to optimize multiple control objectives is a challenging and time-consuming task. Multi-objective Bayesian Optimization (BO) techniques solve this problem by determining a Pareto-optimal parameter set for an MPC with static weights. However, a single parameter set may not deliver optimal closed-loop control performance when the context of the MPC operating conditions changes during operation, creating the need to adapt the cost function weights at runtime. Deep Reinforcement Learning (RL) algorithms can automatically learn context-dependent optimal parameter sets and dynamically adapt them for a Weights-varying MPC (WMPC). However, learning cost function weights from scratch in a continuous action space may lead to unsafe operating states. To address this, we propose a novel approach that limits the RL action space to a safe learning space represented by a catalog of pre-optimized, feasible, BO Pareto-optimal weight sets. The RL agent does not learn in a continuous space but instead selects optimal discrete actions, each corresponding to a single set of Pareto-optimal weights, by proactively anticipating upcoming control tasks in a context-dependent manner. This approach introduces a two-step optimization: (1) safety-critical with BO and (2) performance-driven with RL. Hence, even an untrained RL agent guarantees safe and optimal performance. Simulation results demonstrate that an untrained RL-WMPC exhibits Pareto-optimal closed-loop behavior, and that training the RL-WMPC yields performance beyond the Pareto front. The code used in this research is publicly accessible as open-source software: https://github.com/bzarr/TUM-CONTROL
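The abstract describes an offline safety step (a BO-generated catalog of Pareto-optimal weight sets) combined with an online selection step (an RL policy with a discrete action space). The following is a minimal, hypothetical Python sketch of that two-step idea, not the authors' implementation: the names pareto_catalog, select_weights, the placeholder weight values, and the observation layout are all assumptions made for illustration.

```python
# Sketch of the two-step optimization described in the abstract (assumed names/values):
# (1) safety-critical step: a catalog of pre-optimized, feasible weight sets
#     (in the paper these come from multi-objective Bayesian Optimization);
# (2) performance-driven step: an RL policy with a DISCRETE action space that
#     selects one catalog entry per context, so even an untrained policy can
#     only ever apply a pre-validated, Pareto-optimal weight set.

import numpy as np

# (1) Hypothetical placeholder catalog of Pareto-optimal MPC cost weights.
pareto_catalog = [
    {"q_pos": 10.0, "q_vel": 1.0, "r_steer": 0.5},   # comfort-oriented
    {"q_pos": 50.0, "q_vel": 2.0, "r_steer": 0.1},   # tracking-oriented
    {"q_pos": 25.0, "q_vel": 5.0, "r_steer": 0.3},   # balanced
]

def select_weights(policy, observation):
    """(2) The policy returns a discrete action, i.e. an index into the
    catalog; the returned weights are therefore always feasible."""
    action = policy(observation)
    return pareto_catalog[int(action) % len(pareto_catalog)]

# An untrained (random) policy is still safe, because every action maps to a
# pre-optimized weight set rather than to arbitrary continuous weights.
untrained_policy = lambda obs: np.random.randint(len(pareto_catalog))

observation = np.zeros(8)   # hypothetical context features (e.g. upcoming path curvature, speed)
weights = select_weights(untrained_policy, observation)
print("Selected MPC cost weights:", weights)
```

The point of this sketch is the design choice the abstract highlights: safety is enforced offline by restricting the action set to the BO Pareto front, while the RL policy is only responsible for ranking those safe options in a context-dependent way at runtime.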
Pages: 1401-1408
Number of pages: 8