A Safe Reinforcement Learning driven Weights-varying Model Predictive Control for Autonomous Vehicle Motion Control

被引：0

作者：

Zarrouki, Baha ^{[1
,2
]}

Spanakakis, Marios

Betz, Johannes

机构：

[1] Tech Univ Munich, TUM Sch Engn & Design, Automot Technol, Munich, Germany

[2] Tech Univ Munich, TUM Sch Engn & Design, Autonomous Vehicle Syst, Munich, Germany

来源：

2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024 | 2024年

关键词：

MPC;

D O I：

10.1109/IV55156.2024.10588747

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Determining the optimal cost function parameters of Model Predictive Control (MPC) to optimize multiple control objectives is a challenging and time-consuming task. Multi-objective Bayesian Optimization (BO) techniques solve this problem by determining a Pareto optimal parameter set for an MPC with static weights. However, a single parameter set may not deliver the most optimal closed-loop control performance when the context of the MPC operating conditions changes during its operation, urging the need to adapt the cost function weights at runtime. Deep Reinforcement Learning (RL) algorithms can automatically learn context-dependent optimal parameter sets and dynamically adapt for a Weights-varying MPC (WMPC). However, learning cost function weights from scratch in a continuous action space may lead to unsafe operating states. To solve this, we propose a novel approach limiting the RL action space within a safe learning space that we represent by a catalog of pre-optimized feasible BO Pareto-optimal weight sets. We conceive an RL agent not to learn in a continuous space but to select the most optimal discrete actions, each corresponding to a single set of Pareto optimal weights, by proactively anticipating upcoming control tasks in a context-dependent manner. This approach introduces a two-step optimization: (1) safety-critical with BO and (2) performance-driven with RL. Hence, even an untrained RL agent guarantees a safe and optimal performance. Simulation results demonstrate that an untrained RL-WMPC shows Pareto-optimal closed-loop behavior and training the RL-WMPC helps exhibit a performance beyond the Pareto-front. The code used in this research is publicly accessible as open-source software: https://github.com/bzarr/TUM-CONTROL

引用

页码：1401 / 1408

页数：8

共 50 条

[31] A Swarm-Based Distributed Model Predictive Control Scheme for Autonomous Vehicle Formations in Uncertain Environments
Bono, Antonio
Fedele, Giuseppe
Franze, Giuseppe
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (09) : 8876 - 8886
[32] Model Predictive Control Approaches for Lane Keeping of Vehicle
Kamat, Shivaram
[J]. IFAC PAPERSONLINE, 2020, 53 (01): : 176 - 182
[33] Model-based predictive control of vehicle dynamics
Department of Mechanical Engineering, University of Michigan, Ann Arbor, 2350 Hayward Street, Ann Arbor, MI 48109, United States
不详
不详
不详
不详
[J]. Int. J. Veh. Auton. Syst., 2007, 1-2 (3-27): : 3 - 27
[34] Lane Keeping of Vehicle Using Model Predictive Control
Kamat, Shivaram
[J]. 2019 IEEE 5TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2019,
[35] Model predictive control with constraints based on PSO and fuzzy logic applied to the control of coupled longitudinal-lateral dynamics of the autonomous vehicle
Alika, Rachid
Mellouli, El Mehdi
Tissir, El Houssaine
[J]. INTERNATIONAL JOURNAL OF AUTOMATION AND CONTROL, 2025, 19 (01) : 59 - 100
[36] Depth Control of a High Speed Underwater Vehicle using Model Predictive Control
Prasad, M. P. R.
Swarup, Akhilesh
[J]. 2016 IEEE UTTAR PRADESH SECTION INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND ELECTRONICS ENGINEERING (UPCON), 2016, : 218 - 223
[37] Landing Control of Unmanned Aerial Vehicle using continuous Model Predictive Control
Qayyum, Naila
Bhatti, Aamer Iqbal
Liaquat, Muwahida
[J]. 2017 29TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2017, : 1804 - 1808
[38] Learning-based Nonlinear Model Predictive Control of Reconfigurable Autonomous Robotic Boats: Roboats
Kayacan, Erkan
Park, Shinkyu
Ratti, Carlo
Rus, Daniela
[J]. 2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 8230 - 8237
[39] Robust Model Predictive Iterative Learning Control for Iteration-Varying-Reference Batch Processes
Liu, Xiangjie
Ma, Lele
Kong, Xiaobing
Lee, Kwang Y.
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, 51 (07): : 4238 - 4250
[40] Moment-Based Model Predictive Control of Autonomous Systems
Bao, HanQiu
Kang, Qi
Shi, XuDong
Zhou, MengChu
Li, HaoJun
An, Jing
Sedraoui, Khaled
[J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (04): : 2939 - 2953

← 1 2 3 4 5 →