A Safe Reinforcement Learning driven Weights-varying Model Predictive Control for Autonomous Vehicle Motion Control

Cited by: 0
Authors
Zarrouki, Baha [1, 2]
Spanakakis, Marios
Betz, Johannes
Affiliations
[1] Tech Univ Munich, TUM Sch Engn & Design, Automot Technol, Munich, Germany
[2] Tech Univ Munich, TUM Sch Engn & Design, Autonomous Vehicle Syst, Munich, Germany
Source
2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024 | 2024
Keywords
MPC
DOI
10.1109/IV55156.2024.10588747
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Determining the optimal cost function parameters of Model Predictive Control (MPC) to optimize multiple control objectives is a challenging and time-consuming task. Multi-objective Bayesian Optimization (BO) techniques address this problem by determining a Pareto-optimal parameter set for an MPC with static weights. However, a single parameter set may not deliver the best closed-loop control performance when the operating conditions of the MPC change during operation, which motivates adapting the cost function weights at runtime. Deep Reinforcement Learning (RL) algorithms can automatically learn context-dependent optimal parameter sets and dynamically adapt them for a Weights-varying MPC (WMPC). However, learning cost function weights from scratch in a continuous action space may lead to unsafe operating states. To solve this, we propose a novel approach that limits the RL action space to a safe learning space, represented by a catalog of pre-optimized, feasible, BO Pareto-optimal weight sets. We design the RL agent not to learn in a continuous space but to select the best discrete action, each action corresponding to a single set of Pareto-optimal weights, by proactively anticipating upcoming control tasks in a context-dependent manner. This approach introduces a two-step optimization: (1) safety-critical with BO and (2) performance-driven with RL. Hence, even an untrained RL agent guarantees safe and optimal performance. Simulation results demonstrate that an untrained RL-WMPC shows Pareto-optimal closed-loop behavior and that training the RL-WMPC yields performance beyond the Pareto front. The code used in this research is publicly accessible as open-source software: https://github.com/bzarr/TUM-CONTROL
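The discrete action space described in the abstract can be illustrated with a minimal Python sketch. It is not taken from the linked repository: the `WeightSet` fields, the catalog values, and the function names are hypothetical and chosen only for illustration. The sketch shows the structural idea that every action is an index into a fixed catalog, so even a random (untrained) policy can only return a pre-optimized weight set.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class WeightSet:
    """One MPC cost-weight set. Field names are hypothetical."""
    q_lateral: float   # weight on lateral tracking error (assumed name)
    q_velocity: float  # weight on velocity tracking error (assumed name)
    r_steering: float  # weight on steering effort (assumed name)


# Hypothetical catalog standing in for the BO Pareto-optimal weight sets.
# The numbers are invented for illustration, not results from the paper.
PARETO_CATALOG = (
    WeightSet(1.0, 0.5, 0.10),
    WeightSet(2.0, 0.3, 0.20),
    WeightSet(0.5, 1.0, 0.05),
)


def select_weights(action, catalog=PARETO_CATALOG):
    """Map a discrete action index to one catalog entry."""
    if not 0 <= action < len(catalog):
        raise ValueError(f"action {action} is outside the catalog")
    return catalog[action]


def untrained_policy(observation, n_actions, rng):
    """Stand-in for an untrained agent: ignores the observation and
    picks a uniformly random action index."""
    return rng.randrange(n_actions)


rng = random.Random(0)
chosen = [
    select_weights(untrained_policy(None, len(PARETO_CATALOG), rng))
    for _ in range(5)
]
```

Whatever index the policy emits, `chosen` contains only catalog entries. A trained agent would replace `untrained_policy` with a learned mapping from the driving context to an index, while the catalog itself stays fixed.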
Pages: 1401 - 1408
Number of pages: 8
Related papers
50 in total
  • [31] A Swarm-Based Distributed Model Predictive Control Scheme for Autonomous Vehicle Formations in Uncertain Environments
    Bono, Antonio
    Fedele, Giuseppe
    Franze, Giuseppe
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (09) : 8876 - 8886
  • [32] Model Predictive Control Approaches for Lane Keeping of Vehicle
    Kamat, Shivaram
    [J]. IFAC PAPERSONLINE, 2020, 53 (01) : 176 - 182
  • [33] Model-based predictive control of vehicle dynamics
    [Authors not listed]
    Department of Mechanical Engineering, University of Michigan, Ann Arbor, 2350 Hayward Street, Ann Arbor, MI 48109, United States
    [J]. Int. J. Veh. Auton. Syst., 2007, 1-2 (3-27) : 3 - 27
  • [34] Lane Keeping of Vehicle Using Model Predictive Control
    Kamat, Shivaram
    [J]. 2019 IEEE 5TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2019
  • [35] Model predictive control with constraints based on PSO and fuzzy logic applied to the control of coupled longitudinal-lateral dynamics of the autonomous vehicle
    Alika, Rachid
    Mellouli, El Mehdi
    Tissir, El Houssaine
    [J]. INTERNATIONAL JOURNAL OF AUTOMATION AND CONTROL, 2025, 19 (01) : 59 - 100
  • [36] Depth Control of a High Speed Underwater Vehicle using Model Predictive Control
    Prasad, M. P. R.
    Swarup, Akhilesh
    [J]. 2016 IEEE UTTAR PRADESH SECTION INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND ELECTRONICS ENGINEERING (UPCON), 2016, : 218 - 223
  • [37] Landing Control of Unmanned Aerial Vehicle using continuous Model Predictive Control
    Qayyum, Naila
    Bhatti, Aamer Iqbal
    Liaquat, Muwahida
    [J]. 2017 29TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2017, : 1804 - 1808
  • [38] Learning-based Nonlinear Model Predictive Control of Reconfigurable Autonomous Robotic Boats: Roboats
    Kayacan, Erkan
    Park, Shinkyu
    Ratti, Carlo
    Rus, Daniela
    [J]. 2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 8230 - 8237
  • [39] Robust Model Predictive Iterative Learning Control for Iteration-Varying-Reference Batch Processes
    Liu, Xiangjie
    Ma, Lele
    Kong, Xiaobing
    Lee, Kwang Y.
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2021, 51 (07) : 4238 - 4250
  • [40] Moment-Based Model Predictive Control of Autonomous Systems
    Bao, HanQiu
    Kang, Qi
    Shi, XuDong
    Zhou, MengChu
    Li, HaoJun
    An, Jing
    Sedraoui, Khaled
    [J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (04) : 2939 - 2953