Feature-Based MPPI Control with Applications to Maritime Systems

被引:3
作者
Homburger, Hannes [1 ]
Wirtensohn, Stefan [1 ]
Diehl, Moritz [2 ,3 ]
Reuter, Johannes [1 ]
机构
[1] HTWG Konstanz Univ Appl Sci, Inst Syst Dynam, D-78462 Constance, Germany
[2] Univ Freiburg, Dept Microsyst Engn IMTEK, D-79110 Freiburg, Germany
[3] Univ Freiburg, Dept Math, D-79110 Freiburg, Germany
关键词
sample-based nonlinear model predictive control; stochastic system dynamics; nonlinear model predictive control; maritime systems; collision avoidance; STOCHASTIC-CONTROL;
D O I
10.3390/machines10100900
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, a novel feature-based sampling strategy for nonlinear Model Predictive Path Integral (MPPI) control is presented. Using the MPPI approach, the optimal feedback control is calculated by solving a stochastic optimal control (OCP) problem online by evaluating the weighted inference of sampled stochastic trajectories. While the MPPI algorithm can be excellently parallelized, the closed-loop performance strongly depends on the information quality of the sampled trajectories. To draw samples, a proposal density is used. The solver's and thus, the controller's performance is of high quality if the sampled trajectories drawn from this proposal density are located in low-cost regions of state-space. In classical MPPI control, the explored state-space is strongly constrained by assumptions that refer to the control value's covariance matrix, which are necessary for transforming the stochastic Hamilton-Jacobi-Bellman (HJB) equation into a linear second-order partial differential equation. To achieve excellent performance even with discontinuous cost functions, in this novel approach, knowledge-based features are introduced to constitute the proposal density and thus the low-cost region of state-space for exploration. This paper addresses the question of how the performance of the MPPI algorithm can be improved using a feature-based mixture of base densities. Furthermore, the developed algorithm is applied to an autonomous vessel that follows a track and concurrently avoids collisions using an emergency braking feature. Therefore, the presented feature-based MPPI algorithm is applied and analyzed in both simulation and full-scale experiments.
引用
收藏
页数:23
相关论文
共 33 条
  • [1] Abdelaal M, 2016, 2016 IEEE CONFERENCE ON SYSTEMS, PROCESS AND CONTROL (ICSPC), P23, DOI 10.1109/SPC.2016.7920697
  • [2] [Anonymous], GOOGLE MAPS RHINE RI
  • [3] A survey of inverse reinforcement learning: Challenges, methods and progress
    Arora, Saurabh
    Doshi, Prashant
    [J]. ARTIFICIAL INTELLIGENCE, 2021, 297 (297)
  • [4] Nonlinear MPC for Combined Motion Control and Thrust Allocation of Ships
    Barlund, Alexander
    Linder, Jonas
    Feyzmahdavian, Hamid
    Lundh, Michael
    Tervo, Kalevi
    [J]. IFAC PAPERSONLINE, 2020, 53 (02): : 14698 - 14703
  • [5] Blanke M., 1981, SHIP PROPULSION LOSS
  • [6] Fossen TI, 2002, Marine control systems guidance, navigation, and control of ships, rigs and underwater vehicles
  • [7] Robust Model Predictive Path Integral Control: Analysis and Performance Guarantees
    Gandhi, Manan S.
    Vlahov, Bogdan
    Gibson, Jason
    Williams, Grady
    Theodorou, Evangelos A.
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02) : 1423 - 1430
  • [8] Gómez V, 2016, P I C AUTOMAT PLAN S, P468
  • [9] Homburger H., 2022, P 6 IEEE C CONTROL T
  • [10] Homburger H., 2022, P 20 EUROPEAN CONTRO