Feature-Based MPPI Control with Applications to Maritime Systems

被引：3

作者：

Homburger, Hannes ^{[1
]}

Wirtensohn, Stefan ^{[1
]}

Diehl, Moritz ^{[2
,3
]}

Reuter, Johannes ^{[1
]}

机构：

[1] HTWG Konstanz Univ Appl Sci, Inst Syst Dynam, D-78462 Constance, Germany

[2] Univ Freiburg, Dept Microsyst Engn IMTEK, D-79110 Freiburg, Germany

[3] Univ Freiburg, Dept Math, D-79110 Freiburg, Germany

来源：

MACHINES | 2022年 / 10卷 / 10期

关键词：

sample-based nonlinear model predictive control; stochastic system dynamics; nonlinear model predictive control; maritime systems; collision avoidance; STOCHASTIC-CONTROL;

D O I：

10.3390/machines10100900

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this paper, a novel feature-based sampling strategy for nonlinear Model Predictive Path Integral (MPPI) control is presented. Using the MPPI approach, the optimal feedback control is calculated by solving a stochastic optimal control (OCP) problem online by evaluating the weighted inference of sampled stochastic trajectories. While the MPPI algorithm can be excellently parallelized, the closed-loop performance strongly depends on the information quality of the sampled trajectories. To draw samples, a proposal density is used. The solver's and thus, the controller's performance is of high quality if the sampled trajectories drawn from this proposal density are located in low-cost regions of state-space. In classical MPPI control, the explored state-space is strongly constrained by assumptions that refer to the control value's covariance matrix, which are necessary for transforming the stochastic Hamilton-Jacobi-Bellman (HJB) equation into a linear second-order partial differential equation. To achieve excellent performance even with discontinuous cost functions, in this novel approach, knowledge-based features are introduced to constitute the proposal density and thus the low-cost region of state-space for exploration. This paper addresses the question of how the performance of the MPPI algorithm can be improved using a feature-based mixture of base densities. Furthermore, the developed algorithm is applied to an autonomous vessel that follows a track and concurrently avoids collisions using an emergency braking feature. Therefore, the presented feature-based MPPI algorithm is applied and analyzed in both simulation and full-scale experiments.

引用

页数：23

共 33 条

[1] Abdelaal M, 2016, 2016 IEEE CONFERENCE ON SYSTEMS, PROCESS AND CONTROL (ICSPC), P23, DOI 10.1109/SPC.2016.7920697
[2] [Anonymous], GOOGLE MAPS RHINE RI
[3] A survey of inverse reinforcement learning: Challenges, methods and progress
Arora, Saurabh
Doshi, Prashant
[J]. ARTIFICIAL INTELLIGENCE, 2021, 297 (297)
[4] Nonlinear MPC for Combined Motion Control and Thrust Allocation of Ships
Barlund, Alexander
Linder, Jonas
Feyzmahdavian, Hamid
Lundh, Michael
Tervo, Kalevi
[J]. IFAC PAPERSONLINE, 2020, 53 (02): : 14698 - 14703
[5] Blanke M., 1981, SHIP PROPULSION LOSS
[6] Fossen TI, 2002, Marine control systems guidance, navigation, and control of ships, rigs and underwater vehicles
[7] Robust Model Predictive Path Integral Control: Analysis and Performance Guarantees
Gandhi, Manan S.
Vlahov, Bogdan
Gibson, Jason
Williams, Grady
Theodorou, Evangelos A.
[J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02) : 1423 - 1430
[8] Gómez V, 2016, P I C AUTOMAT PLAN S, P468
[9] Homburger H., 2022, P 6 IEEE C CONTROL T
[10] Homburger H., 2022, P 20 EUROPEAN CONTRO

← 1 2 3 4 →