Finite Time Lyapunov Exponent Analysis of Model Predictive Control and Reinforcement Learning

Cited by: 0

Authors:

Krishna, Kartik [1]

Brunton, Steven L. [1]

Song, Zhuoyuan [2]

Affiliations:

[1] Univ Washington, Dept Mech Engn, Seattle, WA 98195 USA

[2] Univ Hawaii Manoa, Dept Mech Engn, Honolulu, HI 96822 USA

Source:

IEEE ACCESS | 2023 / Vol. 11

Funding:

U.S. National Science Foundation;

Keywords:

Optimal control; finite-time Lyapunov exponents; path planning; mobile sensors; dynamical systems; unsteady fluid dynamics; model predictive control; reinforcement learning; LAGRANGIAN COHERENT STRUCTURES; OPTIMAL TRAJECTORY GENERATION; WIND-DRIVEN; TRANSPORT; DEFINITION; VEHICLES; WAKE;

DOI:

10.1109/ACCESS.2023.3326424

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Finite-time Lyapunov exponents (FTLEs) provide a powerful approach to compute time-varying analogs of invariant manifolds in unsteady fluid flow fields. These manifolds are useful to visualize the transport mechanisms of passive tracers advecting with the flow. However, many vehicles and mobile sensors are not passive, but are instead actuated according to some intelligent trajectory planning or control law; for example, model predictive control and reinforcement learning are often used to design energy-efficient trajectories in a dynamically changing background flow. In this work, we investigate the use of FTLE on such controlled agents to gain insight into optimal transport routes for navigation in known unsteady flows. We find that these controlled FTLE (cFTLE) coherent structures separate the flow field into different regions with similar costs of transport to the goal location. These separatrices are functions of the planning algorithm's hyper-parameters, such as the optimization time horizon and the cost of actuation. Computing the invariant sets and manifolds of active agent dynamics in dynamic flow fields is useful in the context of robust motion control, hyperparameter tuning, and determining safe and collision-free trajectories for autonomous systems. Moreover, these cFTLE structures provide insight into effective deployment locations for mobile agents with actuation and energy constraints to traverse the ocean or atmosphere.
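
For readers unfamiliar with the quantity the abstract refers to, the following is a minimal sketch of a standard FTLE computation for passive tracers, not the authors' implementation. It assumes the textbook double-gyre flow as a stand-in background flow, a fixed-step RK4 integrator, and finite differences on a uniform grid; the names `double_gyre`, `flow_map`, and `ftle_field` and all parameter values are hypothetical. The FTLE is taken as ln(lambda_max(C)) / (2|T|), where C is the Cauchy-Green tensor of the flow map over the horizon T. Under the same assumptions, passing closed-loop agent dynamics as `velocity` instead of the passive flow would give a field in the spirit of the cFTLE described above.

```python
import numpy as np


def double_gyre(t, x, y, A=0.1, eps=0.25, omega=2 * np.pi / 10):
    """Velocity of the periodically forced double-gyre flow on [0, 2] x [0, 1].

    Assumed test flow with illustrative parameter values.
    """
    a = eps * np.sin(omega * t)
    b = 1.0 - 2.0 * a
    f = a * x**2 + b * x
    dfdx = 2.0 * a * x + b
    u = -np.pi * A * np.sin(np.pi * f) * np.cos(np.pi * y)
    v = np.pi * A * np.cos(np.pi * f) * np.sin(np.pi * y) * dfdx
    return u, v


def flow_map(x0, y0, t0, T, velocity=double_gyre, n_steps=200):
    """Advect points from time t0 to t0 + T with fixed-step RK4."""
    x = np.array(x0, dtype=float)
    y = np.array(y0, dtype=float)
    dt = T / n_steps
    t = t0
    for _ in range(n_steps):
        u1, v1 = velocity(t, x, y)
        u2, v2 = velocity(t + dt / 2, x + dt / 2 * u1, y + dt / 2 * v1)
        u3, v3 = velocity(t + dt / 2, x + dt / 2 * u2, y + dt / 2 * v2)
        u4, v4 = velocity(t + dt, x + dt * u3, y + dt * v3)
        x = x + dt / 6 * (u1 + 2 * u2 + 2 * u3 + u4)
        y = y + dt / 6 * (v1 + 2 * v2 + 2 * v3 + v4)
        t += dt
    return x, y


def ftle_field(nx=101, ny=51, t0=0.0, T=15.0, velocity=double_gyre):
    """Return grid coordinates and the FTLE field for horizon T."""
    xs = np.linspace(0.0, 2.0, nx)
    ys = np.linspace(0.0, 1.0, ny)
    X, Y = np.meshgrid(xs, ys)  # arrays of shape (ny, nx)
    PX, PY = flow_map(X, Y, t0, T, velocity=velocity)

    # Flow-map gradient by finite differences (axis 0 is y, axis 1 is x).
    dPXdy, dPXdx = np.gradient(PX, ys, xs)
    dPYdy, dPYdx = np.gradient(PY, ys, xs)

    # Entries of the Cauchy-Green tensor C = F^T F.
    c11 = dPXdx**2 + dPYdx**2
    c12 = dPXdx * dPXdy + dPYdx * dPYdy
    c22 = dPXdy**2 + dPYdy**2

    # Largest eigenvalue of the symmetric 2x2 matrix C, in closed form.
    tr = c11 + c22
    det = c11 * c22 - c12**2
    lam_max = tr / 2 + np.sqrt(np.maximum(tr**2 / 4 - det, 0.0))

    sigma = np.log(np.maximum(lam_max, 1e-300)) / (2.0 * abs(T))
    return X, Y, sigma
```

Calling `X, Y, sigma = ftle_field()` yields an array that can be contour-plotted; ridges of high `sigma` mark the separatrices between regions whose trajectories diverge over the horizon.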


Pages: 118916-118930

Page count: 15
