Finite Time Lyapunov Exponent Analysis of Model Predictive Control and Reinforcement Learning

被引:0
|
作者
Krishna, Kartik [1 ]
Brunton, Steven L. [1 ]
Song, Zhuoyuan [2 ]
机构
[1] Univ Washington, Dept Mech Engn, Seattle, WA 98195 USA
[2] Univ Hawaii Manoa, Dept Mech Engn, Honolulu, HI 96822 USA
基金
美国国家科学基金会;
关键词
Optimal control; finite-time Lyapunov exponents; path planning; mobile sensors; dynamical systems; unsteady fluid dynamics; model predictive control; reinforcement learning; LAGRANGIAN COHERENT STRUCTURES; OPTIMAL TRAJECTORY GENERATION; WIND-DRIVEN; TRANSPORT; DEFINITION; VEHICLES; WAKE;
D O I
10.1109/ACCESS.2023.3326424
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Finite-time Lyapunov exponents (FTLEs) provide a powerful approach to compute time-varying analogs of invariant manifolds in unsteady fluid flow fields. These manifolds are useful to visualize the transport mechanisms of passive tracers advecting with the flow. However, many vehicles and mobile sensors are not passive, but are instead actuated according to some intelligent trajectory planning or control law; for example, model predictive control and reinforcement learning are often used to design energy-efficient trajectories in a dynamically changing background flow. In this work, we investigate the use of FTLE on such controlled agents to gain insight into optimal transport routes for navigation in known unsteady flows. We find that these controlled FTLE (cFTLE) coherent structures separate the flow field into different regions with similar costs of transport to the goal location. These separatrices are functions of the planning algorithm's hyper-parameters, such as the optimization time horizon and the cost of actuation. Computing the invariant sets and manifolds of active agent dynamics in dynamic flow fields is useful in the context of robust motion control, hyperparameter tuning, and determining safe and collision-free trajectories for autonomous systems. Moreover, these cFTLE structures provide insight into effective deployment locations for mobile agents with actuation and energy constraints to traverse the ocean or atmosphere.
引用
收藏
页码:118916 / 118930
页数:15
相关论文
共 50 条
  • [41] Relation Between the Finite-Time Lyapunov Exponent and Acoustic Wave
    Han, Shuaibin
    Luo, Yong
    Zhang, Shuhai
    AIAA JOURNAL, 2019, 57 (12) : 5114 - 5125
  • [42] Local finite-time Lyapunov exponent, local sampling and probabilistic source and destination regions
    BozorgMagham, A. E.
    Ross, S. D.
    Schmale, D. G., III
    NONLINEAR PROCESSES IN GEOPHYSICS, 2015, 22 (06) : 663 - 677
  • [43] Damping control by fusion of reinforcement learning and control Lyapunov functions
    Glavic, Mevludin
    Ernst, Damien
    Wehenkel, Louis
    2006 38TH ANNUAL NORTH AMERICAN POWER SYMPOSIUM, NAPS-2006 PROCEEDINGS, 2006, : 361 - +
  • [44] Lyapunov-based distributed reinforcement learning control with stability guarantee
    Yao, Jingshi
    Han, Minghao
    Yin, Xunyuan
    COMPUTERS & CHEMICAL ENGINEERING, 2025, 195
  • [45] Stable Inverse Reinforcement Learning: Policies From Control Lyapunov Landscapes
    Tesfazgi, Samuel
    Sprandl, Leonhard
    Lederer, Armin
    Hirche, Sandra
    IEEE OPEN JOURNAL OF CONTROL SYSTEMS, 2024, 3 : 358 - 374
  • [46] Comparison of Deep Reinforcement Learning and Model Predictive Control for Adaptive Cruise Control
    Lin, Yuan
    McPhee, John
    Azad, Nasser L.
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2021, 6 (02): : 221 - 231
  • [47] Practical concerns of implementing a finite-time Lyapunov exponent analysis with under-resolved data
    Rockwood, Matthew P.
    Loiselle, Thomas
    Green, Melissa A.
    EXPERIMENTS IN FLUIDS, 2019, 60 (04)
  • [48] UVaFTLE: Lagrangian finite time Lyapunov exponent extraction for fluid dynamic applications
    Rocío Carratalá-Sáez
    Yuri Torres
    José Sierra-Pallares
    Sergio López-Huguet
    Diego R. Llanos
    The Journal of Supercomputing, 2023, 79 : 9635 - 9665
  • [49] Refining finite-time Lyapunov exponent ridges and the challenges of classifying them
    Allshouse, Michael R.
    Peacock, Thomas
    CHAOS, 2015, 25 (08)
  • [50] Linguistic Lyapunov reinforcement learning control for robotic manipulators
    Kumar, Abhishek
    Sharma, Rajneesh
    NEUROCOMPUTING, 2018, 272 : 84 - 95