Finite Time Lyapunov Exponent Analysis of Model Predictive Control and Reinforcement Learning

被引:0
|
作者
Krishna, Kartik [1 ]
Brunton, Steven L. [1 ]
Song, Zhuoyuan [2 ]
机构
[1] Univ Washington, Dept Mech Engn, Seattle, WA 98195 USA
[2] Univ Hawaii Manoa, Dept Mech Engn, Honolulu, HI 96822 USA
基金
美国国家科学基金会;
关键词
Optimal control; finite-time Lyapunov exponents; path planning; mobile sensors; dynamical systems; unsteady fluid dynamics; model predictive control; reinforcement learning; LAGRANGIAN COHERENT STRUCTURES; OPTIMAL TRAJECTORY GENERATION; WIND-DRIVEN; TRANSPORT; DEFINITION; VEHICLES; WAKE;
D O I
10.1109/ACCESS.2023.3326424
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Finite-time Lyapunov exponents (FTLEs) provide a powerful approach to compute time-varying analogs of invariant manifolds in unsteady fluid flow fields. These manifolds are useful to visualize the transport mechanisms of passive tracers advecting with the flow. However, many vehicles and mobile sensors are not passive, but are instead actuated according to some intelligent trajectory planning or control law; for example, model predictive control and reinforcement learning are often used to design energy-efficient trajectories in a dynamically changing background flow. In this work, we investigate the use of FTLE on such controlled agents to gain insight into optimal transport routes for navigation in known unsteady flows. We find that these controlled FTLE (cFTLE) coherent structures separate the flow field into different regions with similar costs of transport to the goal location. These separatrices are functions of the planning algorithm's hyper-parameters, such as the optimization time horizon and the cost of actuation. Computing the invariant sets and manifolds of active agent dynamics in dynamic flow fields is useful in the context of robust motion control, hyperparameter tuning, and determining safe and collision-free trajectories for autonomous systems. Moreover, these cFTLE structures provide insight into effective deployment locations for mobile agents with actuation and energy constraints to traverse the ocean or atmosphere.
引用
收藏
页码:118916 / 118930
页数:15
相关论文
共 50 条
  • [31] An experimental study of two predictive reinforcement learning methods and comparison with model-predictive control
    Dobriborsci, Dmitrii
    Osinenko, Pavel
    Aumer, Wolfgang
    IFAC PAPERSONLINE, 2022, 55 (10): : 1545 - 1550
  • [32] MODEL-FREE PREDICTIVE CONTROL OF NONLINEAR PROCESSES BASED ON REINFORCEMENT LEARNING
    Shah, Hitesh
    Gopal, M.
    IFAC PAPERSONLINE, 2016, 49 (01): : 89 - 94
  • [33] Model Predictive Control-Based Reinforcement Learning Using Expected Sarsa
    Moradimaryamnegari, Hoomaan
    Frego, Marco
    Peer, Angelika
    IEEE ACCESS, 2022, 10 : 81177 - 81191
  • [34] Comparison of reinforcement learning and model predictive control for building energy system optimization
    Wang, Dan
    Zheng, Wanfu
    Wang, Zhe
    Wang, Yaran
    Pang, Xiufeng
    Wang, Wei
    APPLIED THERMAL ENGINEERING, 2023, 228
  • [35] Multi-step Greedy Reinforcement Learning Based on Model Predictive Control
    Yang, Yucheng
    Lucia, Sergio
    IFAC PAPERSONLINE, 2021, 54 (03): : 699 - 705
  • [36] Optimization of the model predictive control meta-parameters through reinforcement learning
    Bohn, Eivind
    Gros, Sebastien
    Moe, Signe
    Johansen, Tor Arne
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
  • [37] Hierarchical Evasive Path Planning Using Reinforcement Learning and Model Predictive Control
    Feher, Arpad
    Aradi, Szilard
    Becsi, Tamas
    IEEE ACCESS, 2020, 8 : 187470 - 187482
  • [38] Model Predictive Control and Reinforcement Learning: A Unified Framework Based on Dynamic Programming
    Bertsekas, Dimitri P.
    IFAC PAPERSONLINE, 2024, 58 (18): : 363 - 383
  • [39] Quantitative comparison of reinforcement learning and data-driven model predictive control for chemical and biological processes
    Oh, Tae Hoon
    COMPUTERS & CHEMICAL ENGINEERING, 2024, 181
  • [40] Interactive Computation and Rendering of Finite-Time Lyapunov Exponent Fields
    Barakat, Samer
    Garth, Christoph
    Tricoche, Xavier
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2012, 18 (08) : 1368 - 1380