OPEN-LOOP OPTIMAL CONTROL FOR TRACKING A REFERENCE SIGNAL WITH APPROXIMATE DYNAMIC PROGRAMMING

被引:0
|
作者
Diaz, Jorge A. [1 ]
Xu, Lei [2 ]
Sardarmehni, Tohid [3 ]
机构
[1] Univ Texas Rio Grande Valley, Dept Mech Engn, Edinburg, TX 78539 USA
[2] Kent State Univ, Dept Comp Sci, Kent, OH 44242 USA
[3] Calif State Univ Northridge, Dept Mech Engn, Northridge, CA 91330 USA
来源
PROCEEDINGS OF ASME 2022 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, IMECE2022, VOL 5 | 2022年
基金
美国国家科学基金会;
关键词
optimal control; approximate dynamic programming; dynamic programming; neural networks;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Dynamic programming (DP) provides a systematic, closed-loop solution for optimal control problems. However, it suffers from the curse of dimensionality in higher orders. Approximate dynamic programming (ADP) methods can remedy this by finding near-optimal rather than exact optimal solutions. In summary, ADP uses function approximators, such as neural networks, to approximate optimal control solutions. ADP can then converge to the near-optimal solution using techniques such as reinforcement learning (RL). The two main challenges in using this approach are finding a proper training domain and selecting a suitable neural network architecture for precisely approximating the solutions with RL. Users select the training domain and the neural networks mostly by trial and error, which is tedious and time-consuming. This paper proposes trading the closed-loop solution provided by ADP methods for more effectively selecting the domain of training. To do so, we train a neural network using a small and moving domain around the reference signal. We asses the method's effectiveness by applying it to a widely used benchmark problem, the Van der Pol oscillator; and a real-world problem, controlling a quadrotor to track a reference trajectory. Simulation results demonstrate comparable performance to traditional methods while reducing computational requirements.
引用
收藏
页数:7
相关论文
共 50 条
  • [41] Methods for solution of large optimal control problems that bypass open-loop model reduction
    Bewley, Thomas
    Luchini, Paolo
    Pralits, Jan
    MECCANICA, 2016, 51 (12) : 2997 - 3014
  • [42] Open-loop optimal control of a flapping wing using an adjoint Lattice Boltzmann method
    Rutkowski, Mariusz
    Gryglas, Wojciech
    Szumbarski, Jacek
    Leonardi, Christopher
    Laniewski-Wollk, Lukasz
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2020, 79 (12) : 3547 - 3569
  • [43] Approximate Dynamic Programming with Gaussian Processes for Optimal Control of Continuous-Time Nonlinear Systems
    Beppu, Hirofumi
    Maruta, Ichiro
    Fujimoto, Kenji
    IFAC PAPERSONLINE, 2020, 53 (02): : 6715 - 6722
  • [44] A Novel Approximate Dynamic Programming Structure for Optimal Control of Discrete-Time Time-Varying Nonlinear Systems
    Sun, Jiayue
    Xu, Zhiming
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (08) : 3835 - 3839
  • [45] Optimal Tracking in Switched Systems With Free Final Time and Fixed Mode Sequence Using Approximate Dynamic Programming
    Sardarmehni, Tohid
    Song, Xingyong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (07) : 3460 - 3472
  • [46] Adaptive polyhedral meshing for approximate dynamic programming in control
    Sala, Antonio
    Armesto, Leopoldo
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 107
  • [47] Approximate Dynamic Programming for Self-Learning Control
    Derong Liu Department of Electrical and Computer Engineering University of Illinois Chicago IL USA
    自动化学报, 2005, (01) : 13 - 18
  • [48] Convexified Open-Loop Stochastic Optimal Control for Linear Systems With Log-Concave Disturbances
    Sivaramakrishnan, Vignesh
    Vinod, Abraham P.
    Oishi, Meeko M. K.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2024, 69 (02) : 1249 - 1256
  • [49] Optimal Tracking Control for a Class of Nonlinear Discrete-Time Systems with Time Delays Based on Heuristic Dynamic Programming
    Zhang, Huaguang
    Song, Ruizhuo
    Wei, Qinglai
    Zhang, Tieyan
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2011, 22 (12): : 1851 - 1862
  • [50] Data-driven approximate optimal tracking control schemes for unknown non-affine non-linear multi-player systems via adaptive dynamic programming
    Jiang, He
    Luo, Yanhong
    ELECTRONICS LETTERS, 2017, 53 (07) : 465 - 467