OPEN-LOOP OPTIMAL CONTROL FOR TRACKING A REFERENCE SIGNAL WITH APPROXIMATE DYNAMIC PROGRAMMING

被引：0

作者：

Diaz, Jorge A. ^{[1
]}

Xu, Lei ^{[2
]}

Sardarmehni, Tohid ^{[3
]}

机构：

[1] Univ Texas Rio Grande Valley, Dept Mech Engn, Edinburg, TX 78539 USA

[2] Kent State Univ, Dept Comp Sci, Kent, OH 44242 USA

[3] Calif State Univ Northridge, Dept Mech Engn, Northridge, CA 91330 USA

来源：

PROCEEDINGS OF ASME 2022 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, IMECE2022, VOL 5 | 2022年

基金：

美国国家科学基金会;

关键词：

optimal control; approximate dynamic programming; dynamic programming; neural networks;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Dynamic programming (DP) provides a systematic, closed-loop solution for optimal control problems. However, it suffers from the curse of dimensionality in higher orders. Approximate dynamic programming (ADP) methods can remedy this by finding near-optimal rather than exact optimal solutions. In summary, ADP uses function approximators, such as neural networks, to approximate optimal control solutions. ADP can then converge to the near-optimal solution using techniques such as reinforcement learning (RL). The two main challenges in using this approach are finding a proper training domain and selecting a suitable neural network architecture for precisely approximating the solutions with RL. Users select the training domain and the neural networks mostly by trial and error, which is tedious and time-consuming. This paper proposes trading the closed-loop solution provided by ADP methods for more effectively selecting the domain of training. To do so, we train a neural network using a small and moving domain around the reference signal. We asses the method's effectiveness by applying it to a widely used benchmark problem, the Van der Pol oscillator; and a real-world problem, controlling a quadrotor to track a reference trajectory. Simulation results demonstrate comparable performance to traditional methods while reducing computational requirements.

引用

页数：7

共 50 条

[41] Methods for solution of large optimal control problems that bypass open-loop model reduction
Bewley, Thomas
Luchini, Paolo
Pralits, Jan
MECCANICA, 2016, 51 (12) : 2997 - 3014
[42] Open-loop optimal control of a flapping wing using an adjoint Lattice Boltzmann method
Rutkowski, Mariusz
Gryglas, Wojciech
Szumbarski, Jacek
Leonardi, Christopher
Laniewski-Wollk, Lukasz
COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2020, 79 (12) : 3547 - 3569
[43] Approximate Dynamic Programming with Gaussian Processes for Optimal Control of Continuous-Time Nonlinear Systems
Beppu, Hirofumi
Maruta, Ichiro
Fujimoto, Kenji
IFAC PAPERSONLINE, 2020, 53 (02): : 6715 - 6722
[44] A Novel Approximate Dynamic Programming Structure for Optimal Control of Discrete-Time Time-Varying Nonlinear Systems
Sun, Jiayue
Xu, Zhiming
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (08) : 3835 - 3839
[45] Optimal Tracking in Switched Systems With Free Final Time and Fixed Mode Sequence Using Approximate Dynamic Programming
Sardarmehni, Tohid
Song, Xingyong
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (07) : 3460 - 3472
[46] Adaptive polyhedral meshing for approximate dynamic programming in control
Sala, Antonio
Armesto, Leopoldo
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 107
[47] Approximate Dynamic Programming for Self-Learning Control
Derong Liu Department of Electrical and Computer Engineering University of Illinois Chicago IL USA
自动化学报, 2005, (01) : 13 - 18
[48] Convexified Open-Loop Stochastic Optimal Control for Linear Systems With Log-Concave Disturbances
Sivaramakrishnan, Vignesh
Vinod, Abraham P.
Oishi, Meeko M. K.
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2024, 69 (02) : 1249 - 1256
[49] Optimal Tracking Control for a Class of Nonlinear Discrete-Time Systems with Time Delays Based on Heuristic Dynamic Programming
Zhang, Huaguang
Song, Ruizhuo
Wei, Qinglai
Zhang, Tieyan
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2011, 22 (12): : 1851 - 1862
[50] Data-driven approximate optimal tracking control schemes for unknown non-affine non-linear multi-player systems via adaptive dynamic programming
Jiang, He
Luo, Yanhong
ELECTRONICS LETTERS, 2017, 53 (07) : 465 - 467

← 1 2 3 4 5 →