OPEN-LOOP OPTIMAL CONTROL FOR TRACKING A REFERENCE SIGNAL WITH APPROXIMATE DYNAMIC PROGRAMMING

被引：0

作者：

Diaz, Jorge A. ^{[1
]}

Xu, Lei ^{[2
]}

Sardarmehni, Tohid ^{[3
]}

机构：

[1] Univ Texas Rio Grande Valley, Dept Mech Engn, Edinburg, TX 78539 USA

[2] Kent State Univ, Dept Comp Sci, Kent, OH 44242 USA

[3] Calif State Univ Northridge, Dept Mech Engn, Northridge, CA 91330 USA

来源：

PROCEEDINGS OF ASME 2022 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, IMECE2022, VOL 5 | 2022年

基金：

美国国家科学基金会;

关键词：

optimal control; approximate dynamic programming; dynamic programming; neural networks;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Dynamic programming (DP) provides a systematic, closed-loop solution for optimal control problems. However, it suffers from the curse of dimensionality in higher orders. Approximate dynamic programming (ADP) methods can remedy this by finding near-optimal rather than exact optimal solutions. In summary, ADP uses function approximators, such as neural networks, to approximate optimal control solutions. ADP can then converge to the near-optimal solution using techniques such as reinforcement learning (RL). The two main challenges in using this approach are finding a proper training domain and selecting a suitable neural network architecture for precisely approximating the solutions with RL. Users select the training domain and the neural networks mostly by trial and error, which is tedious and time-consuming. This paper proposes trading the closed-loop solution provided by ADP methods for more effectively selecting the domain of training. To do so, we train a neural network using a small and moving domain around the reference signal. We asses the method's effectiveness by applying it to a widely used benchmark problem, the Van der Pol oscillator; and a real-world problem, controlling a quadrotor to track a reference trajectory. Simulation results demonstrate comparable performance to traditional methods while reducing computational requirements.

引用

页数：7

共 50 条

[31] Approximate optimal control for an uncertain robot based on adaptive dynamic programming
Kong, Linghuan
Zhang, Shuang
Yu, Xinbo
NEUROCOMPUTING, 2021, 423 : 308 - 317
[32] Approximate Dynamic Programming Recurrence Relations for a Hybrid Optimal Control Problem
Lu, W.
Ferrari, S.
Fierro, R.
Wettergren, T. A.
UNMANNED SYSTEMS TECHNOLOGY XIV, 2012, 8387
[33] Optimal open-loop desynchronization of neural oscillator populations
Dan Wilson
Journal of Mathematical Biology, 2020, 81 : 25 - 64
[34] Optimal open-loop desynchronization of neural oscillator populations
Wilson, Dan
JOURNAL OF MATHEMATICAL BIOLOGY, 2020, 81 (01) : 25 - 64
[35] A Hybrid Adaptive Dynamic Programming for Optimal Tracking Control of USVs
Xue, Shan
Zhao, Ning
Zhang, Weidong
Luo, Biao
Liu, Derong
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025,
[36] Constraint handling optimal PI control of open-loop unstable process: Analytical approach
Tchamna, Rodrigue
Lee, Moonyong
KOREAN JOURNAL OF CHEMICAL ENGINEERING, 2017, 34 (12) : 3067 - 3076
[37] Methods for solution of large optimal control problems that bypass open-loop model reduction
Thomas Bewley
Paolo Luchini
Jan Pralits
Meccanica, 2016, 51 : 2997 - 3014
[38] Trajectory Optimization of Flexible Mobile Manipulators Using Open-Loop Optimal Control Method
Korayem, M. H.
Nohooji, H. Rahimi
INTELLIGENT ROBOTICS AND APPLICATIONS, PT I, PROCEEDINGS, 2008, 5314 : 54 - 63
[39] Open-loop optimal control of batch chromatographic separation processes using direct collocation
Holmqvist, A.
Magnusson, F.
JOURNAL OF PROCESS CONTROL, 2016, 46 : 55 - 74
[40] Constraint handling optimal PI control of open-loop unstable process: Analytical approach
Rodrigue Tchamna
Moonyong Lee
Korean Journal of Chemical Engineering, 2017, 34 : 3067 - 3076

← 1 2 3 4 5 →