OPEN-LOOP OPTIMAL CONTROL FOR TRACKING A REFERENCE SIGNAL WITH APPROXIMATE DYNAMIC PROGRAMMING

被引:0
|
作者
Diaz, Jorge A. [1 ]
Xu, Lei [2 ]
Sardarmehni, Tohid [3 ]
机构
[1] Univ Texas Rio Grande Valley, Dept Mech Engn, Edinburg, TX 78539 USA
[2] Kent State Univ, Dept Comp Sci, Kent, OH 44242 USA
[3] Calif State Univ Northridge, Dept Mech Engn, Northridge, CA 91330 USA
来源
PROCEEDINGS OF ASME 2022 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, IMECE2022, VOL 5 | 2022年
基金
美国国家科学基金会;
关键词
optimal control; approximate dynamic programming; dynamic programming; neural networks;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Dynamic programming (DP) provides a systematic, closed-loop solution for optimal control problems. However, it suffers from the curse of dimensionality in higher orders. Approximate dynamic programming (ADP) methods can remedy this by finding near-optimal rather than exact optimal solutions. In summary, ADP uses function approximators, such as neural networks, to approximate optimal control solutions. ADP can then converge to the near-optimal solution using techniques such as reinforcement learning (RL). The two main challenges in using this approach are finding a proper training domain and selecting a suitable neural network architecture for precisely approximating the solutions with RL. Users select the training domain and the neural networks mostly by trial and error, which is tedious and time-consuming. This paper proposes trading the closed-loop solution provided by ADP methods for more effectively selecting the domain of training. To do so, we train a neural network using a small and moving domain around the reference signal. We asses the method's effectiveness by applying it to a widely used benchmark problem, the Van der Pol oscillator; and a real-world problem, controlling a quadrotor to track a reference trajectory. Simulation results demonstrate comparable performance to traditional methods while reducing computational requirements.
引用
收藏
页数:7
相关论文
共 50 条
  • [31] Approximate optimal control for an uncertain robot based on adaptive dynamic programming
    Kong, Linghuan
    Zhang, Shuang
    Yu, Xinbo
    NEUROCOMPUTING, 2021, 423 : 308 - 317
  • [32] Approximate Dynamic Programming Recurrence Relations for a Hybrid Optimal Control Problem
    Lu, W.
    Ferrari, S.
    Fierro, R.
    Wettergren, T. A.
    UNMANNED SYSTEMS TECHNOLOGY XIV, 2012, 8387
  • [33] Optimal open-loop desynchronization of neural oscillator populations
    Dan Wilson
    Journal of Mathematical Biology, 2020, 81 : 25 - 64
  • [34] Optimal open-loop desynchronization of neural oscillator populations
    Wilson, Dan
    JOURNAL OF MATHEMATICAL BIOLOGY, 2020, 81 (01) : 25 - 64
  • [35] A Hybrid Adaptive Dynamic Programming for Optimal Tracking Control of USVs
    Xue, Shan
    Zhao, Ning
    Zhang, Weidong
    Luo, Biao
    Liu, Derong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025,
  • [36] Constraint handling optimal PI control of open-loop unstable process: Analytical approach
    Tchamna, Rodrigue
    Lee, Moonyong
    KOREAN JOURNAL OF CHEMICAL ENGINEERING, 2017, 34 (12) : 3067 - 3076
  • [37] Methods for solution of large optimal control problems that bypass open-loop model reduction
    Thomas Bewley
    Paolo Luchini
    Jan Pralits
    Meccanica, 2016, 51 : 2997 - 3014
  • [38] Trajectory Optimization of Flexible Mobile Manipulators Using Open-Loop Optimal Control Method
    Korayem, M. H.
    Nohooji, H. Rahimi
    INTELLIGENT ROBOTICS AND APPLICATIONS, PT I, PROCEEDINGS, 2008, 5314 : 54 - 63
  • [39] Open-loop optimal control of batch chromatographic separation processes using direct collocation
    Holmqvist, A.
    Magnusson, F.
    JOURNAL OF PROCESS CONTROL, 2016, 46 : 55 - 74
  • [40] Constraint handling optimal PI control of open-loop unstable process: Analytical approach
    Rodrigue Tchamna
    Moonyong Lee
    Korean Journal of Chemical Engineering, 2017, 34 : 3067 - 3076