Automatic reinforcement for robust model-free neurocontrol of robots without persistent excitation

被引:0
|
作者
Pantoja-Garcia, Luis [1 ]
Parra-Vega, Vicente [1 ,3 ]
Garcia-Rodriguez, Rodolfo [2 ]
机构
[1] Ctr Res & Adv Studies, Robot & Adv Mfg, Saltillo, Mexico
[2] Univ Politecn Metropolitana Hidalgo, Aeronaut Engn Dept, Postgrad Program Aerosp Engn, Tolcayuca, Mexico
[3] Ave Ind Met 1062, Ramos Arizpe 25903, Mexico
关键词
automatic reinforced learning; model-free control; neurocontrol; persistent excitation; robot manipulators; TRACKING CONTROL; ADAPTIVE-CONTROL; MANIPULATOR CONTROL; NONLINEAR-SYSTEMS; NEURAL-NETWORKS; APPROXIMATION; FEEDBACK; CONVERGENCE;
D O I
10.1002/acs.3697
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Model-based adaptive control suffers over parametrization from the many adaptive parameters compared to the order of system dynamics, leading to sluggish tracking with a poor adaptation transient without robustness. Likewise, adaptive model-free neurocontrol that relies on the Stone-Weierstrass theorem also suffers from similar problems in addition to over-fitting to approximate inverse dynamics. This article proposes a novel reinforced adaptive mechanism to guarantee a transient and robustness for the model-free adaptive control of nonlinear Lagrangian systems. Inspired by the symbiosis of Actor-Critic (AC) architecture and integral sliding modes, the reinforced stage neural network, analogous to the critic, injects excitation signals to reinforce the parametric learning of the adaptive stage neural network, analogous to the actor to improve the approximation of inverse dynamics. The underlying integral sliding surface error drives improved learning onto a low-dimensional invariant manifold to guarantee local exponential convergence of tracking errors. Lyapunov stability substantiates the robustness with an improved transient response. Our proposal stands for a hybrid approach between AC and neurocontrol, where the reinforced stage does not require a value function nor reward to provide automatic reinforcement to the adaptive stage parametric adaptation. Dynamic simulations are presented for a nonlinear robot manipulator under different conditions.
引用
收藏
页码:221 / 236
页数:16
相关论文
共 50 条
  • [41] Breaking the sample complexity barrier to regret-optimal model-free reinforcement learning
    Li, Gen
    Shi, Laixi
    Chen, Yuxin
    Chi, Yuejie
    INFORMATION AND INFERENCE-A JOURNAL OF THE IMA, 2023, 12 (02) : 969 - 1043
  • [42] Robust Model-Free Identification of the Causal Networks Underlying Complex Nonlinear Systems
    Yang, Guanxue
    Lei, Shimin
    Yang, Guanxiao
    ENTROPY, 2024, 26 (12)
  • [43] Model-Free Attitude Control of Quadcopter using Disturbance Observer and Integral Reinforcement Learning
    Lee, Hanna
    Kim, Youdan
    AIAA SCITECH 2024 FORUM, 2024,
  • [44] Comparative study of model-based and model-free reinforcement learning control performance in HVAC systems
    Gao, Cheng
    Wang, Dan
    JOURNAL OF BUILDING ENGINEERING, 2023, 74
  • [45] Model-free data reconstruction of structural response and excitation via sequential broad learning
    Kuok, Sin-Chi
    Yuen, Ka-Veng
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2020, 141
  • [46] Model-based and model-free control of flexible-link robots: A comparison between representative methods
    Rigatos, Gerasimos G.
    APPLIED MATHEMATICAL MODELLING, 2009, 33 (10) : 3906 - 3925
  • [47] Trajectory Tracking Control Of Omnidirectional Mobile Robots: A Model-Free Control-based Apprroach
    Tuan, Pham Anh
    Linh, Nguyen Manh
    JOURNAL OF APPLIED SCIENCE AND ENGINEERING, 2024, 27 (12): : 3687 - 3696
  • [48] A robust inventory management in dynamic supply chains using an adaptive model-free control
    Nya, Danielle Nyakam
    Abouaissa, Hassane
    COMPUTERS & CHEMICAL ENGINEERING, 2023, 179
  • [49] Model-Free Linear Noncausal Optimal Control of Wave Energy Converters via Reinforcement Learning
    Zhan, Siyuan
    Ringwood, John V.
    IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2024, 32 (06) : 2164 - 2177
  • [50] Model-free aperiodic tracking for discrete-time systems using hierarchical reinforcement learning
    Tian, Yingqiang
    Wan, Haiying
    Karimi, Hamid Reza
    Luan, Xiaoli
    Liu, Fei
    NEUROCOMPUTING, 2024, 609