Automatic reinforcement for robust model-free neurocontrol of robots without persistent excitation

被引：0

作者：

Pantoja-Garcia, Luis ^{[1
]}

Parra-Vega, Vicente ^{[1
,3
]}

Garcia-Rodriguez, Rodolfo ^{[2
]}

机构：

[1] Ctr Res & Adv Studies, Robot & Adv Mfg, Saltillo, Mexico

[2] Univ Politecn Metropolitana Hidalgo, Aeronaut Engn Dept, Postgrad Program Aerosp Engn, Tolcayuca, Mexico

[3] Ave Ind Met 1062, Ramos Arizpe 25903, Mexico

来源：

INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING | 2024年 / 38卷 / 01期

关键词：

automatic reinforced learning; model-free control; neurocontrol; persistent excitation; robot manipulators; TRACKING CONTROL; ADAPTIVE-CONTROL; MANIPULATOR CONTROL; NONLINEAR-SYSTEMS; NEURAL-NETWORKS; APPROXIMATION; FEEDBACK; CONVERGENCE;

D O I：

10.1002/acs.3697

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Model-based adaptive control suffers over parametrization from the many adaptive parameters compared to the order of system dynamics, leading to sluggish tracking with a poor adaptation transient without robustness. Likewise, adaptive model-free neurocontrol that relies on the Stone-Weierstrass theorem also suffers from similar problems in addition to over-fitting to approximate inverse dynamics. This article proposes a novel reinforced adaptive mechanism to guarantee a transient and robustness for the model-free adaptive control of nonlinear Lagrangian systems. Inspired by the symbiosis of Actor-Critic (AC) architecture and integral sliding modes, the reinforced stage neural network, analogous to the critic, injects excitation signals to reinforce the parametric learning of the adaptive stage neural network, analogous to the actor to improve the approximation of inverse dynamics. The underlying integral sliding surface error drives improved learning onto a low-dimensional invariant manifold to guarantee local exponential convergence of tracking errors. Lyapunov stability substantiates the robustness with an improved transient response. Our proposal stands for a hybrid approach between AC and neurocontrol, where the reinforced stage does not require a value function nor reward to provide automatic reinforcement to the adaptive stage parametric adaptation. Dynamic simulations are presented for a nonlinear robot manipulator under different conditions.

引用

页码：221 / 236

页数：16

共 50 条

[41] Breaking the sample complexity barrier to regret-optimal model-free reinforcement learning
Li, Gen
Shi, Laixi
Chen, Yuxin
Chi, Yuejie
INFORMATION AND INFERENCE-A JOURNAL OF THE IMA, 2023, 12 (02) : 969 - 1043
[42] Robust Model-Free Identification of the Causal Networks Underlying Complex Nonlinear Systems
Yang, Guanxue
Lei, Shimin
Yang, Guanxiao
ENTROPY, 2024, 26 (12)
[43] Model-Free Attitude Control of Quadcopter using Disturbance Observer and Integral Reinforcement Learning
Lee, Hanna
Kim, Youdan
AIAA SCITECH 2024 FORUM, 2024,
[44] Comparative study of model-based and model-free reinforcement learning control performance in HVAC systems
Gao, Cheng
Wang, Dan
JOURNAL OF BUILDING ENGINEERING, 2023, 74
[45] Model-free data reconstruction of structural response and excitation via sequential broad learning
Kuok, Sin-Chi
Yuen, Ka-Veng
MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2020, 141
[46] Model-based and model-free control of flexible-link robots: A comparison between representative methods
Rigatos, Gerasimos G.
APPLIED MATHEMATICAL MODELLING, 2009, 33 (10) : 3906 - 3925
[47] Trajectory Tracking Control Of Omnidirectional Mobile Robots: A Model-Free Control-based Apprroach
Tuan, Pham Anh
Linh, Nguyen Manh
JOURNAL OF APPLIED SCIENCE AND ENGINEERING, 2024, 27 (12): : 3687 - 3696
[48] A robust inventory management in dynamic supply chains using an adaptive model-free control
Nya, Danielle Nyakam
Abouaissa, Hassane
COMPUTERS & CHEMICAL ENGINEERING, 2023, 179
[49] Model-Free Linear Noncausal Optimal Control of Wave Energy Converters via Reinforcement Learning
Zhan, Siyuan
Ringwood, John V.
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2024, 32 (06) : 2164 - 2177
[50] Model-free aperiodic tracking for discrete-time systems using hierarchical reinforcement learning
Tian, Yingqiang
Wan, Haiying
Karimi, Hamid Reza
Luan, Xiaoli
Liu, Fei
NEUROCOMPUTING, 2024, 609

← 1 2 3 4 5 →