One-Shot Bipedal Robot Dynamics Identification With a Reservoir-Based RNN

被引:4
作者
Folgheraiter, Michele [1 ]
Yskak, Asset [1 ]
Yessirkepov, Sharafatdin [1 ]
机构
[1] Nazarbayev Univ, Sch Engn & Digital Sci, Dept Robot, Astana 01000, Kazakhstan
关键词
Legged locomotion; Robot sensing systems; Nonlinear systems; Adaptation models; Computer architecture; Computational modeling; Recurrent neural networks; One-shot nonlinear model identification; bipedal robot inverse pendulum model; reservoir-based RNN; echo state machine; liquid state machine; long short-term memory RNN; RNN real-time implementation; NEURAL-NETWORKS; MODEL;
D O I
10.1109/ACCESS.2023.3277977
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The nonlinear inverted pendulum model of a lightweight bipedal robot is identified in real-time using a reservoir-based Recurrent Neural Network (RNN). The adaptation occurs online, while a disturbance force is repeatedly applied to the robot body. The hyperparameters of the model, such as the number of neurons, connection sparsity, and number of neurons receiving feedback from the readout unit, were initialized to reduce the complexity of the RNN while preserving good performance. The convergence of the adaptation algorithm was numerically proved based on Lyapunov stability criteria. Results demonstrate that, by using a standard Recursive Least Squares (RLS) algorithm to adapt the network parameters, the learning process requires only few examples of the disturbance response. A Mean Squared Error (MSE) of 0.0048, on a normalized validation set, is obtained when 13 instances of the impulse response are used for training the RNN. As a comparison, a linear Auto Regressive eXogenous (ARX) model with the same number of adaptive parameters obtained a MSE of 0.0181, while a more sophisticated Neural Network Auto Regressive eXogenous model (NNARX), having ten time more adaptive parameters, reached a MSE of 0.0079. If only one example, one-shot, is used for identifying the RNN model, the MSE increases to 0.0329 while showing still good prediction capabilities. From a computational point of view, the RNN in combination with the RLS adaptation algorithm, presents a lower complexity compared with the NNARX model that uses the back propagation algorithm, which makes the reservoir-based RNN model more suitable for real-time applications.
引用
收藏
页码:50180 / 50194
页数:15
相关论文
共 45 条
[1]   A learning rule for very simple universal approximators consisting of a single layer of perceptrons [J].
Auer, Peter ;
Burgsteiner, Harald ;
Maass, Wolfgang .
NEURAL NETWORKS, 2008, 21 (05) :786-795
[2]  
Bana M., 2016, PROC 47 INT S ROBOT, P528
[3]  
Chollet F., 2015, Keras
[4]   full-FORCE: A target-based method for training recurrent networks [J].
DePasquale, Brian ;
Cueva, Christopher J. ;
Rajan, Kanaka ;
Escola, G. Sean ;
Abbott, L. F. .
PLOS ONE, 2018, 13 (02)
[5]   FINDING STRUCTURE IN TIME [J].
ELMAN, JL .
COGNITIVE SCIENCE, 1990, 14 (02) :179-211
[6]  
Folgheraiter M, 2018, IEEE-RAS INT C HUMAN, P232
[7]   Design and Modeling of a Lightweight and Low Power Consumption Full-Scale Biped Robot [J].
Folgheraiter, Michele ;
Aubakir, Bauyrzhan .
INTERNATIONAL JOURNAL OF HUMANOID ROBOTICS, 2018, 15 (05)
[8]   A combined B-spline-neural-network and ARX model for online identification of nonlinear dynamic actuation systems [J].
Folgheraiter, Michele .
NEUROCOMPUTING, 2016, 175 :433-442
[9]   Modelling greenhouse temperature using system identification by means of neural networks [J].
Frausto, HU ;
Pieters, JG .
NEUROCOMPUTING, 2004, 56 (1-4) :423-428
[10]  
Gers FA, 1999, IEE CONF PUBL, P850, DOI [10.1162/089976600300015015, 10.1049/cp:19991218]