Robotic Knee Tracking Control to Mimic the Intact Human Knee Profile Based on Actor-Critic Reinforcement Learning

被引:23
作者
Wu, Ruofan [1 ]
Yao, Zhikai [1 ]
Si, Jennie [1 ]
Huang, He Helen [2 ,3 ]
机构
[1] Arizona State Univ, Sch Elect Comp & Energy Engn, Tempe, AZ 85287 USA
[2] North Carolina State Univ, Dept Biomed Engn, Raleigh, NC 27695 USA
[3] Univ N Carolina, Chapel Hill, NC 27599 USA
基金
美国国家科学基金会;
关键词
Automatic tracking of intact knee; configuration of robotic knee prosthesis; direct heuristic dynamic programming (dHDP); reinforcement learning control; IMPEDANCE CONTROL; MOBILE ROBOTS; LOCOMOTION; STABILIZATION; WALKING; SYSTEM; PHASE; GAIT;
D O I
10.1109/JAS.2021.1004272
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We address a state-of-the-art reinforcement learning (RL) control approach to automatically configure robotic prosthesis impedance parameters to enable end-to-end, continuous locomotion intended for transfemoral amputee subjects. Specifically, our actor-critic based RL provides tracking control of a robotic knee prosthesis to mimic the intact knee profile, This is a significant advance from our previous RL based automatic tuning of prosthesis control parameters which have centered on regulation control with a designer prescribed robotic knee profile as the target. In addition to presenting the tracking control algorithm based on direct heuristic dynamic programming (dHDP), we provide a control performance guarantee including the case of constrained inputs. We show that our proposed tracking control possesses several important properties, such as weight convergence of the learning networks, Bellman (sub) optimality of the cost-to-go value function and control input, and practical stability of the human-robot system. We further provide a systematic simulation of the proposed tracking control using a realistic human-robot system simulator, the OpenSim, to emulate how the dHDP enables level ground walking, walking on different terrains and at different paces. These results show that our proposed dHDP based tracking control is not only theoretically suitable, but also practically useful.
引用
收藏
页码:19 / 30
页数:12
相关论文
共 50 条
[1]  
[Anonymous], 1990, Nonlinear Dynamical Control Systems, DOI 10.1007/978-1-4757-2101-0_13
[2]  
[Anonymous], 2013, Nonlinear control systems
[3]   Design and Control of a New Biomimetic Transfemoral Knee Prosthesis Using an Echo-Control Scheme [J].
Bernal-Torres, Mario G. ;
Medellin-Castillo, Hugo I. ;
Arellano-Gonzalez, Juan C. .
JOURNAL OF HEALTHCARE ENGINEERING, 2018, 2018
[4]   Modulation of phase durations, phase variations, and temporal coordination of the four limbs during quadrupedal split-belt locomotion in intact adult cats [J].
D'Angelo, Giuseppe ;
Thibaudier, Yann ;
Telonio, Alessandro ;
Hurteau, Marie-France ;
Kuczynski, Victoria ;
Dambreville, Charline ;
Frigon, Alain .
JOURNAL OF NEUROPHYSIOLOGY, 2014, 112 (08) :1825-1837
[5]   OpenSim: open-source software to create and analyze dynamic Simulations of movement [J].
Delp, Scott L. ;
Anderson, Frank C. ;
Arnold, Allison S. ;
Loan, Peter ;
Habib, Ayman ;
John, Chand T. ;
Guendelman, Eran ;
Thelen, Darryl G. .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2007, 54 (11) :1940-1950
[6]   Simultaneous tracking and stabilization of mobile robots: An adaptive approach [J].
Do, KD ;
Jiang, ZP ;
Pan, J .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2004, 49 (07) :1147-1152
[7]   Control of a Powered Ankle-Foot Prosthesis Based on a Neuromuscular Model [J].
Eilenberg, Michael F. ;
Geyer, Hartmut ;
Herr, Hugh .
IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2010, 18 (02) :164-173
[8]   Apache helicopter stabilization using neural dynamic programming [J].
Enns, R ;
Si, J .
JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2002, 25 (01) :19-25
[9]   Helicopter trimming and tracking control using direct neural dynamic programming [J].
Enns, R ;
Si, J .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2003, 14 (04) :929-939
[10]  
Enns R, 2000, IEEE DECIS CONTR P, P1754, DOI 10.1109/CDC.2000.912115