Robust walking control of a lower limb rehabilitation exoskeleton coupled with a musculoskeletal model via deep reinforcement learning

被引:24
作者
Luo, Shuzhen [1 ,3 ]
Androwis, Ghaith [1 ,2 ]
Adamovich, Sergei [1 ]
Nunez, Erick [1 ]
Su, Hao [3 ,4 ]
Zhou, Xianlian [1 ]
机构
[1] New Jersey Inst Technol, Dept Biomed Engn, Newark, NJ 07102 USA
[2] Kessler Fdn, West Orange, NJ 07052 USA
[3] North Carolina State Univ, Dept Mech & Aerosp Engn, Lab Biomechatron & Intelligent Robot, Raleigh, NC 27695 USA
[4] Univ N Carolina, Joint NCSU UNC Dept Biomed Engn, Chapel Hill, NC 27599 USA
基金
美国国家科学基金会;
关键词
Robust walking control; Human-exoskeleton interactions; Muscular disorders; Deep reinforcement learning; MUSCLE; BIOMECHANICS;
D O I
10.1186/s12984-023-01147-2
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
BackgroundFew studies have systematically investigated robust controllers for lower limb rehabilitation exoskeletons (LLREs) that can safely and effectively assist users with a variety of neuromuscular disorders to walk with full autonomy. One of the key challenges for developing such a robust controller is to handle different degrees of uncertain human-exoskeleton interaction forces from the patients. Consequently, conventional walking controllers either are patient-condition specific or involve tuning of many control parameters, which could behave unreliably and even fail to maintain balance.MethodsWe present a novel, deep neural network, reinforcement learning-based robust controller for a LLRE based on a decoupled offline human-exoskeleton simulation training with three independent networks, which aims to provide reliable walking assistance against various and uncertain human-exoskeleton interaction forces. The exoskeleton controller is driven by a neural network control policy that acts on a stream of the LLRE's proprioceptive signals, including joint kinematic states, and subsequently predicts real-time position control targets for the actuated joints. To handle uncertain human interaction forces, the control policy is trained intentionally with an integrated human musculoskeletal model and realistic human-exoskeleton interaction forces. Two other neural networks are connected with the control policy network to predict the interaction forces and muscle coordination. To further increase the robustness of the control policy to different human conditions, we employ domain randomization during training that includes not only randomization of exoskeleton dynamics properties but, more importantly, randomization of human muscle strength to simulate the variability of the patient's disability. Through this decoupled deep reinforcement learning framework, the trained controller of LLREs is able to provide reliable walking assistance to patients with different degrees of neuromuscular disorders without any control parameter tuning.Results and conclusionA universal, RL-based walking controller is trained and virtually tested on a LLRE system to verify its effectiveness and robustness in assisting users with different disabilities such as passive muscles (quadriplegic), muscle weakness, or hemiplegic conditions without any control parameter tuning. Analysis of the RMSE for joint tracking, CoP-based stability, and gait symmetry shows the effectiveness of the controller. An ablation study also demonstrates the strong robustness of the control policy under large exoskeleton dynamic property ranges and various human-exoskeleton interaction forces. The decoupled network structure allows us to isolate the LLRE control policy network for testing and sim-to-real transfer since it uses only proprioception information of the LLRE (joint sensory state) as the input. Furthermore, the controller is shown to be able to handle different patient conditions without the need for patient-specific control parameter tuning.
引用
收藏
页数:19
相关论文
共 48 条
  • [1] Androwis GJ, 2017, 2017 INTERNATIONAL SYMPOSIUM ON WEARABLE ROBOTICS AND REHABILITATION (WEROB), P93
  • [2] [Anonymous], WANDERCRAFT ATALANTE
  • [3] [Anonymous], BIONICS R REX TECHNO
  • [4] Robot Assisted Gait Training With Active Leg Exoskeleton (ALEX)
    Banala, Sai K.
    Kim, Seok Hun
    Agrawal, Sunil K.
    Scholz, John P.
    [J]. IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2009, 17 (01) : 2 - 8
  • [5] Baraff D., 1994, Computer Graphics Proceedings. Annual Conference Series 1994. SIGGRAPH 94 Conference Proceedings, P23, DOI 10.1145/192161.192168
  • [6] Review of control strategies for lower-limb exoskeletons to assist gait
    Baud, Romain
    Manzoori, Ali Reza
    Ijspeert, Auke
    Bouri, Mohamed
    [J]. JOURNAL OF NEUROENGINEERING AND REHABILITATION, 2021, 18 (01)
  • [7] Baud R, 2019, INT C REHAB ROBOT, P849, DOI [10.1109/ICORR.2019.8779440, 10.1109/icorr.2019.8779440]
  • [8] Can Momentum-Based Control Predict Human Balance Recovery Strategies?
    Bayon, C.
    Emmens, A. R.
    Afschrift, M.
    Van Wouwe, T.
    Keemink, A. Q. L.
    van der Kooij, H.
    van Asseldonk, E. H. F.
    [J]. IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2020, 28 (09) : 2015 - 2024
  • [9] Bengio Y., 2010, P 13 INT C ART INT S, P249, DOI DOI 10.4236/JSIP.2015.62006
  • [10] Bin Peng X, 2020, Arxiv, DOI [arXiv:2004.00784, 10.48550/arXiv.2004.00784]