A Novel Deep Reinforcement Learning Based Framework for Gait Adjustment

Cited by: 2
Authors
Li, Ang [1, 2]
Chen, Jianping [2, 3]
Fu, Qiming [1, 2]
Wu, Hongjie [1, 2]
Wang, Yunzhe [1, 2]
Lu, You [1, 2]
Affiliations
[1] Suzhou Univ Sci & Technol, Sch Elect & Informat Engn, Suzhou 215009, Peoples R China
[2] Suzhou Univ Sci & Technol, Jiangsu Prov Key Lab Intelligent Bldg Energy Effic, Suzhou 215009, Peoples R China
[3] Suzhou Univ Sci & Technol, Sch Architecture & Urban Planning, Suzhou 215009, Peoples R China
Funding
National Key Research and Development Program of China; National Natural Science Foundation of China;
Keywords
deep reinforcement learning; attention mechanism; state reconstruction; gait adjustment;
DOI
10.3390/math11010178
Chinese Library Classification number
O1 [Mathematics];
Discipline classification code
0701 ; 070101 ;
Abstract
Millions of patients suffer from physical disabilities, including lower-limb disabilities. Researchers have adopted a variety of physical therapies based on lower-limb exoskeletons, but adjusting the equipment parameters in a timely fashion remains difficult. Therefore, intelligent control methods such as deep reinforcement learning (DRL) have been used to control the medical equipment used in human gait adjustment. In this study, we used a key-value attention mechanism to reconstruct the agent's observations: for each state sampled from the replay buffer, the mechanism captures the self-dependent feature information relevant to decision-making. Building on Softmax Deep Double Deterministic policy gradients (SD3), we propose a novel DRL-based framework for gait adjustment, key-value attention-based SD3 (AT_SD3). We demonstrated the effectiveness of the proposed framework by comparing the adjusted gait trajectory with the desired trajectory. The results showed that the simulated trajectories were closer to the desired trajectory in both shape and value. Furthermore, comparisons with other state-of-the-art methods showed that the proposed framework performed better.
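The abstract does not give the attention architecture, so the following is only a minimal sketch of what reconstructing a sampled state with key-value attention could look like. It assumes each scalar feature of the state is treated as one token, and it uses scalar weights (`w_query`, `w_key`, `w_value`) as hypothetical stand-ins for learned projections. The function names are illustrative and are not taken from the paper.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def reconstruct_state(state, w_query, w_key, w_value):
    """Re-weight every feature of `state` by attention over all features.

    Assumption: each scalar feature is one token, and the three scalar
    weights stand in for learned query/key/value projections.
    """
    queries = [w_query * s for s in state]
    keys = [w_key * s for s in state]
    values = [w_value * s for s in state]

    reconstructed = []
    for q in queries:
        # Attention weights of this feature over all features (sum to 1).
        weights = softmax([q * k for k in keys])
        # New feature: attention-weighted combination of the values.
        reconstructed.append(sum(w * v for w, v in zip(weights, values)))
    return reconstructed


# Hypothetical state sampled from a replay buffer.
state = [0.5, -1.0, 2.0]
print(reconstruct_state(state, 1.0, 1.0, 1.0))
```

In a full agent the reconstructed state would be passed to the actor and critic networks in place of the raw observation. That wiring, and the SD3 update itself, are omitted here.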
Pages: 18