A Novel Deep Reinforcement Learning Based Framework for Gait Adjustment

被引：2

作者：

Li, Ang ^{[1
,2
]}

Chen, Jianping ^{[2
,3
]}

Fu, Qiming ^{[1
,2
]}

Wu, Hongjie ^{[1
,2
]}

Wang, Yunzhe ^{[1
,2
]}

Lu, You ^{[1
,2
]}

机构：

[1] Suzhou Univ Sci & Technol, Sch Elect & Informat Engn, Suzhou 215009, Peoples R China

[2] Suzhou Univ Sci & Technol, Jiangsu Prov Key Lab Intelligent Bldg Energy Effic, Suzhou 215009, Peoples R China

[3] Suzhou Univ Sci & Technol, Sch Architecture & Urban Planning, Suzhou 215009, Peoples R China

来源：

MATHEMATICS | 2023年 / 11卷 / 01期

基金：

国家重点研发计划; 中国国家自然科学基金;

关键词：

deep reinforcement learning; attention mechanism; state reconstruction; gait adjustment;

D O I：

10.3390/math11010178

中图分类号：

O1 [数学];

学科分类号：

0701 ; 070101 ;

摘要：

Nowadays, millions of patients suffer from physical disabilities, including lower-limb disabilities. Researchers have adopted a variety of physical therapies based on the lower-limb exoskeleton, in which it is difficult to adjust equipment parameters in a timely fashion. Therefore, intelligent control methods, for example, deep reinforcement learning (DRL), have been used to control the medical equipment used in human gait adjustment. In this study, based on the key-value attention mechanism, we reconstructed the agent's observations by capturing the self-dependent feature information for decision-making in regard to each state sampled from the replay buffer. Moreover, based on Softmax Deep Double Deterministic policy gradients (SD3), a novel DRL-based framework, key-value attention-based SD3 (AT_SD3), has been proposed for gait adjustment. We demonstrated the effectiveness of our proposed framework in gait adjustment by comparing different gait trajectories, including the desired trajectory and the adjusted trajectory. The results showed that the simulated trajectories were closer to the desired trajectory, both in their shapes and values. Furthermore, by comparing the results of our experiments with those of other state-of-the-art methods, the results proved that our proposed framework exhibited better performance.

引用

页数：18

共 27 条

[1] Baldi P., 2011, INT C UNS TRANSF LEA
[2] Recent developments and challenges of lower extremity exoskeletons
Chen, Bing
Ma, Hao
Qin, Lai-Yin
Gao, Fei
Chan, Kai-Ming
Law, Sheung-Wai
Qin, Ling
Liao, Wei-Hsin
[J]. JOURNAL OF ORTHOPAEDIC TRANSLATION, 2016, 5 : 26 - 37
[3] Chinimilli PT, 2019, 2019 WEARABLE ROBOTICS ASSOCIATION CONFERENCE (WEARRACON), P92, DOI [10.1109/wearracon.2019.8719628, 10.1109/WEARRACON.2019.8719628]
[4] Ciosek K, 2019, ADV NEUR IN, V32
[5] Fortunato M., 2017, ARXIV
[6] Fujimoto S, 2018, PR MACH LEARN RES, V80
[7] A class-specific mean vector-based weighted competitive and collaborative representation method for classification
Gou, Jianping
He, Xin
Lu, Junyu
Ma, Hongxing
Ou, Weihua
Yuan, Yunhao
[J]. NEURAL NETWORKS, 2022, 150 : 12 - 27
[8] Human-robot interactive control based on reinforcement learning for gait rehabilitation training robot
Guo Bingjing
Han Jianhai
Li Xiangpan
Yan Lin
[J]. INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2019, 16 (02)
[9] Haarnoja T, 2018, PR MACH LEARN RES, V80
[10] Control of a robotic orthosis for gait rehabilitation
Hussain, Shahid
Xie, Sheng Q.
Jamwal, Prashant K.
[J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2013, 61 (09) : 911 - 919

← 1 2 3 →