Comparison of Empirical and Reinforcement Learning (RL)-Based Control Based on Proximal Policy Optimization (PPO) for Walking Assistance: Does AI Always Win?

被引:1
作者
Drewing, Nadine [1 ]
Ahmadi, Arjang [1 ]
Xiong, Xiaofeng [2 ]
Sharbafi, Maziar Ahmad [1 ]
机构
[1] Tech Univ Darmstadt, Inst Sport, Dept Human Sci, D-64289 Darmstadt, Germany
[2] Univ Southern Denmark SDU, Maerisk Mc Kinney Moller Inst, SDU Biorobot, DK-5230 Odense, Denmark
关键词
wearable assistive device; exosuit; exo control; reinforcement learning; PPO; EMG;
D O I
10.3390/biomimetics9110665
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The use of wearable assistive devices is growing in both industrial and medical fields. Combining human expertise and artificial intelligence (AI), e.g., in human-in-the-loop-optimization, is gaining popularity for adapting assistance to individuals. Amidst prevailing assertions that AI could surpass human capabilities in customizing every facet of support for human needs, our study serves as an initial step towards such claims within the context of human walking assistance. We investigated the efficacy of the Biarticular Thigh Exosuit, a device designed to aid human locomotion by mimicking the action of the hamstrings and rectus femoris muscles using Serial Elastic Actuators. Two control strategies were tested: an empirical controller based on human gait knowledge and empirical data and a control optimized using Reinforcement Learning (RL) on a neuromuscular model. The performance results of these controllers were assessed by comparing muscle activation in two assisted and two unassisted walking modes. Results showed that both controllers reduced hamstring muscle activation and improved the preferred walking speed, with the empirical controller also decreasing gastrocnemius muscle activity. However, the RL-based controller increased muscle activity in the vastus and rectus femoris, indicating that RL-based enhancements may not always improve assistance without solid empirical support.
引用
收藏
页数:19
相关论文
共 46 条
[31]  
Schulman J, 2018, Arxiv, DOI [arXiv:1506.02438, 10.48550/arXiv.1506.02438]
[32]  
Schulman J, 2017, Arxiv, DOI arXiv:1707.06347
[33]   Biarticular muscles are most responsive to upper-body pitch perturbations in human standing [J].
Schumacher, Christian ;
Berry, Andrew ;
Lemus, Daniel ;
Rode, Christian ;
Seyfarth, Andre ;
Vallery, Heike .
SCIENTIFIC REPORTS, 2019, 9 (1)
[34]  
Schumacher P, 2023, Arxiv, DOI [arXiv:2309.02976, 10.48550/arXiv.2309.02976]
[35]   OpenSim: Simulating musculoskeletal dynamics and neuromuscular control to study human and animal movement [J].
Seth, Ajay ;
Hicks, Jennifer L. ;
Uchida, Thomas K. ;
Habib, Ayman ;
Dembia, Christopher L. ;
Dunne, James J. ;
Ong, Carmichael F. ;
DeMers, Matthew S. ;
Rajagopal, Apoorva ;
Millard, Matthew ;
Hamner, Samuel R. ;
Arnold, Edith M. ;
Yong, Jennifer R. ;
Lakshmikanth, Shrinidhi K. ;
Sherman, Michael A. ;
Ku, Joy P. ;
Delp, Scott L. .
PLOS COMPUTATIONAL BIOLOGY, 2018, 14 (07)
[36]   Leg Force Control Through Biarticular Muscles for Human Walking Assistance [J].
Sharbafi, Maziar A. ;
Barazesh, Hamid ;
Iranikhah, Majid ;
Seyfarth, Andre .
FRONTIERS IN NEUROROBOTICS, 2018, 12
[37]   A neural circuitry that emphasizes spinal feedback generates diverse behaviours of human locomotion [J].
Song, Seungmoon ;
Geyer, Hartmut .
JOURNAL OF PHYSIOLOGY-LONDON, 2015, 593 (16) :3493-3511
[38]   Empirical assessment of dynamic hamstring function during human walking [J].
Thelen, Darryl G. ;
Lenz, Amy L. ;
Francis, Carrie ;
Lenhart, Rachel L. ;
Hernandez, Antonio .
JOURNAL OF BIOMECHANICS, 2013, 46 (07) :1255-1261
[39]   Passive and accurate torque control of series elastic actuators [J].
Vallery, Heike ;
Ekkelenkamp, Ralf ;
van der Kooij, Herman ;
Buss, Martin .
2007 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-9, 2007, :3534-3534
[40]   Artificial Intelligence-Based Wearable Robotic Exoskeletons for Upper Limb Rehabilitation: A Review [J].
Velez-Guerrero, Manuel Andres ;
Callejas-Cuervo, Mauro ;
Mazzoleni, Stefano .
SENSORS, 2021, 21 (06) :1-30