Learning assistive strategies for exoskeleton robots from user-robot physical interaction

被引:56
作者
Hamaya, Masashi [1 ,2 ]
Matsubara, Takamitsu [1 ,3 ]
Noda, Tomoyuki [1 ]
Teramae, Tatsuya [1 ]
Morimoto, Jun [1 ]
机构
[1] ATR CNS, Dept Brain Robot Interface, Kyoto, Japan
[2] Osaka Univ, Grad Sch Frontier Biosci, Osaka, Japan
[3] Nara Inst Sci & Technol, Grad Sch Informat Sci, Nara, Japan
关键词
Exoskeleton robot; Human-robot physical interaction; Human-in-the-loop; Reinforcement learning; WALKING; ORTHOSIS; SUIT;
D O I
10.1016/j.patrec.2017.04.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Social demand for exoskeleton robots that physically assist humans has been increasing in various situations due to the demographic trends of aging populations. With exoskeleton robots, an assistive strategy is a key ingredient. Since interactions between users and exoskeleton robots are bidirectional, the assistive strategy design problem is complex and challenging. In this paper, we explore a data-driven learning approach for designing assistive strategies for exoskeletons from user-robot physical interaction. We formulate the learning problem of assistive strategies as a policy search problem and exploit a data-efficient model-based reinforcement learning framework. Instead of explicitly providing the desired trajectories in the cost function, our cost function only considers the user's muscular effort measured by electromyography signals (EMGs) to learn the assistive strategies. The key underlying assumption is that the user is instructed to perform the task by his/her own intended movements. Since the EMGs are observed when the intended movements are achieved by the user's own muscle efforts rather than the robot's assistance, EMGs can be interpreted as the "cost" of the current assistance. We applied our method to a 1-DoF exoskeleton robot and conducted a series of experiments with human subjects. Our experimental results demonstrated that our method learned proper assistive strategies that explicitly considered the bidirectional interactions between a user and a robot with only 60 seconds of interaction. We also showed that our proposed method can cope with changes in both the robot dynamics and movement trajectories. (C) 2017 The Authors. Published by Elsevier B.V.
引用
收藏
页码:67 / 76
页数:10
相关论文
共 42 条
[1]  
[Anonymous], P IEEE RSJ INT C INT
[2]   Stronger, Smarter, Softer Next-Generation Wearable Robots [J].
Asbeck, Alan T. ;
De Rossi, Stefano M. M. ;
Galiana, Ignacio ;
Ding, Ye ;
Walsh, Conor J. .
IEEE ROBOTICS & AUTOMATION MAGAZINE, 2014, 21 (04) :22-33
[3]   Gravity-balancing leg orthosis and its performance evaluation [J].
Banala, Sai K. ;
Agrawal, Sunil K. ;
Fattah, Abbas ;
Krishnamoorthy, Vijaya ;
Hsu, Wei-Li ;
Scholz, John ;
Rudolph, Katherine .
IEEE TRANSACTIONS ON ROBOTICS, 2006, 22 (06) :1228-1239
[4]  
Ben Amor H, 2014, IEEE INT CONF ROBOT, P2831, DOI 10.1109/ICRA.2014.6907265
[5]   Time-frequency parameters of the surface myoelectric signal for assessing muscle fatigue during cyclic dynamic contractions [J].
Bonato, P ;
Roy, SH ;
Knaflitz, M ;
De Luca, CJ .
IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2001, 48 (07) :745-753
[6]  
Cunha T, 2016, IEEE INT CONF ROBOT, P1776, DOI 10.1109/ICRA.2016.7487322
[7]  
Deisenroth M.P., 2013, Pilco web site
[8]   Gaussian Processes for Data-Efficient Learning in Robotics and Control [J].
Deisenroth, Marc Peter ;
Fox, Dieter ;
Rasmussen, Carl Edward .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (02) :408-423
[9]  
Dongheui Lee, 2009, 2009 IEEE International Conference on Robotics and Automation (ICRA), P1535, DOI 10.1109/ROBOT.2009.5152857
[10]   Preliminary Evaluation of a Powered Lower Limb Orthosis to Aid Walking in Paraplegic Individuals [J].
Farris, Ryan J. ;
Quintero, Hugo A. ;
Goldfarb, Michael .
IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2011, 19 (06) :652-659