Deep reinforcement learning-based rehabilitation robot trajectory planning with optimized reward functions

被引:2
作者
Wang, Xusheng [1 ]
Xie, Jiexin [1 ]
Guo, Shijie [1 ]
Li, Yue [2 ]
Sun, Pengfei [3 ]
Gan, Zhongxue [1 ]
机构
[1] Fudan Univ, Acad Engn & Technol, 220 Handan Rd, Shanghai 200433, Peoples R China
[2] Hebei Coll Ind & Technol, Dept Comp Technol, Shijiazhuang, Hebei, Peoples R China
[3] Beijing Smartchip Microelect Technol Co Ltd, Beijing, Peoples R China
关键词
Rehabilitation robot; trajectory planning; deep reinforcement learning; reward function;
D O I
10.1177/16878140211067011
中图分类号
O414.1 [热力学];
学科分类号
摘要
Deep reinforcement learning (DRL) provides a new solution for rehabilitation robot trajectory planning in the unstructured working environment, which can bring great convenience to patients. Previous researches mainly focused on optimization strategies but ignored the construction of reward functions, which leads to low efficiency. Different from traditional sparse reward function, this paper proposes two dense reward functions. First, azimuth reward function mainly provides a global guidance and reasonable constraints in the exploration. To further improve the efficiency, a process-oriented aspiration reward function is proposed, it is capable of accelerating the exploration process and avoid locally optimal solution. Experiments show that the proposed reward functions are able to accelerate the convergence rate by 38.4% on average with the mainstream DRL methods. The mean of convergence also increases by 9.5%, and the percentage of standard deviation decreases by 21.2%-23.3%. Results show that the proposed reward functions can significantly improve learning efficiency of DRL methods, and then provide practical possibility for automatic trajectory planning of rehabilitation robot.
引用
收藏
页数:10
相关论文
共 28 条
[1]  
Badnava B., 2019, ARXIV PREPRINT ARXIV
[2]   ROAD: domestic assistant and rehabilitation robot [J].
Carrera, Isela ;
Moreno, Hector A. ;
Saltaren, Roque ;
Perez, Carlos ;
Puglisi, Lisandro ;
Garcia, Cecilia .
MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2011, 49 (10) :1201-1211
[3]  
Chen X, 2018, IEEE INT C INT ROBOT, P3110, DOI 10.1109/IROS.2018.8593702
[4]  
Edwards H., 2018, INT C LEARN REPR
[5]  
Freese M, 2010, LECT NOTES ARTIF INT, V6472, P51, DOI 10.1007/978-3-642-17319-6_8
[6]  
Houthooft R., 2016, CoRR, V29
[7]  
Kahn G, 2018, IEEE INT CONF ROBOT, P5129, DOI 10.1109/ICRA.2018.8460655
[8]   Leveraging Deep Reinforcement Learning for Reaching Robotic Tasks [J].
Katyal, Kapil ;
Wang, I-Jeng ;
Burlina, Philippe .
2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, :490-491
[9]   Dynamic Path Planning of Unknown Environment Based on Deep Reinforcement Learning [J].
Lei, Xiaoyun ;
Zhang, Zhian ;
Dong, Peifang .
JOURNAL OF ROBOTICS, 2018, 2018
[10]  
Lillicrap T. P., 2016, ARXIV