Reinforcement learning with prior policy guidance for motion planning of dual-arm free-floating space robot

被引：27

作者：

Cao, Yuxue ^{[1
]}

Wang, Shengjie ^{[2
]}

Zheng, Xiang ^{[3
]}

Ma, Wenke ^{[4
]}

Xie, Xinru ^{[1
]}

Liu, Lei ^{[1
]}

机构：

[1] Beijing Inst Control Engn, Beijing, Peoples R China

[2] Tsinghua Univ, Dept Automat, Beijing, Peoples R China

[3] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China

[4] Qian Xuesen Lab Space Technol, Beijing, Peoples R China

来源：

AEROSPACE SCIENCE AND TECHNOLOGY | 2023年 / 136卷

关键词：

Learning systems - Reinforcement learning - Robot programming;

D O I：

10.1016/j.ast.2022.108098

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

Reinforcement learning methods as a promising technique have achieved superior results in the motion planning of free-floating space robots. However, due to the increase in planning dimension and the intensification of system dynamics coupling, the motion planning of dual-arm free-floating space robots remains an open challenge. In particular, the current study cannot handle the task of capturing a noncooperative object due to the lack of the pose constraint of the end-effectors. To address the problem, we propose a novel algorithm, EfficientLPT, to facilitate RL-based methods to improve planning accuracy efficiently. Our core contributions are constructing a mixed policy with prior knowledge guidance and introducing II center dot Iloo to build a more reasonable reward function. Furthermore, our method successfully captures a rotating object with different spinning speeds.(c) 2023 Elsevier Masson SAS. All rights reserved.

引用

页数：12

共 40 条

[1] Detumbling strategy based on friction control of dual-arm space robot for capturing tumbling target [J].