End-to-end deep reinforcement learning and control with multimodal perception for planetary robotic dual peg-in-hole assembly

Cited by: 1
Authors
Li, Boxin [1]
Wang, Zhaokui [1]
Affiliations
[1] Tsinghua Univ, Sch Aerosp Engn, Beijing 100084, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Planetary construction; Planetary robotic assembly; End-to-end control; Deep reinforcement learning;
DOI
10.1016/j.asr.2024.08.028
Chinese Library Classification
V [Aeronautics, Astronautics];
Discipline classification code
08 ; 0825 ;
Abstract
Planetary construction is necessary for long-term scientific deep space exploration and resource utilization in the future. Planetary robotic assembly control is a key technology that must be mastered for future planetary surface construction. The paper focuses on the most representative task, dual peg-in-hole assembly, which has sufficiently complex contact interaction, a wide range of applications, and good method portability. To address the challenges brought by the unstructured planetary environment and the features of the construction tasks, the paper proposes an end-to-end deep reinforcement learning and control method with multimodal perception for planetary robotic assembly tasks. A staged reward function based on a visual virtual target point is designed for policy learning. The effectiveness and feasibility of the proposed control method have been verified through simulation experiments and ground experiments with a real robot. The work provides a feasible control method for robotic operations in future planetary surface construction.
Pages: 5860-5873
Page count: 14