Path planning for asteroid hopping rovers with pre-trained deep reinforcement learning architectures

被引：54

作者：

Jiang, Jianxun ^{[1
]}

Zeng, Xiangyuan ^{[1
]}

Guzzetti, Davide ^{[2
]}

You, Yuyang ^{[1
]}

机构：

[1] Beijing Inst Technol, Beijing 100081, Peoples R China

[2] Auburn Univ, Auburn, AL 36849 USA

来源：

ACTA ASTRONAUTICA | 2020年 / 171卷

基金：

中国国家自然科学基金;

关键词：

Asteroid surface exploration; Hopping rover; Path planning; Deep reinforcement learning; EXPLORATION;

D O I：

10.1016/j.actaastro.2020.03.007

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

Asteroid surface exploration is challenging due to complex terrain topology and irregular gravity field. A hopping rover is considered as a promising mobility solution to explore the surface of small celestial bodies. Conventional path planning tasks, such as traversing a given map to reach a known target, may become particularly challenging for hopping rovers if the terrain displays sufficiently complex 3-D structures. As an alternative to traditional path-planning approaches, this work explores the possibility of applying deep reinforcement learning (DRL) to plan the path of a hopping rover across a highly irregular surface. The 3-D terrain of the asteroid surface is converted into a level matrix, which is used as an input of the reinforcement learning algorithm. A deep reinforcement learning architecture with good convergence and stability properties is presented to solve the rover path-planning problem. Numerical simulations are performed to validate the effectiveness and robustness of the proposed method with applications to two different types of 3-D terrains.

引用

页码：265 / 279

页数：15

共 38 条

[21] Estimates of glacier equilibrium line altitudes by the Area x Altitude, the Area x Altitude Balance Ratio and the Area x Altitude Balance Index methods and their validation
Osmaston, H
[J]. QUATERNARY INTERNATIONAL, 2005, 138 : 22 - 31
[22] Realization of DVCCTA Based Versatile Modulator
Pandey, Neeta
Pandey, Rajeshwari
Sayal, Aseem
Tripathi, Manan
[J]. ACTIVE AND PASSIVE ELECTRONIC COMPONENTS, 2014, 2014
[23] Schaul Tom, 2015, ARXIV
[24] Deep learning in neural networks: An overview
Schmidhuber, Juergen
[J]. NEURAL NETWORKS, 2015, 61 : 85 - 117
[25] Sutton RS, 2018, ADAPT COMPUT MACH LE, P1
[26] Surface Gravity Fields for Asteroids and Comets
Takahashi, Yu
Scheeres, D. J.
Werner, Robert A.
[J]. JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2013, 36 (02) : 362 - 374
[27] Tokic Michel, 2010, ANN C ART INT
[28] Valencia-Murillo R., 2013, P INT C ADV COMP SCI
[29] Vasudevan V., 2017, INT C MACHINE LEARNI, P459
[30] Where does AlphaGo go: From church-turing thesis to AlphaGo thesis and beyond
Wang F.-Y.
Zhang J.J.
Zheng X.
Wang X.
Yuan Y.
Dai X.
Zhang J.
Yang L.
[J]. 2016, Institute of Electrical and Electronics Engineers Inc. (03) : 113 - 120

← 1 2 3 4 →