Path planning for asteroid hopping rovers with pre-trained deep reinforcement learning architectures

被引:54
作者
Jiang, Jianxun [1 ]
Zeng, Xiangyuan [1 ]
Guzzetti, Davide [2 ]
You, Yuyang [1 ]
机构
[1] Beijing Inst Technol, Beijing 100081, Peoples R China
[2] Auburn Univ, Auburn, AL 36849 USA
基金
中国国家自然科学基金;
关键词
Asteroid surface exploration; Hopping rover; Path planning; Deep reinforcement learning; EXPLORATION;
D O I
10.1016/j.actaastro.2020.03.007
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Asteroid surface exploration is challenging due to complex terrain topology and irregular gravity field. A hopping rover is considered as a promising mobility solution to explore the surface of small celestial bodies. Conventional path planning tasks, such as traversing a given map to reach a known target, may become particularly challenging for hopping rovers if the terrain displays sufficiently complex 3-D structures. As an alternative to traditional path-planning approaches, this work explores the possibility of applying deep reinforcement learning (DRL) to plan the path of a hopping rover across a highly irregular surface. The 3-D terrain of the asteroid surface is converted into a level matrix, which is used as an input of the reinforcement learning algorithm. A deep reinforcement learning architecture with good convergence and stability properties is presented to solve the rover path-planning problem. Numerical simulations are performed to validate the effectiveness and robustness of the proposed method with applications to two different types of 3-D terrains.
引用
收藏
页码:265 / 279
页数:15
相关论文
共 38 条
  • [21] Estimates of glacier equilibrium line altitudes by the Area x Altitude, the Area x Altitude Balance Ratio and the Area x Altitude Balance Index methods and their validation
    Osmaston, H
    [J]. QUATERNARY INTERNATIONAL, 2005, 138 : 22 - 31
  • [22] Realization of DVCCTA Based Versatile Modulator
    Pandey, Neeta
    Pandey, Rajeshwari
    Sayal, Aseem
    Tripathi, Manan
    [J]. ACTIVE AND PASSIVE ELECTRONIC COMPONENTS, 2014, 2014
  • [23] Schaul Tom, 2015, ARXIV
  • [24] Deep learning in neural networks: An overview
    Schmidhuber, Juergen
    [J]. NEURAL NETWORKS, 2015, 61 : 85 - 117
  • [25] Sutton RS, 2018, ADAPT COMPUT MACH LE, P1
  • [26] Surface Gravity Fields for Asteroids and Comets
    Takahashi, Yu
    Scheeres, D. J.
    Werner, Robert A.
    [J]. JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2013, 36 (02) : 362 - 374
  • [27] Tokic Michel, 2010, ANN C ART INT
  • [28] Valencia-Murillo R., 2013, P INT C ADV COMP SCI
  • [29] Vasudevan V., 2017, INT C MACHINE LEARNI, P459
  • [30] Where does AlphaGo go: From church-turing thesis to AlphaGo thesis and beyond
    Wang F.-Y.
    Zhang J.J.
    Zheng X.
    Wang X.
    Yuan Y.
    Dai X.
    Zhang J.
    Yang L.
    [J]. 2016, Institute of Electrical and Electronics Engineers Inc. (03) : 113 - 120