Adaptive generalized ZEM-ZEV feedback guidance for planetary landing via a deep reinforcement learning approach

被引:55
作者
Furfaro, Roberto [1 ]
Scorsoglio, Andrea [2 ]
Linares, Richard [3 ]
Massari, Mauro [4 ]
机构
[1] Univ Arizona, Dept Syst & Ind Engn, Dept Aerosp & Mech Engn, Tucson, AZ 85721 USA
[2] Univ Arizona, Dept Syst & Ind Engn, Tucson, AZ 85721 USA
[3] MIT, Dept Aeronaut & Astronaut, Cambridge, MA 02139 USA
[4] Politecn Milan, Dept Aerosp Sci & Technol, I-20156 Milan, Italy
关键词
Optimal landing guidance; Deep reinfocement learning; Closed-loop guidance;
D O I
10.1016/j.actaastro.2020.02.051
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Precision landing on large and small planetary bodies is a technology of utmost importance for future human and robotic exploration of the solar system. In this context, the Zero-Effort-Miss/Zero-Effort-Velocity (ZEM/ZEV) feedback guidance algorithm has been studied extensively and is still a field of active research. The algorithm, although powerful in terms of accuracy and ease of implementation, has some limitations. Therefore with this paper we present an adaptive guidance algorithm based on classical ZEM/ZEV in which machine learning is used to overcome its limitations and create a closed loop guidance algorithm that is sufficiently lightweight to be implemented on board spacecraft and flexible enough to be able to adapt to the given constraint scenario. The adopted methodology is an actor-critic reinforcement learning algorithm that learns the parameters of the above-mentioned guidance architecture according to the given problem constraints.
引用
收藏
页码:156 / 171
页数:16
相关论文
共 29 条
  • [1] [Anonymous], 2000, ADV NEURAL INF PROCE
  • [2] Minimum-Landing-Error Powered-Descent Guidance for Mars Landing Using Convex Optimization
    Blackmore, Lars
    Acikmese, Behcet
    Scharf, Daniel P.
    [J]. JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2010, 33 (04) : 1161 - 1171
  • [3] Burns O. Jack, 2018, ACTA ASTRONAUTICA
  • [4] Furfaro R., 2016, 26 AAS AIAA SPAC FLI
  • [5] Furfaro R., 2017, 3 IAA C DYN CONTR SP
  • [6] Mars Science Laboratory Mission and Science Investigation
    Grotzinger, John P.
    Crisp, Joy
    Vasavada, Ashwin R.
    Anderson, Robert C.
    Baker, Charles J.
    Barry, Robert
    Blake, David F.
    Conrad, Pamela
    Edgett, Kenneth S.
    Ferdowski, Bobak
    Gellert, Ralf
    Gilbert, John B.
    Golombek, Matt
    Gomez-Elvira, Javier
    Hassler, Donald M.
    Jandura, Louise
    Litvak, Maxim
    Mahaffy, Paul
    Maki, Justin
    Meyer, Michael
    Malin, Michael C.
    Mitrofanov, Igor
    Simmonds, John J.
    Vaniman, David
    Welch, Richard V.
    Wiens, Roger C.
    [J]. SPACE SCIENCE REVIEWS, 2012, 170 (1-4) : 5 - 56
  • [7] Guo Y., 2011, AAS AIAA ASTR SPEC C, V36, P588
  • [8] Applications of Generalized Zero-Effort-Miss/Zero-Effort-Velocity Feedback Guidance Algorithm
    Guo, Yanning
    Hawkins, Matt
    Wie, Bong
    [J]. JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2013, 36 (03) : 810 - 820
  • [9] Waypoint-Optimized Zero-Effort-Miss/Zero-Effort-Velocity Feedback Guidance for Mars Landing
    Guo, Yanning
    Hawkins, Matt
    Wie, Bong
    [J]. JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2013, 36 (03) : 799 - 809
  • [10] TRAINING FEEDFORWARD NETWORKS WITH THE MARQUARDT ALGORITHM
    HAGAN, MT
    MENHAJ, MB
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (06): : 989 - 993