Adaptive generalized ZEM-ZEV feedback guidance for planetary landing via a deep reinforcement learning approach

Cited by: 62

Authors:

Furfaro, Roberto [1]

Scorsoglio, Andrea [2]

Linares, Richard [3]

Massari, Mauro [4]

Affiliations:

[1] Univ Arizona, Dept Syst & Ind Engn, Dept Aerosp & Mech Engn, Tucson, AZ 85721 USA

[2] Univ Arizona, Dept Syst & Ind Engn, Tucson, AZ 85721 USA

[3] MIT, Dept Aeronaut & Astronaut, Cambridge, MA 02139 USA

[4] Politecn Milan, Dept Aerosp Sci & Technol, I-20156 Milan, Italy

Source:

ACTA ASTRONAUTICA | 2020 / Vol. 171

Keywords:

Optimal landing guidance; Deep reinforcement learning; Closed-loop guidance

DOI:

10.1016/j.actaastro.2020.02.051

Chinese Library Classification (CLC) number:

V [Aviation, Aerospace]

Discipline classification code:

08; 0825

Abstract:

Precision landing on large and small planetary bodies is a technology of utmost importance for future human and robotic exploration of the solar system. In this context, the Zero-Effort-Miss/Zero-Effort-Velocity (ZEM/ZEV) feedback guidance algorithm has been studied extensively and remains an active field of research. The algorithm, although powerful in terms of accuracy and ease of implementation, has some limitations. In this paper, we therefore present an adaptive guidance algorithm based on classical ZEM/ZEV in which machine learning is used to overcome these limitations and create a closed-loop guidance algorithm that is lightweight enough to be implemented on board spacecraft and flexible enough to adapt to the given constraint scenario. The adopted methodology is an actor-critic reinforcement learning algorithm that learns the parameters of the above-mentioned guidance architecture according to the given problem constraints.
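For readers unfamiliar with the guidance law the abstract builds on, the following is a minimal sketch of the generalized ZEM/ZEV acceleration command, assuming a constant gravity vector over the remaining flight time. The function name `zem_zev_accel`, its argument names, and the pure-Python vector representation are illustrative assumptions, not taken from the paper. The default gains 6 and -2 are the classical energy-optimal values; the gains are exposed as parameters because an adaptive scheme such as the one described above would supply them from a learned policy. Exactly which parameters the paper's agent tunes is not stated in the abstract.

```python
def zem_zev_accel(r, v, r_f, v_f, g, t_go, k_r=6.0, k_v=-2.0):
    """Commanded acceleration from the generalized ZEM/ZEV feedback law.

    Assumption: gravity g is a constant vector over the time-to-go t_go.
    r, v     : current position and velocity (3 components each)
    r_f, v_f : target position and velocity
    k_r, k_v : guidance gains (6 and -2 are the classical values)
    """
    if t_go <= 0.0:
        raise ValueError("t_go must be positive")
    # Zero-effort miss: target position minus the position reached
    # at the final time if no control were applied from now on.
    zem = [rf - (ri + vi * t_go + 0.5 * gi * t_go ** 2)
           for ri, vi, rf, gi in zip(r, v, r_f, g)]
    # Zero-effort velocity: target velocity minus the coasting velocity.
    zev = [vf - (vi + gi * t_go) for vi, vf, gi in zip(v, v_f, g)]
    # a = (k_r / t_go^2) * ZEM + (k_v / t_go) * ZEV
    return [k_r / t_go ** 2 * m + k_v / t_go * e for m, e in zip(zem, zev)]
```

If the vehicle is already on a ballistic path that meets the target state, both error terms vanish and the command is zero; otherwise the command grows as the time-to-go shrinks, which is why the law is undefined at `t_go = 0`.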


Pages: 156-171

Page count: 16

