Adaptive guidance and integrated navigation with reinforcement meta-learning

被引:57
|
作者
Gaudet, Brian [1 ]
Linares, Richard [3 ]
Furfaro, Roberto [1 ,2 ]
机构
[1] Univ Arizona, Dept Syst & Ind Engn, 1127 E James & Roger Way, Tucson, AZ 85721 USA
[2] Univ Arizona, Dept Aerosp & Mech Engn, Tucson, AZ 85721 USA
[3] MIT, Dept Aeronaut & Astronaut, Cambridge, MA 02139 USA
关键词
Guidance; Meta learning; Reinforcement learning; Landing guidance;
D O I
10.1016/j.actaastro.2020.01.007
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
This paper proposes a novel adaptive guidance system developed using reinforcement meta-learning with a recurrent policy and value function approximator. The use of recurrent network layers allows the deployed policy to adapt in real time to environmental forces acting on the agent. We compare the performance of the DR/DV guidance law, an RL agent with a non-recurrent policy, and an RL agent with a recurrent policy in four challenging environments with unknown but highly variable dynamics. These tasks include a safe Mars landing with random engine failure and a landing on an asteroid with unknown environmental dynamics. We also demonstrate the ability of a RL meta-learning optimized policy to implement a guidance law using observations consisting of only Doppler radar altimeter readings in a Mars landing environment, and LIDAR altimeter readings in an asteroid landing environment thus integrating guidance and navigation.
引用
收藏
页码:180 / 190
页数:11
相关论文
共 50 条
  • [31] Adaptive Gradient-Based Meta-Learning Methods
    Khodak, Mikhail
    Balcan, Maria-Florina
    Talwalkar, Ameet
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [32] Domain Adaptive Meta-Learning for Dialogue State Tracking
    Zeng, Jiali
    Yin, Yongjing
    Liu, Yang
    Ge, Yubin
    Su, Jinsong
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 2493 - 2501
  • [33] Non-monotone Adaptive Submodular Meta-Learning
    Tang, Shaojie
    Yuan, Jing
    PROCEEDINGS OF THE 2021 SIAM CONFERENCE ON APPLIED AND COMPUTATIONAL DISCRETE ALGORITHMS, ACDA21, 2021, : 57 - 65
  • [34] ADAPTIVE ATTITUDE DETERMINATION OF BIONIC POLARIZATION INTEGRATED NAVIGATION SYSTEM BASED ON REINFORCEMENT LEARNING STRATEGY
    Bao, Huiyi
    Du, Tao
    Sun, Luyue
    MATHEMATICAL FOUNDATIONS OF COMPUTING, 2023, 6 (02): : 161 - 177
  • [35] TGOnline: Enhancing Temporal Graph Learning with Adaptive Online Meta-Learning
    Wang, Ruijie
    Huang, Jingyuan
    Zhang, Yutong
    Li, Jinyang
    Wang, Yufeng
    Zhao, Wanyu
    Liu, Shengzhong
    Mendis, Charith
    Abdelzaher, Tarek
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 1659 - 1669
  • [36] MLS: A Meta-Learning Based Stackelberg Model for Robot Trajectory Guidance
    Guo, Jin
    Jiang, Zhiyong
    Liu, Yifan
    Gong, Lili
    2024 8TH INTERNATIONAL CONFERENCE ON ROBOTICS, CONTROL AND AUTOMATION, ICRCA 2024, 2024, : 23 - 28
  • [37] Integrated robust navigation and guidance for the kinetic impact of near-earth asteroids based on deep reinforcement learning
    Yuan, Hao
    Li, Dongxu
    Wang, Jie
    AEROSPACE SCIENCE AND TECHNOLOGY, 2023, 142
  • [38] Learning Meta-Learning (LML) dataset: Survey data of meta-learning parameters
    Corraya, Sonia
    Al Mamun, Shamim
    Kaiser, M. Shamim
    DATA IN BRIEF, 2023, 51
  • [39] Adaptive Multi-Teacher Knowledge Distillation with Meta-Learning
    Zhang, Hailin
    Chen, Defang
    Wang, Can
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1943 - 1948
  • [40] Hybrid Hierarchical Reinforcement Learning for online guidance and navigation with partial observability
    Zhou, Ye
    van Kampen, Erik-Jan
    Chu, Qiping
    NEUROCOMPUTING, 2019, 331 : 443 - 457