Adaptive guidance and integrated navigation with reinforcement meta-learning

Times cited: 57
Authors
Gaudet, Brian [1]
Linares, Richard [3]
Furfaro, Roberto [1,2]
Affiliations
[1] Univ Arizona, Dept Syst & Ind Engn, 1127 E James & Roger Way, Tucson, AZ 85721 USA
[2] Univ Arizona, Dept Aerosp & Mech Engn, Tucson, AZ 85721 USA
[3] MIT, Dept Aeronaut & Astronaut, Cambridge, MA 02139 USA
Keywords
Guidance; Meta learning; Reinforcement learning; Landing guidance;
DOI
10.1016/j.actaastro.2020.01.007
Chinese Library Classification (CLC) number
V [Aeronautics, Astronautics]
Discipline classification code
08; 0825
Abstract
This paper proposes a novel adaptive guidance system developed using reinforcement meta-learning with a recurrent policy and value function approximator. The use of recurrent network layers allows the deployed policy to adapt in real time to environmental forces acting on the agent. We compare the performance of the DR/DV guidance law, an RL agent with a non-recurrent policy, and an RL agent with a recurrent policy in four challenging environments with unknown but highly variable dynamics. These tasks include a safe Mars landing with random engine failure and a landing on an asteroid with unknown environmental dynamics. We also demonstrate the ability of an RL meta-learning optimized policy to implement a guidance law using observations consisting of only Doppler radar altimeter readings in a Mars landing environment and LIDAR altimeter readings in an asteroid landing environment, thus integrating guidance and navigation.
Pages: 180-190
Page count: 11
Related papers
50 in total
  • [21] Learn to chill - Intelligent Chiller Scheduling using Meta-learning and Deep Reinforcement Learning
    Manoharan, Praveen
    Venkat, Malini Pooni
    Nagarathinam, Srinarayana
    Vasan, Arunchandar
    BUILDSYS'21: PROCEEDINGS OF THE 2021 ACM INTERNATIONAL CONFERENCE ON SYSTEMS FOR ENERGY-EFFICIENT BUILT ENVIRONMENTS, 2021, : 21 - 30
  • [22] ROBUST MAML: PRIORITIZATION TASK BUFFER WITH ADAPTIVE LEARNING PROCESS FOR MODEL-AGNOSTIC META-LEARNING
    Thanh Nguyen
    Tung Luu
    Trung Pham
    Rakhimkul, Sanzhar
    Yoo, Chang D.
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3460 - 3464
  • [23] A unified framework for reinforcement learning, co-learning and meta-learning how to coordinate in collaborative multi-agent systems
    Tosic, Predrag T.
    Vilalta, Ricardo
    ICCS 2010 - INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, PROCEEDINGS, 2010, 1 (01): : 2211 - 2220
  • [24] Meta-learning within Projective Simulation
    Makmal, Adi
    Melnikov, Alexey A.
    Dunjko, Vedran
    Briegel, Hans J.
    IEEE ACCESS, 2016, 4 : 2110 - 2122
  • [25] Meta-Learning With Adaptive Learning Rates for Few-Shot Fault Diagnosis
    Chang, Liang
    Lin, Yan-Hui
    IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2022, 27 (06) : 5948 - 5958
  • [26] Learning to Learn Better Unimodal Representations via Adaptive Multimodal Meta-Learning
    Sun, Ya
    Mai, Sijie
    Hu, Haifeng
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (03) : 2209 - 2223
  • [27] Learning With Dual Demonstration Domains: Random Domain-Adaptive Meta-Learning
    Hu, Ziye
    Li, Wei
    Gan, Zhongxue
    Guo, Weikun
    Zhu, Jiwei
    Gao, Xiang
    Yang, Xuyun
    Peng, Yueyan
    Zuo, Zhihao
    Wen, James Zhiqing
    Zhou, Decheng
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02) : 3523 - 3530
  • [28] Robustness challenges in Reinforcement Learning based time-critical cloud resource scheduling: A Meta-Learning based solution
    Liu, Hongyun
    Chen, Peng
    Ouyang, Xue
    Gao, Hui
    Yan, Bing
    Grosso, Paola
    Zhao, Zhiming
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 146 : 18 - 33
  • [29] Smart and adaptive website navigation recommendations based on reinforcement learning
    Ting, I-Hsien
    Tang, Ying-Ling
    Minetaki, Kazunori
    INTERNATIONAL JOURNAL OF WEB AND GRID SERVICES, 2024, 20 (03) : 253 - 265
  • [30] Visual Perception Generalization for Vision-and-Language Navigation via Meta-Learning
    Wang, Ting
    Wu, Zongkai
    Wang, Donglin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (08) : 5193 - 5199