Adaptive guidance and integrated navigation with reinforcement meta-learning

Cited by: 57
Authors
Gaudet, Brian [1]
Linares, Richard [3]
Furfaro, Roberto [1, 2]
Affiliations
[1] Univ Arizona, Dept Syst & Ind Engn, 1127 E James & Roger Way, Tucson, AZ 85721 USA
[2] Univ Arizona, Dept Aerosp & Mech Engn, Tucson, AZ 85721 USA
[3] MIT, Dept Aeronaut & Astronaut, Cambridge, MA 02139 USA
Keywords
Guidance; Meta learning; Reinforcement learning; Landing guidance
DOI
10.1016/j.actaastro.2020.01.007
Chinese Library Classification (CLC) number
V [Aeronautics, Astronautics]
Discipline classification code
08; 0825
Abstract
This paper proposes a novel adaptive guidance system developed using reinforcement meta-learning with a recurrent policy and value function approximator. The use of recurrent network layers allows the deployed policy to adapt in real time to environmental forces acting on the agent. We compare the performance of the DR/DV guidance law, an RL agent with a non-recurrent policy, and an RL agent with a recurrent policy in four challenging environments with unknown but highly variable dynamics. These tasks include a safe Mars landing with random engine failure and a landing on an asteroid with unknown environmental dynamics. We also demonstrate the ability of an RL meta-learning optimized policy to implement a guidance law using observations consisting of only Doppler radar altimeter readings in a Mars landing environment and LIDAR altimeter readings in an asteroid landing environment, thus integrating guidance and navigation.
Pages: 180-190
Number of pages: 11