Adaptive guidance and integrated navigation with reinforcement meta-learning

Cited by: 57
Authors
Gaudet, Brian [1]
Linares, Richard [3]
Furfaro, Roberto [1, 2]
Affiliations
[1] Univ Arizona, Dept Syst & Ind Engn, 1127 E James & Roger Way, Tucson, AZ 85721 USA
[2] Univ Arizona, Dept Aerosp & Mech Engn, Tucson, AZ 85721 USA
[3] MIT, Dept Aeronaut & Astronaut, Cambridge, MA 02139 USA
Keywords
Guidance; Meta learning; Reinforcement learning; Landing guidance
DOI
10.1016/j.actaastro.2020.01.007
Chinese Library Classification (CLC) number
V [Aeronautics, Astronautics]
Discipline classification code
08; 0825
Abstract
This paper proposes a novel adaptive guidance system developed using reinforcement meta-learning with a recurrent policy and value function approximator. The use of recurrent network layers allows the deployed policy to adapt in real time to environmental forces acting on the agent. We compare the performance of the DR/DV guidance law, an RL agent with a non-recurrent policy, and an RL agent with a recurrent policy in four challenging environments with unknown but highly variable dynamics. These tasks include a safe Mars landing with random engine failure and a landing on an asteroid with unknown environmental dynamics. We also demonstrate the ability of an RL meta-learning optimized policy to implement a guidance law using observations consisting of only Doppler radar altimeter readings in a Mars landing environment and LIDAR altimeter readings in an asteroid landing environment, thus integrating guidance and navigation.
Pages: 180-190
Page count: 11
Related papers
50 items in total
  • [21] MAML2: meta reinforcement learning via meta-learning for task categories
    Fu, Qiming
    Wang, Zhechao
    Fang, Nengwei
    Xing, Bin
    Zhang, Xiao
    Chen, Jianping
    FRONTIERS OF COMPUTER SCIENCE, 2023, 17 (04)
  • [22] Bitrate Adaptation and Guidance With Meta Reinforcement Learning
    Bentaleb, Abdelhak
    Lim, May
    Akcay, Mehmet N.
    Begen, Ali C.
    Zimmermann, Roger
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (11) : 10378 - 10392
  • [23] An Integrated Federated Learning and Meta-Learning Approach for Mining Operations
    Munagala, Venkat
    Singh, Sankhya
    Thudumu, Srikanth
    Logothetis, Irini
    Bhandari, Sushil
    Bhandari, Amit
    Mouzakis, Kon
    Vasa, Rajesh
    ADVANCES IN ARTIFICIAL INTELLIGENCE, AI 2023, PT I, 2024, 14471 : 379 - 390
  • [24] Multi-Task Reinforcement Meta-Learning in Neural Networks
    Shakah, Ghazi
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (07) : 263 - 269
  • [25] A Reinforcement Meta-Learning framework of executive function and information demand
    Silvetti, Massimo
    Lasaponara, Stefano
    Daddaoua, Nabil
    Horan, Mattias
    Gottlieb, Jacqueline
    NEURAL NETWORKS, 2023, 157 : 103 - 113
  • [26] Goal-Conditioned Reinforcement Learning for Ultrasound Navigation Guidance
    Amadou, Abdoul Aziz
    Singh, Vivek
    Ghesu, Florin C.
    Kim, Young-Ho
    Stanciulescu, Laura
    Sai, Harshitha P.
    Sharma, Puneet
    Young, Alistair
    Rajani, Ronak
    Rhode, Kawal
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT XI, 2024, 15011 : 319 - 329
  • [27] MLANE: Meta-Learning Based Adaptive Network Embedding
    Cui, Chen
    Yang, Ning
    Yu, Philip S.
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020 : 904 - 909
  • [28] Distributionally Adaptive Meta Reinforcement Learning
    Ajay, Anurag
    Gupta, Abhishek
    Ghosh, Dibya
    Levine, Sergey
    Agrawal, Pulkit
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022
  • [29] Geometry-adaptive Meta-learning in Riemannian Manifolds
    Gao, Zhi
    PROCEEDINGS OF THE ACM TURING AWARD CELEBRATION CONFERENCE-CHINA 2024, ACM-TURC 2024, 2024 : 231 - 232
  • [30] FAM: Adaptive federated meta-learning for MRI data
    Sinha, Indrajeet Kumar
    Verma, Shekhar
    Singh, Krishna Pratap
    PATTERN RECOGNITION LETTERS, 2024, 186 : 205 - 212