Advancing spacecraft rendezvous and docking through safety reinforcement learning and ubiquitous learning principles

被引:1
|
作者
Sharma, Kanta Prasad [1 ]
Kumar, Indradeep [2 ]
Singh, Pavitar Parkash [3 ]
Anbazhagan, K. [4 ]
Albarakati, Hussain Mobarak [5 ]
Bhatt, Mohammed Wasim [6 ]
Ziyadullayevich, Avlokulov Anvar [7 ]
Rana, Arti [8 ]
Sivasankari, S. A. [9 ]
机构
[1] GLA Univ, Dept Comp Engn & Applicat, Mathura 281406, Uttar Pradesh, India
[2] Inst Aeronaut Engn, Dept Aeronaut Engn, Hyderabad 500043, Telangana, India
[3] Lovely Profess Univ, Dept Management, Phagwara, India
[4] SIMATS, Saveetha Sch Engn, Dept Comp Sci & Engn, Chennai, India
[5] Umm Al Qura Univ, Coll Comp & Informat Syst, Comp Engn & Networks Dept, Mecca 24382, Saudi Arabia
[6] Model Inst Engn & Technol, Jammu, J&K, India
[7] Tashkent Inst Finance, Dept Audit, Tashkent, Uzbekistan
[8] Uttaranchal Univ, Uttaranchal Inst Technol, Dept Comp Sci & Engn, Dehra Dun 248007, India
[9] Vignans Fdn Sci Technol & Res, Dept ECE, Guntur 522213, India
关键词
Proximal Policy Optimization; Deep Deterministic Policy Gradient; Reinforcement Learning; Markov Model; Rendezvous and Docking Mission; ARTIFICIAL POTENTIAL-FIELD; SLIDING MODE CONTROL; COLLISION-AVOIDANCE; MANEUVERS;
D O I
10.1016/j.chb.2023.108110
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
As spacecraft rendezvous and docking missions become increasingly complex, the need for advanced solutions has surged. In recent years, the application of reinforcement learning techniques to tackle spacecraft rendezvous guidance challenges has emerged as a prominent international trend. Vital to ensuring the secure rendezvous and docking of spacecraft is the task of obstacle avoidance. However, traditional reinforcement learning algorithms lack the ability to enforce safety constraints within the exploration space, which presents a formidable obstacle in the design of spacecraft rendezvous guidance strategies. In response to this challenge, a spacecraft rendezvous guidance methodology founded on safety reinforcement learning is proposed. Firstly, a Markov model is crafted for autonomous spacecraft rendezvous in scenarios involving collision avoidance. A reward system, contingent on obstacle warnings and collision avoidance constraints, is introduced to establish a safety reinforcement learning framework for devising spacecraft rendezvous guidance strategies. Secondly, within the framework of safety reinforcement learning, two deep reinforcement learning (DRL) algorithms, Proximal Policy Optimisation (PPO) and Deep Deterministic Policy Gradient (DDPG), are leveraged to generate these guidance strategies. Experimental findings validate the effectiveness of this approach in successfully executing obstacle avoidance and achieving rendezvous with remarkable precision. Furthermore, through an analysis of the performance and generalization capabilities of these two algorithms, the efficacy of the proposed methodology is further underscored. This fusion of advanced space guidance technology with the principles of Ubiquitous Learning marks a significant step forward in the quest for safer and more efficient spacecraft rendezvous and docking operations.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] A spacecraft rendezvous and docking method based on inverse reinforcement learning
    Yue C.
    Wang X.
    Yue X.
    Song T.
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2023, 44 (19):
  • [2] SPACECRAFT RENDEZVOUS GUIDANCE IN CLUTTERED ENVIRONMENTS VIA REINFORCEMENT LEARNING
    Broida, Jacob
    Linares, Richard
    SPACEFLIGHT MECHANICS 2019, VOL 168, PTS I-IV, 2019, 168 : 1777 - 1788
  • [3] Spacecraft Proximity Maneuvering and Rendezvous With Collision Avoidance Based on Reinforcement Learning
    Qu, Qingyu
    Liu, Kexin
    Wang, Wei
    Lu, Jinhu
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2022, 58 (06) : 5823 - 5834
  • [4] Invariant Feature Matching in Spacecraft Rendezvous and Docking Optical Imaging Based on Deep Learning
    Guo, Dongwen
    Wu, Shuang
    Weng, Desheng
    Gao, Chenzhong
    Li, Wei
    REMOTE SENSING, 2024, 16 (24)
  • [5] SPACECRAFT RENDEZVOUS GUIDANCE IN CLUTTERED ENVIRONMENTS VIA ARTIFICIAL POTENTIAL FUNCTIONS AND REINFORCEMENT LEARNING
    Gaudet, Brain
    Linares, Richard
    Furfaro, Roberto
    ASTRODYNAMICS 2018, PTS I-IV, 2019, 167 : 813 - 827
  • [6] Safety Control for Spacecraft Autonomous Rendezvous and Docking Under Motion Constraints
    Dong, Hongyang
    Hu, Qinglei
    Akella, Maruthi R.
    JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2017, 40 (07) : 1680 - 1692
  • [7] Operator Learning Effects in Teleoperated Rendezvous & Docking
    Wilde, Markus
    Harder, Jan
    Purschke, Ralf
    2013 IEEE AEROSPACE CONFERENCE, 2013,
  • [8] Learning process safety principles through practice
    Gunasekera, Manisha
    Khan, Faisal
    Ahmed, Salim
    PROCESS SAFETY PROGRESS, 2018, 37 (03) : 347 - 354
  • [9] A Reinforcement Learning Approach for the Multichannel Rendezvous Problem
    Wang, Jen-Hung
    Lu, Ping-En
    Chang, Cheng-Shang
    Lee, Duan-Shin
    2019 IEEE GLOBECOM WORKSHOPS (GC WKSHPS), 2019,
  • [10] Trajectory optimization of spacecraft autonomous far-distance rapid rendezvous based on deep reinforcement learning
    Di, Peng
    Yao, Ye
    Lin, Zheng
    Yin, Zengshan
    ADVANCES IN SPACE RESEARCH, 2025, 75 (01) : 790 - 806