Reinforcement learning-based decision-making for spacecraft pursuit-evasion game in elliptical orbits

被引:2
|
作者
Yu, Weizhuo [1 ,2 ]
Liu, Chuang [1 ,2 ]
Yue, Xiaokui [1 ,2 ]
机构
[1] Northwestern Polytech Univ, Sch Astronaut, Xian 710072, Peoples R China
[2] Northwestern Polytech Univ Shenzhen, Res & Dev Inst, Shenzhen 518057, Peoples R China
基金
中国国家自然科学基金;
关键词
Pursuit-evasion game; Decision making; Deep deterministic policy gradient; Impulsive maneuver; Elliptical orbit; DYNAMICS; DOCKING;
D O I
10.1016/j.conengprac.2024.106072
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The orbital game theory is a fundamental technology for the cleanup of space debris to improve the safety of useful spacecraft in future, thus, this work develops a decision-making method by reinforcement learning technology to implement the pursuit-evasion game in elliptical orbits. The linearized Tschauner-Hempel equation describes the spacecraft's motion and the problem is formulated by game theory. Subsequently, an impulsive maneuvering model in a complete three-dimensional elliptical orbit is established. Then an algorithm based on deep deterministic policy gradient is designed to solve the optimal strategy for the pursuit-evasion game. For the successful decision of the pursuer, an extensive reward function is designed and improved considering the shortest time, optimal fuel, and collision avoidance. Finally, numerical simulations of a pursuit-evasion mission are performed to demonstrate the effectiveness and superiority of the proposed decision-making algorithm. The game success rate of the algorithm against targets with different maneuvering abilities is verified, which implies that the algorithm can be applied in extended scenarios.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Pursuit-evasion game switching strategies for spacecraft with incomplete-information
    Tang, Xu
    Ye, Dong
    Huang, Lei
    Sun, Zhaowei
    Sun, Jianye
    AEROSPACE SCIENCE AND TECHNOLOGY, 2021, 119
  • [22] Analytical pursuit-evasion game strategy in arbitrary Keplerian reference orbits
    Fu, Shuyue
    Gong, Shengping
    Shi, Peng
    AEROSPACE SCIENCE AND TECHNOLOGY, 2025, 158
  • [23] Method of spacecraft cluster orbital pursuit-evasion game based on the hierarchical theory structure
    Feng, Haolong
    Wu, Songtai
    Liu, Shengyang
    Song, Ting
    Han, Fei
    2024 3RD CONFERENCE ON FULLY ACTUATED SYSTEM THEORY AND APPLICATIONS, FASTA 2024, 2024, : 438 - 442
  • [24] An Algorithm for UAV Pursuit-Evasion Game Based on MADDPG and Contrastive Learning
    Wang R.
    Wang X.
    Yuhang Xuebao/Journal of Astronautics, 2024, 45 (02): : 262 - 272
  • [25] Reinforcement learning-based formation-surrounding control for multiple quadrotor UAVs pursuit-evasion games
    Xiong, Hang
    Zhang, Ying
    ISA TRANSACTIONS, 2024, 145 : 205 - 224
  • [26] PRD-MADDPG: An efficient learning-based algorithm for orbital pursuit-evasion game with impulsive maneuvers
    Zhao, Liran
    Zhang, Yulin
    Dang, Zhaohui
    ADVANCES IN SPACE RESEARCH, 2023, 72 (02) : 211 - 230
  • [27] Analysis of a New Pursuit-Evasion Game Based on Game Theory
    Chen, Hao
    Chen, Jing
    Zhang, Wanpeng
    Liu, Hongfu
    2015 11TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION (ICNC), 2015, : 875 - 880
  • [28] Pursuit-evasion game strategy of USV based on deep reinforcement learning in complex multi-obstacle environment
    Qu, Xiuqing
    Gan, Wenhao
    Song, Dalei
    Zhou, Liqin
    OCEAN ENGINEERING, 2023, 273
  • [29] Reinforcement Learning based Anti-UAV Three-dimensional Pursuit-evasion Game for Substation Security
    Dong, Qingxue
    2024 5th International Conference on Mechatronics Technology and Intelligent Manufacturing, ICMTIM 2024, 2024, : 224 - 227
  • [30] Strategy solution of non-cooperative target pursuit-evasion game based on branching deep reinforcement learning
    Liu B.
    Ye X.
    Gao Y.
    Wang X.
    Ni L.
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2020, 41 (10):