PRD-MADDPG: An efficient learning-based algorithm for orbital pursuit-evasion game with impulsive maneuvers

Cited by: 30
Authors
Zhao, Liran [1, 2]
Zhang, Yulin [1, 2]
Dang, Zhaohui [1, 2]
Affiliations
[1] Northwestern Polytech Univ, Sch Astronaut, Xian 710072, Peoples R China
[2] Northwestern Polytech Univ, Natl Key Lab Aerosp Flight Dynam, Xian 710072, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Orbital pursuit-evasion game; Impulsive maneuver; Multi-agent deep reinforcement learning; PRD-MADDPG; REINFORCEMENT; NAVIGATION
DOI
10.1016/j.asr.2023.03.014
Chinese Library Classification
V [Aeronautics, Astronautics]
Subject classification codes
08; 0825
Abstract
This paper investigates impulsive orbital pursuit-evasion games (OPEGs) using an artificial intelligence-based approach. First, a mathematical model is constructed for impulsive OPEGs in which the pursuer and the evader both perform their orbital maneuvers by applying impulsive velocity increments. Second, the impulsive OPEG is transformed into a bilateral optimization problem with a minimum-maximum optimization index in terms of terminal time, subject to multiple constraints such as maneuverability, total fuel consumption, and mission time. To determine the optimal impulsive maneuvers for both sides, a PRD-MADDPG (Predict-Reward-Detect Multi-Agent Deep Deterministic Policy Gradient) algorithm is designed within the framework of multi-agent reinforcement learning. This novel algorithm uses the basic MADDPG to train and learn the strategies, and applies the supplemental PRD to predict the change of the game state during the interval between two adjacent impulsive maneuvers and to incorporate this information into training in the form of a predicted reward. Finally, several pursuit-evasion missions near the Geosynchronous Earth Orbit are numerically analyzed to verify the validity and effectiveness of the algorithm. The results show that the PRD-MADDPG algorithm efficiently finds applicable strategies even under rather complex constraints. It is also shown that the learning-based strategies can be effectively applied in extended scenarios not seen in the training process. © 2023 COSPAR. Published by Elsevier B.V. All rights reserved.
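The "predicted reward" idea in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration, not the authors' implementation: the abstract does not state the dynamics model or the reward shape. The sketch assumes linearized Clohessy-Wiltshire relative motion about a circular reference orbit (x radial, y along-track, z cross-track), and a zero-sum reward equal to the negative of the minimum pursuer-evader distance predicted over the coast interval between two impulses. The names `cw_stm`, `predicted_reward`, and `GEO_MEAN_MOTION` are hypothetical.

```python
import numpy as np

# Assumed value: approximate mean motion of a geosynchronous orbit, rad/s.
GEO_MEAN_MOTION = 7.2921e-5


def cw_stm(n, t):
    """Clohessy-Wiltshire state transition matrix for state [x, y, z, vx, vy, vz]."""
    s, c = np.sin(n * t), np.cos(n * t)
    return np.array([
        [4 - 3 * c,        0, 0,      s / n,           2 * (1 - c) / n,         0],
        [6 * (s - n * t),  1, 0,      -2 * (1 - c) / n, (4 * s - 3 * n * t) / n, 0],
        [0,                0, c,      0,               0,                       s / n],
        [3 * n * s,        0, 0,      c,               2 * s,                   0],
        [-6 * n * (1 - c), 0, 0,      -2 * s,          4 * c - 3,               0],
        [0,                0, -n * s, 0,               0,                       c],
    ])


def predicted_reward(rel_state, dv_pursuer, dv_evader, interval,
                     n=GEO_MEAN_MOTION, samples=50):
    """Pursuer-side reward predicted over the coast following one impulse pair.

    rel_state is the evader state minus the pursuer state (m, m/s). Because the
    assumed dynamics are linear, the relative state obeys the same equations.
    The evader's reward would be the negative of this value (zero-sum assumption).
    """
    state = np.asarray(rel_state, dtype=float).copy()
    # Apply both impulses instantaneously to the relative velocity.
    state[3:] += np.asarray(dv_evader, dtype=float) - np.asarray(dv_pursuer, dtype=float)
    # Sample the coast arc and record the closest predicted approach.
    times = np.linspace(0.0, interval, samples)
    distances = [np.linalg.norm((cw_stm(n, t) @ state)[:3]) for t in times]
    return -min(distances)
```

As a usage note under the same assumptions: with the evader 1000 m ahead along-track and no impulses, the predicted reward stays at about -1000, while a 1 m/s along-track impulse by the pursuer over a 600 s interval yields a higher (less negative) predicted reward, which is the kind of signal that could be fed back into training between sparse impulse decisions.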
Pages: 211-230
Page count: 20
Related papers
37 in total
  • [1] An Algorithm for UAV Pursuit-Evasion Game Based on MADDPG and Contrastive Learning
    Wang R.
    Wang X.
    Yuhang Xuebao/Journal of Astronautics, 2024, 45 (02): 262-272
  • [2] Game Tree Search-based Impulsive Orbital Pursuit-Evasion Game with Limited Actions
    Xie, Wenyuan
    Zhao, Liran
    Dang, Zhaohui
    SPACE: SCIENCE & TECHNOLOGY, 2024, 4
  • [3] Orbital Impulsive Pursuit-Evasion Game Formulation and Stackelberg Equilibrium Solutions
    Li, Zhenyu
    Luo, Yazhong
    JOURNAL OF SPACECRAFT AND ROCKETS, 2024,
  • [4] A learning-based algorithm for turn-based orbital pursuit-evasion problem with reaction-time delay
    Zhao, Liran
    Sun, Qinbo
    Xu, Sihan
    Dang, Zhaohui
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 145
  • [5] Orbital Multi-Player Pursuit-Evasion Game with Deep Reinforcement Learning
    Zhen-yu Li
    Si Chen
    Chenghong Zhou
    Wei Sun
    The Journal of the Astronautical Sciences, 72 (1)
  • [6] A Pursuit-Evasion Algorithm Based on Hierarchical Reinforcement Learning
    Liu, Jie
    Liu, Shuhua
    Wu, Hongyan
    Zhang, Yu
    2009 INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION, VOL II, 2009: 482-486
  • [7] A Pursuit-Evasion Algorithm Based on Hierarchical Reinforcement Learning
    Liu, Shuhua
    Liu, Jie
    Cheng, Yu
    INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2010, 13 (03): 635-645
  • [8] Terminal-guidance Based Reinforcement-learning for Orbital Pursuit-evasion Game of the Spacecraft
    Geng Y.-Z.
    Yuan L.
    Huang H.
    Tang L.
    Zidonghua Xuebao/Acta Automatica Sinica, 2023, 49 (05): 974-984
  • [9] Reinforcement learning-based decision-making for spacecraft pursuit-evasion game in elliptical orbits
    Yu, Weizhuo
    Liu, Chuang
    Yue, Xiaokui
    CONTROL ENGINEERING PRACTICE, 2024, 153
  • [10] Impulsive maneuver strategy for multi-agent orbital pursuit-evasion game under sparse rewards
    Wang, Hongbo
    Zhang, Yao
    AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 155