PRD-MADDPG: An efficient learning-based algorithm for orbital pursuit-evasion game with impulsive maneuvers

被引：30

作者：

Zhao, Liran ^{[1
,2
]}

Zhang, Yulin ^{[1
,2
]}

Dang, Zhaohui ^{[1
,2
]}

机构：

[1] Northwestern Polytech Univ, Sch Astronaut, Xian 710072, Peoples R China

[2] Northwestern Polytech Univ, Natl Key Lab Aerosp Flight Dynam, Xian 710072, Peoples R China

来源：

ADVANCES IN SPACE RESEARCH | 2023年 / 72卷 / 02期

基金：

中国国家自然科学基金;

关键词：

Orbital pursuit-evasion game; Impulsive maneuver; Multi-agent deep reinforcement learning; PRD-MADDPG; REINFORCEMENT; NAVIGATION;

D O I：

10.1016/j.asr.2023.03.014

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

This paper comprehensively investigates the problem of impulsive orbital pursuit-evasion games (OPEGs) by using an artificial intelligence-based approach. First, the mathematical model for the impulsive OPEGs in which the pursuer and evader both perform their orbital maneuvers by imposing the impulsive velocity increments is constructed. Second, the problem of impulsive OPEGs is transformed into a bilateral optimization problem with a minimum-maximum optimization index in terms of terminal time and multiple constraints such as maneuverability, total fuel consumption, and mission time, etc. To determine the optimal impulsive maneuvers for both sides, a PRD-MADDPG (Predict-Reward-Detect Multi-Agent Deep Deterministic Policy Gradient) algorithm in the frame of multi-agent reinforcement learning is designed. This novel algorithm uses the basic MADDPG to achieve the strategies training and learning, and applies the supplemental PRD to predict the change of game state during the interval between two adjacent impulsive maneuvers and incorporate these information into the algorithm training in the form of predicted reward. Finally, some pursuit-evasion missions near the Geosynchronous Earth Orbit are numerically analyzed to verify the validness and effectiveness of the algorithm. The results prove that the PRD-MADDPG algorithm is very efficient to find applicable strategies even considering rather complex constraints. It is also shown that the learning-based strategies can be effectively applied in the extended scenarios which are not seen in the training process. & COPY; 2023 COSPAR. Published by Elsevier B.V. All rights reserved.

引用

页码：211 / 230

页数：20

共 37 条

[1] An Algorithm for UAV Pursuit-Evasion Game Based on MADDPG and Contrastive Learning
Wang R.
Wang X.
Yuhang Xuebao/Journal of Astronautics, 2024, 45 (02): : 262 - 272
[2] Game Tree Search-based Impulsive Orbital Pursuit-Evasion Game with Limited Actions
Xie, Wenyuan
Zhao, Liran
Dang, Zhaohui
SPACE: SCIENCE & TECHNOLOGY, 2024, 4
[3] Orbital Impulsive Pursuit-Evasion Game Formulation and Stackelberg Equilibrium Solutions
Li, Zhenyu
Luo, Yazhong
JOURNAL OF SPACECRAFT AND ROCKETS, 2024,
[4] A learning-based algorithm for turn-based orbital pursuit-evasion problem with reaction-time delay
Zhao, Liran
Sun, Qinbo
Xu, Sihan
Dang, Zhaohui
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 145
[5] Orbital Multi-Player Pursuit-Evasion Game with Deep Reinforcement Learning
Zhen-yu Li
Si Chen
Chenghong Zhou
Wei Sun
The Journal of the Astronautical Sciences, 72 (1)
[6] A Pursuit-Evasion Algorithm Based on Hierarchical Reinforcement Learning
Liu, Jie
Liu, Shuhua
Wu, Hongyan
Zhang, Yu
2009 INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION, VOL II, 2009, : 482 - 486
[7] A Pursuit-Evasion Algorithm Based on Hierarchical Reinforcement Learning
Liu, Shuhua
Liu, Jie
Cheng, Yu
INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2010, 13 (03): : 635 - 645
[8] Terminal-guidance Based Reinforcement-learning for Orbital Pursuit-evasion Game of the Spacecraft
Geng Y.-Z.
Yuan L.
Huang H.
Tang L.
Zidonghua Xuebao/Acta Automatica Sinica, 2023, 49 (05): : 974 - 984
[9] Reinforcement learning-based decision-making for spacecraft pursuit-evasion game in elliptical orbits
Yu, Weizhuo
Liu, Chuang
Yue, Xiaokui
CONTROL ENGINEERING PRACTICE, 2024, 153
[10] Impulsive maneuver strategy for multi-agent orbital pursuit-evasion game under sparse rewards
Wang, Hongbo
Zhang, Yao
AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 155

← 1 2 3 4 →