Reinforcement learning-based satellite formation attitude control under multi-constraint

被引:0
作者
Cai, Yingkai [1 ]
Low, Kay-Soon [2 ]
Wang, Zhaokui [1 ]
机构
[1] Tsinghua Univ, Sch Aerosp Engn, Beijing 100084, Peoples R China
[2] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore 117292, Singapore
基金
中国国家自然科学基金;
关键词
Satellite formation; Attitude control; Phased priority reinforcement learning; Multi-constraint; Multi-agent system; SPACECRAFT;
D O I
10.1016/j.asr.2024.07.084
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
As the complexity of space missions increases, the constraints on satellite attitude control become more stringent, particularly for satellites working in orbit formation. This paper introduces a novel method, based on the categorization and modeling of different constraints, for attitude control of satellite formations under multiple constraints. The method employs the Phased Priority Reinforcement Learning (PPRL) approach, which utilizes Deep Deterministic Policy Gradient (DDPG) technology. Considering the complexity of constraints and the challenge posed by the high control dimensionality due to multi-satellite coordination, the method addresses these challenges through a two-step training strategy. The first step addresses the multi-constraint issue for individual satellites and increases the priority of single-satellite training experience data in the experience replay buffer of the second step to enhance data utilization efficiency. To address the issue of reward sparsity in complex high-dimensional constraint models, a detailed reward mechanism is proposed, incorporating both local and global constraints into the reward function, thereby achieving both efficient and effective attitude control. This approach not only meets dynamic, state, and performance constraints but also demonstrates adaptability and robustness through numerical simulations. Compared to traditional methods, this approach achieves significant improvements in control performance and constraint satisfaction, offering a novel solution pathway for high-dimensional control problems in multi-constraint satellite formations. (c) 2024 COSPAR. Published by Elsevier B.V. All rights are reserved, including those for text and data mining, AI training, and similar technologies.
引用
收藏
页码:5819 / 5836
页数:18
相关论文
共 39 条
[1]   Sun-Avoidance Slew Planning with Keep-Out Cone and Actuator Constraints [J].
Ayoubi, Mohammad A. ;
Hsin, Junette .
JOURNAL OF SPACECRAFT AND ROCKETS, 2020, 57 (06) :1175-1185
[2]  
Bonifazi Giuseppe, 2015, 2015 IEEE Sensors. Proceedings, P1, DOI 10.1109/ICSENS.2015.7370458
[3]   Constrained single-axis path planning of underactuated spacecraft [J].
Duan, Chao ;
Hu, Qinglei ;
Zhang, Youmin ;
Wu, Huaining .
AEROSPACE SCIENCE AND TECHNOLOGY, 2020, 107
[4]   Bridging Reinforcement Learning and Online Learning for Spacecraft Attitude Control [J].
Elkins, Jacob G. ;
Sood, Rohan ;
Rumpf, Clemens .
JOURNAL OF AEROSPACE INFORMATION SYSTEMS, 2021, :62-69
[5]   Agile Development of Small Satellite's Attitude Determination and Control System [J].
Foo, K. J. ;
Tissera, M. S. C. ;
Tan, R. D. ;
Low, K. S. .
2023 IEEE AEROSPACE CONFERENCE, 2023,
[6]   Pose Regulation via the Dual Unitary Group: An Application to Spacecraft Rendezvous [J].
Geng, Yuanzhuo ;
Biggs, James Douglas ;
Li, Chuanjiang .
IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2021, 57 (06) :3734-3748
[7]   CloudScout: A Deep Neural Network for On-Board Cloud Detection on Hyperspectral Images [J].
Giuffrida, Gianluca ;
Diana, Lorenzo ;
de Gioia, Francesco ;
Benelli, Gionata ;
Meoni, Gabriele ;
Donati, Massimiliano ;
Fanucci, Luca .
REMOTE SENSING, 2020, 12 (14)
[8]   Attitude commands avoiding bright objects and maintaining communication with ground station [J].
Hablani, HB .
JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 1999, 22 (06) :759-767
[9]   Single-Agent Reinforcement Learning for Scalable Earth-Observing Satellite Constellation Operations [J].
Herrmann, Adam ;
Stephenson, Mark A. ;
Schaub, Hanspeter .
JOURNAL OF SPACECRAFT AND ROCKETS, 2024, 61 (01) :114-132
[10]   Anti-Unwinding Attitude Control of Spacecraft with Forbidden Pointing Constraints [J].
Hu, Qinglei ;
Chi, Biru ;
Akella, Maruthi R. .
JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2019, 42 (04) :822-835