Robust solar sail trajectories using proximal policy optimization

被引:2
作者
Bianchi, Christian [1 ]
Niccolai, Lorenzo [1 ]
Mengali, Giovanni [1 ]
机构
[1] Univ Pisa, Dept Civil & Ind Engn, Via G Caruso 8, I-56122 Pisa, Italy
关键词
Solar sail; Reinforcement learning; Stochastic trajectory optimization; Uncertainty sources; Proximal policy optimization; TIME OPTIMAL-CONTROL; WRINKLING ANALYSIS; DESIGN; GUIDANCE;
D O I
10.1016/j.actaastro.2024.10.065
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Reinforcement learning is used to design minimum-time trajectories of solar sails subject to the typical sources of uncertainty associated with such a propulsion system, i.e., inaccurate knowledge of the sail's optical properties and the presence of wrinkles on the sail membrane. A proximal policy optimization (PPO) algorithm is used to train the agent and derive the control policy that associates the optimal sail attitude with each dynamic state. First, the agent is trained assuming deterministic unperturbed dynamics, and the results are compared with optimal solutions found by an indirect optimization method, thus demonstrating the effectiveness of this approach. Next, two stochastic scenarios are analysed. In the first, the optical coefficients of the sail are assumed to be random variables with Gaussian distribution, which leads to random variations in the sail characteristic acceleration. In the second scenario, wrinkles on the sail membrane are taken into account, resulting in a misalignment of the thrust vector with respect to a perfectly smooth surface. Both phenomena are modelled based on experimental measurements available in the literature in order to perform realistic analyses. In the stochastic scenarios, Monte Carlo simulations are performed using the trained policies, demonstrating that the reinforcement learning approach is capable of finding near time-optimal solutions, while also being robust to the sources of uncertainty considered.
引用
收藏
页码:702 / 715
页数:14
相关论文
共 74 条
[1]  
Allison Vedant J.T, 2019, P 70 INT ASTRONAUTIC
[2]   Sliding mode control for attitude maneuvers of Helianthus solar sail [J].
Bassetto, Marco ;
Niccolai, Lorenzo ;
Boni, Luisa ;
Mengali, Giovanni ;
Quarta, Alessandro A. ;
Circi, Christian ;
Pizzurro, Simone ;
Pizzarelli, Marco ;
Pellegrini, Rocco C. ;
Cavallini, Enrico .
ACTA ASTRONAUTICA, 2022, 198 :100-110
[3]  
Betts JT, 2010, ADV DES CONTROL, P411
[4]   Survey of numerical methods for trajectory optimization [J].
Betts, JT .
JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 1998, 21 (02) :193-207
[5]   Collinear artificial equilibrium point maintenance with a wrinkled solar sail [J].
Bianchi, Christian ;
Niccolai, Lorenzo ;
Mengali, Giovanni ;
Quarta, Alessandro A. .
AEROSPACE SCIENCE AND TECHNOLOGY, 2021, 119
[6]   Finite element analysis of solar sail force model with mission application [J].
Boni, Luisa ;
Mengali, Giovanni ;
Quarta, Alessandro A. .
PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART G-JOURNAL OF AEROSPACE ENGINEERING, 2019, 233 (05) :1838-1846
[7]  
Bryson A.E., 1975, Applied Optimal Control: Optimization, Estimation and Control, P87
[8]   Effects of attitude constraints on solar sail optimal interplanetary trajectories [J].
Caruso, Andrea ;
Niccolai, Lorenzo ;
Quarta, Alessandro A. ;
Mengali, Giovanni .
ACTA ASTRONAUTICA, 2020, 177 :39-47
[9]   Solar sail optimal control with solar irradiance fluctuations [J].
Caruso, Andrea ;
Mengali, Giovanni ;
Quarta, Alessandro A. ;
Niccolai, Lorenzo .
ADVANCES IN SPACE RESEARCH, 2021, 67 (09) :2776-2783
[10]   Reinforcement-learning-based fixed-time attitude consensus control for multiple spacecraft systems with model uncertainties [J].
Chen, Run-Ze ;
Li, Yuan-Xin ;
Ahn, Choon Ki .
AEROSPACE SCIENCE AND TECHNOLOGY, 2023, 132