Robust solar sail trajectories using proximal policy optimization

被引：2

作者：

Bianchi, Christian ^{[1
]}

Niccolai, Lorenzo ^{[1
]}

Mengali, Giovanni ^{[1
]}

机构：

[1] Univ Pisa, Dept Civil & Ind Engn, Via G Caruso 8, I-56122 Pisa, Italy

来源：

ACTA ASTRONAUTICA | 2025年 / 226卷

关键词：

Solar sail; Reinforcement learning; Stochastic trajectory optimization; Uncertainty sources; Proximal policy optimization; TIME OPTIMAL-CONTROL; WRINKLING ANALYSIS; DESIGN; GUIDANCE;

D O I：

10.1016/j.actaastro.2024.10.065

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

Reinforcement learning is used to design minimum-time trajectories of solar sails subject to the typical sources of uncertainty associated with such a propulsion system, i.e., inaccurate knowledge of the sail's optical properties and the presence of wrinkles on the sail membrane. A proximal policy optimization (PPO) algorithm is used to train the agent and derive the control policy that associates the optimal sail attitude with each dynamic state. First, the agent is trained assuming deterministic unperturbed dynamics, and the results are compared with optimal solutions found by an indirect optimization method, thus demonstrating the effectiveness of this approach. Next, two stochastic scenarios are analysed. In the first, the optical coefficients of the sail are assumed to be random variables with Gaussian distribution, which leads to random variations in the sail characteristic acceleration. In the second scenario, wrinkles on the sail membrane are taken into account, resulting in a misalignment of the thrust vector with respect to a perfectly smooth surface. Both phenomena are modelled based on experimental measurements available in the literature in order to perform realistic analyses. In the stochastic scenarios, Monte Carlo simulations are performed using the trained policies, demonstrating that the reinforcement learning approach is capable of finding near time-optimal solutions, while also being robust to the sources of uncertainty considered.

引用

页码：702 / 715

页数：14

共 74 条

[1]

Allison Vedant J.T, 2019, P 70 INT ASTRONAUTIC

[2] Sliding mode control for attitude maneuvers of Helianthus solar sail [J].