Multi energy dynamic soaring trajectory optimization method based on reinforcement learning

被引:0
作者
Zhang, Yunfei [1 ,2 ]
Wang, Honglun [1 ,2 ]
Zhang, Menghua [1 ,2 ]
Gong, Yinan [3 ]
机构
[1] School of Automation Science and Electrical Engineering, Beihang University, Beijing
[2] The Science and Technology on Aircraft Control Laboratory, Beihang University, Beijing
[3] Hiwing Aviation General Equipment Co., Ltd., Beijing
来源
Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University | 2025年 / 43卷 / 01期
关键词
dynamic soaring; Gaussian pseudospectral method; reinforcement learning; trajectory optimization;
D O I
10.1051/jnwpu/20254310128
中图分类号
学科分类号
摘要
In addressing the issue of dynamic soaring in unmanned aerial vehicles, a trajectory optimization approach based on deep reinforcement learning is proposed. This method synergistically utilizes gradient wind energy and solar energy and incorporates obstacle constraints to simulate complex barrier environments. It employs neural networks to approximate the Gaussian pseudospectral method for solving trajectory policies. On the foundation of the trained policies, the method utilizes the twin delayed deep deterministic policy gradient algorithm for policy enhancement. This significantly boosts the real-time inference capabilities while addressing the challenges traditional optimal control algorithms face in dynamic soaring due to varying wind fields. The experiments initially validate the approach through simulation of two classic modes of dynamic soaring, followed by Monte Carlo simulations considering multiple energy sources. The results indicate that the dynamic soaring trajectory optimization method based on deep reinforcement learning achieves energy acquisition comparable to optimal outcomes within a single soaring cycle, with a 91% reduction in real-time inference decision time. Moreover, in changing wind field environments, this method demonstrates superior adaptability compared to traditional approaches. ©2025 Journal of Northwestern Polytechnical University.
引用
收藏
页码:128 / 139
页数:11
相关论文
共 24 条
[1]  
MIR I, EISA S A, TAHA H, Et al., A stability perspective of bioinspired unmanned aerial vehicles performing optimal dynamic soaring, Bioinspiration & Biomimetics, 16, 6, (2021)
[2]  
LIU S, BAI J, WANG C., Energy acquisition of a small solar UAV using dynamic soaring, The Aeronautical Journal, 125, 1283, pp. 60-86, (2021)
[3]  
LIU Duoneng, Research on mechanism and trajectory optimization for dynamic soaring with fixed-wing unmanned aerial vehicles, (2016)
[4]  
ZHU Yi, LI Jiguang, HAO Xiangyu, Optimization of dynamic gliding flight trajectory for UAV in gradient wind fields, Journal of Xi′an Aeronautical Institute, 41, 5, pp. 8-16, (2023)
[5]  
SACHS G P., Maximum travel speed performance of albatrosses and UAVs using dynamic soaring, AIAA Scitech 2019 Forum, (2019)
[6]  
MIR I, GUL F, EISA S, Et al., On the stability of dynamic soaring: Floquet-based investigation, AIAA Science and Technology Forum and Exposition, (2022)
[7]  
ZWENIG A, HONG H, HOLZAPFEL F., Sensitivity analysis of the energy balance of dynamic soaring, Journal of Physics, 2514, 1, (2023)
[8]  
BOWER G C., Boundary layer dynamic soaring for autonomous aircraft: design and validation, (2011)
[9]  
HONG H, ZHENG H, HOLZAPFEL F, Et al., Dynamic soaring in unspecified wind shear: a real-time quadratic-programming approach, 2019 27th Mediterranean Conference on Control and Automation, pp. 600-605, (2019)
[10]  
LAWRANCE N R J, SUKKARIEH S., Autonomous exploration of a wind field with a gliding aircraft, Journal of Guidance, Control, and Dynamics, 34, 3, pp. 719-733, (2011)