Dueling double deep Q learning network;
Model of the uncertain wind field;
Reinforcement learning;
Stratospheric balloon;
Station-keeping strategy;
PERFORMANCE ANALYSIS;
DISTRIBUTIONS;
PREDICTION;
DIRECTION;
GUIDANCE;
D O I:
10.1016/j.asr.2022.05.006
中图分类号:
V [航空、航天];
学科分类号:
08 ;
0825 ;
摘要:
The operating altitudes of the stratospheric balloons can reach near-space altitudes of more than 20 km, where the appearance of the quasi-zero wind layer follows the seasonal rule. As unmanned aerial vehicles with application potential, the effective flight control of balloon position is crucial. This research develops a station-keeping control approach based on reinforcement learning, and the control strategy also considers the characteristics of the local wind field. Firstly, an atmospheric environment model with an uncertain wind field is established according to the analysis of the historical wind data. The model serves as a training environment for the balloon station-keeping strategy training. Secondly, the thermal model, dynamic model, and altitude control model are introduced, and an environment based on historical real wind data is developed. Thirdly, the dueling double Q-learning deep network with prioritized experience replay method is applied to the station keeping of high-altitude balloons. The Priority Experience Replay based on High-Value Samples (HVS-PER) is developed to improve the stability of strategy training. Finally, the performance of the optimal network is evaluated by the total reward, horizontal displacement, and effective working time under the uncertain wind environment. The strategy analysis also has reference to exploiting appropriate initial positions and capturing the opportunity of releasing a balloon. This work confirms that the control strategy is viable in complex and variable wind field environments and is capable of long-duration flights. (C) 2022 COSPAR. Published by Elsevier B.V. All rights reserved.
机构:
Univ Michigan, Space Res Bldg,2455 Hayward St, Ann Arbor, MI 48109 USA
McMaster Univ, 1280 Main St West,John Hodgins Engn Bldg A315, Hamilton, ON L8S 4L7, CanadaUniv Michigan, Space Res Bldg,2455 Hayward St, Ann Arbor, MI 48109 USA
van Wynsberghe, Erinn
Turak, Ayse
论文数: 0引用数: 0
h-index: 0
机构:
McMaster Univ, 1280 Main St West,John Hodgins Engn Bldg A315, Hamilton, ON L8S 4L7, CanadaUniv Michigan, Space Res Bldg,2455 Hayward St, Ann Arbor, MI 48109 USA
机构:
Xian Satellite Control Ctr, State Key Lab Astronaut Dynam, Xian 710043, Peoples R ChinaXian Satellite Control Ctr, State Key Lab Astronaut Dynam, Xian 710043, Peoples R China
Ren, Bo
Zhu, Zhicheng
论文数: 0引用数: 0
h-index: 0
机构:
Xian Satellite Control Ctr, State Key Lab Astronaut Dynam, Xian 710043, Peoples R ChinaXian Satellite Control Ctr, State Key Lab Astronaut Dynam, Xian 710043, Peoples R China
Zhu, Zhicheng
Yang, Fan
论文数: 0引用数: 0
h-index: 0
机构:
Xian Satellite Control Ctr, State Key Lab Astronaut Dynam, Xian 710043, Peoples R ChinaXian Satellite Control Ctr, State Key Lab Astronaut Dynam, Xian 710043, Peoples R China
Yang, Fan
Wu, Tao
论文数: 0引用数: 0
h-index: 0
机构:
Xian Satellite Control Ctr, State Key Lab Astronaut Dynam, Xian 710043, Peoples R ChinaXian Satellite Control Ctr, State Key Lab Astronaut Dynam, Xian 710043, Peoples R China
Wu, Tao
Yuan, Hui
论文数: 0引用数: 0
h-index: 0
机构:
Xian Satellite Control Ctr, State Key Lab Astronaut Dynam, Xian 710043, Peoples R ChinaXian Satellite Control Ctr, State Key Lab Astronaut Dynam, Xian 710043, Peoples R China