Station-keeping for high-altitude balloon with reinforcement learning

被引:7
作者
Xu, Ziyuan [1 ]
Liu, Yang [1 ]
Du, Huafei [1 ]
Lv, Mingyun [1 ]
机构
[1] Beihang Univ, Sch Aeronaut Sci & Engn, Beijing 100191, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Dueling double deep Q learning network; Model of the uncertain wind field; Reinforcement learning; Stratospheric balloon; Station-keeping strategy; PERFORMANCE ANALYSIS; DISTRIBUTIONS; PREDICTION; DIRECTION; GUIDANCE;
D O I
10.1016/j.asr.2022.05.006
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
The operating altitudes of the stratospheric balloons can reach near-space altitudes of more than 20 km, where the appearance of the quasi-zero wind layer follows the seasonal rule. As unmanned aerial vehicles with application potential, the effective flight control of balloon position is crucial. This research develops a station-keeping control approach based on reinforcement learning, and the control strategy also considers the characteristics of the local wind field. Firstly, an atmospheric environment model with an uncertain wind field is established according to the analysis of the historical wind data. The model serves as a training environment for the balloon station-keeping strategy training. Secondly, the thermal model, dynamic model, and altitude control model are introduced, and an environment based on historical real wind data is developed. Thirdly, the dueling double Q-learning deep network with prioritized experience replay method is applied to the station keeping of high-altitude balloons. The Priority Experience Replay based on High-Value Samples (HVS-PER) is developed to improve the stability of strategy training. Finally, the performance of the optimal network is evaluated by the total reward, horizontal displacement, and effective working time under the uncertain wind environment. The strategy analysis also has reference to exploiting appropriate initial positions and capturing the opportunity of releasing a balloon. This work confirms that the control strategy is viable in complex and variable wind field environments and is capable of long-duration flights. (C) 2022 COSPAR. Published by Elsevier B.V. All rights reserved.
引用
收藏
页码:733 / 751
页数:19
相关论文
共 50 条
  • [21] Long-Term Electric-Propulsion Geostationary Station-Keeping via Integer Programming
    Gazzino, Clement
    Arzelier, Denis
    Louembet, Christophe
    Cerri, Luca
    Pittet, Christelle
    Losa, Damiana
    JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2019, 42 (05) : 976 - 991
  • [22] Station keeping control method based on deep reinforcement learning for stratospheric aerostat in dynamic wind field
    Bai, Fangchao
    Yang, Xixiang
    Deng, Xiaolong
    Ma, Zhenyu
    Long, Yuan
    ADVANCES IN SPACE RESEARCH, 2025, 75 (01) : 752 - 766
  • [23] Drone Altitude Control with Reinforcement Learning
    Fu, Xilin
    Tay, Eng Hock Francis
    Hu, Junru
    Zhang, Yingnan
    Ding, Yi
    PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 590 - 594
  • [24] Physiological characteristics of elite high-altitude climbers
    Puthon, L.
    Bouzat, P.
    Rupp, T.
    Robach, P.
    Favre-Juvin, A.
    Verges, S.
    SCANDINAVIAN JOURNAL OF MEDICINE & SCIENCE IN SPORTS, 2016, 26 (09) : 1052 - 1059
  • [25] Opportunities and Potential of Model Predictive Control for Low-Thrust Spacecraft Station-Keeping and Momentum-Management
    Weiss, Avishai
    Di Cairano, Stefano
    2015 EUROPEAN CONTROL CONFERENCE (ECC), 2015, : 1370 - 1375
  • [26] Altitude control for high-speed vehicles in the cruise phase based on reinforcement learning
    Chi H.
    Yu F.
    Guo Z.
    Harbin Gongcheng Daxue Xuebao/Journal of Harbin Engineering University, 2021, 42 (09): : 1340 - 1346and1362
  • [27] Reinforcement Learning for Altitude Hold and Path Planning in a Quadcopter
    Karthik, P. B.
    Kumar, Keshav
    Fernandes, Vikrant
    Arya, Kavi
    2020 6TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS (ICCAR), 2020, : 463 - 467
  • [28] A Base Station Placement Method For High-precision Positioning Using Reinforcement Learning
    Hwang J.G.
    Park J.G.
    Journal of Institute of Control, Robotics and Systems, 2023, 29 (11) : 836 - 840
  • [29] Atmospheric ice nuclei at the high-altitude observatory Jungfraujoch, Switzerland
    Conen, Franz
    Rodriguez, Sergio
    Hueglin, Christoph
    Henne, Stephan
    Herrmann, Erik
    Bukowiecki, Nicolas
    Alewell, Christine
    TELLUS SERIES B-CHEMICAL AND PHYSICAL METEOROLOGY, 2015, 67
  • [30] Path Planning for Autonomous Balloon Navigation with Reinforcement Learning
    He, Yingzhe
    Guo, Kai
    Wang, Chisheng
    Fu, Keyi
    Zheng, Jiehao
    ELECTRONICS, 2025, 14 (01):