Station-keeping for high-altitude balloon with reinforcement learning

被引：9

作者：

Xu, Ziyuan ^{[1
]}

Liu, Yang ^{[1
]}

Du, Huafei ^{[1
]}

Lv, Mingyun ^{[1
]}

机构：

[1] Beihang Univ, Sch Aeronaut Sci & Engn, Beijing 100191, Peoples R China

来源：

ADVANCES IN SPACE RESEARCH | 2022年 / 70卷 / 03期

基金：

中国国家自然科学基金; 中国博士后科学基金;

关键词：

Dueling double deep Q learning network; Model of the uncertain wind field; Reinforcement learning; Stratospheric balloon; Station-keeping strategy; PERFORMANCE ANALYSIS; DISTRIBUTIONS; PREDICTION; DIRECTION; GUIDANCE;

D O I：

10.1016/j.asr.2022.05.006

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

The operating altitudes of the stratospheric balloons can reach near-space altitudes of more than 20 km, where the appearance of the quasi-zero wind layer follows the seasonal rule. As unmanned aerial vehicles with application potential, the effective flight control of balloon position is crucial. This research develops a station-keeping control approach based on reinforcement learning, and the control strategy also considers the characteristics of the local wind field. Firstly, an atmospheric environment model with an uncertain wind field is established according to the analysis of the historical wind data. The model serves as a training environment for the balloon station-keeping strategy training. Secondly, the thermal model, dynamic model, and altitude control model are introduced, and an environment based on historical real wind data is developed. Thirdly, the dueling double Q-learning deep network with prioritized experience replay method is applied to the station keeping of high-altitude balloons. The Priority Experience Replay based on High-Value Samples (HVS-PER) is developed to improve the stability of strategy training. Finally, the performance of the optimal network is evaluated by the total reward, horizontal displacement, and effective working time under the uncertain wind environment. The strategy analysis also has reference to exploiting appropriate initial positions and capturing the opportunity of releasing a balloon. This work confirms that the control strategy is viable in complex and variable wind field environments and is capable of long-duration flights. (C) 2022 COSPAR. Published by Elsevier B.V. All rights reserved.

引用

页码：733 / 751

页数：19

共 50 条

[41] Reinforcement learning optimization for base station sleeping strategy in coordinated multipoint (CoMP) communications [J].

Wen, Shuhuan ;

Hu, Baozhu ;

Lam, H. K. .

NEUROCOMPUTING, 2015, 167 :443-450

[42] The Short Life of Upvalley Wind in a High-Altitude Valley in the Colorado Rocky Mountains [J].

Adler, Bianca ;

Caicedo, Vanessa ;

Butterworth, Brian J. ;

Bianco, Laura ;

Cox, Christopher J. ;

de Boer, Gijs ;

Gutman, Ethan ;

Intrieri, Janet M. ;

Meyers, Tilden ;

Sedlar, Joseph ;

Turner, David D. ;

Wilczak, James .

JOURNAL OF GEOPHYSICAL RESEARCH-ATMOSPHERES, 2025, 130 (11)

[43] Model-Free Reinforcement Learning based Lateral Control for Lane Keeping [J].

Zhang, Qichao ;

Luo, Rui ;

Zhao, Dongbin ;

Luo, Chaomin ;

Qian, Dianwei .

2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,

[44] Train Station Parking Approach Based on Fuzzy Reinforcement Learning Algorithms [J].

Yin, Jiateng ;

Su, Shuai ;

Li, Kaicheng ;

Tang, Tao .

2019 IEEE 15TH INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2019, :1411-1416

[45] Reinforcement Learning for Charging Scheduling in a Renewable Powered Battery Swapping Station [J].

Renga, Daniela ;

Spoturno, Felipe ;

Meo, Michela .

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (10) :14382-14398

[46] Drone Base Station Positioning and Power Allocation using Reinforcement Learning [J].

Parisotto, Rafaela de Paula ;

Klaine, Paulo, V ;

Nadas, Joao P. B. ;

Souza, Richard Demo ;

Brante, Glauber ;

Imran, Muhammad A. .

2019 16TH INTERNATIONAL SYMPOSIUM ON WIRELESS COMMUNICATION SYSTEMS (ISWCS), 2019, :213-217

[47] Reinforcement Learning Based Algorithm for the Maximization of EV Charging Station Revenue [J].

Dimitrov, Stoyan ;

Lguensat, Redouane .

2014 INTERNATIONAL CONFERENCE ON MATHEMATICS AND COMPUTERS IN SCIENCES AND IN INDUSTRY (MCSI 2014), 2014, :235-239

[48] Performance analysis of rotatable energy system of high-altitude airships in real wind field [J].

Zhu, Weiyu ;

Xu, Yuanming ;

Li, Jun ;

Zhang, Lanchuan .

AEROSPACE SCIENCE AND TECHNOLOGY, 2020, 98

[49] Agreement between cardiovascular risk scores in a high-altitude Andean population with rheumatoid arthritis [J].

Diaz-Arocutipa, Carlos ;

Lumbe-Diaz, Vidia ;

Soto-Becerra, Percy .

REUMATOLOGIA CLINICA, 2025, 21 (03)

[50] Disproportional risk for habitat loss of high-altitude endemic species under climate change [J].

Dirnboeck, Thomas ;

Essl, Franz ;

Rabitsch, Wolfgang .

GLOBAL CHANGE BIOLOGY, 2011, 17 (02) :990-996

← 1 2 3 4 5 →