Weather Optimal Station Keeping Control for Airship Based on Deep Reinforcement Learning

被引:0
作者
Wen, Hongyi [1 ]
Zheng, Zewei [1 ]
Zhang, Yifei [2 ]
Chen, Tian [2 ]
Zhu, Ming [2 ]
机构
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
[2] Beihang Univ, Inst Unmanned Syst, Beijing 100191, Peoples R China
来源
ADVANCES IN GUIDANCE, NAVIGATION AND CONTROL, VOL 18 | 2025年 / 1354卷
关键词
airship; station keeping; weather optimal; deep reinforcement learning; PATH-FOLLOWING CONTROL;
D O I
10.1007/978-981-96-2268-9_25
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This research paper introduces a method to address the problem of station keeping controlling of underactuated stratospheric airships when faced with unpredictable external wind disturbances. The proposed algorithm combines weather optimal control theory and deep reinforcement learning theory to achieve station keeping control. In terms of details, an outer-loop guidance law is developed using weather optimal control theory. This guidance law allows the airship to autonomously follow a pendulum-like arc trajectory towards a stable point that counteracts the wind. Additionally, a deep reinforcement learning agent is used as the inner-loop attitude controller for the airship. A reward function is designed to facilitate model-free, self-learning control, eliminating the need for a physical model of the airship. The algorithm is finally validated through simulations, which demonstrate its effectiveness.
引用
收藏
页码:259 / 268
页数:10
相关论文
共 10 条
[1]  
Acosta D., 2007, AIAA GUIDANCE NAVIGA
[2]   Asymmetric error-constrained path-following control of a stratospheric airship with disturbances and actuator saturation [J].
Chen, Tian ;
Zhu, Ming ;
Zheng, Zewei .
MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2019, 119 :501-522
[3]  
Duan Y, 2016, PR MACH LEARN RES, V48
[4]  
Lillicrap T., 2015, ARXIV
[5]   Adaptive sliding-mode-backstepping trajectory tracking control of underactuated airships [J].
Liu, Shi Qian ;
Sang, Yuan Jun ;
Whidborne, James F. .
AEROSPACE SCIENCE AND TECHNOLOGY, 2020, 97
[6]  
Lutz M., 2021, Optimal trajectory planning and model predictive control of underactuated marine surface vessels using a flatness-based approach
[7]   DYNAMICS AND CONTROL OF A HEAVY LIFT AIRSHIP HOVERING IN A TURBULENT CROSS WIND [J].
NAGABHUSHAN, BL ;
TOMLINSON, NP .
JOURNAL OF AIRCRAFT, 1982, 19 (10) :826-830
[8]   Hovering Control of Outdoor Blimp Robots Based on Path Following [J].
Saiki, Hitomu ;
Fukao, Takanori ;
Urakubo, Takateru ;
Kohno, Takashi .
2010 IEEE INTERNATIONAL CONFERENCE ON CONTROL APPLICATIONS, 2010, :2124-2129
[9]  
Wang C., 2019, Weather optimal area keeping intermittent control based on reducing energy consumption of USV
[10]   Autonomous airship path following control: Theory and experiments [J].
Zheng, Zewei ;
Huo, Wei ;
Wu, Zhe .
CONTROL ENGINEERING PRACTICE, 2013, 21 (06) :769-788