Model-Based Reinforcement Learning for Control of Strongly Disturbed Unsteady Aerodynamic Flows

被引:2
作者
Liu, Zhecheng [1 ]
Beckers, Diederik [2 ]
Eldredge, Jeff D. [1 ]
机构
[1] Univ Calif Los Angeles, Dept Mech & Aerosp Engn, Los Angeles, CA 90095 USA
[2] CALTECH, Grad Aerosp Labs, Pasadena, CA 91125 USA
基金
美国国家科学基金会;
关键词
Representation Learning; Convolutional Neural Network; Pitching Airfoil; Fluid Dynamics; Vortex Structure; Aerodynamic Performance; Proper Orthogonal Decomposition; Reinforcement Learning; Data-Driven Model; Active Flow Control; NEURAL-NETWORKS; DECOMPOSITION;
D O I
10.2514/1.J064790
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
The intrinsic high dimension of fluid dynamics is an inherent challenge to control of aerodynamic flows, and this is further complicated by a flow's nonlinear response to strong disturbances. Deep reinforcement learning, which takes advantage of the exploratory aspects of reinforcement learning (RL) and the rich nonlinearity of a deep neural network, provides a promising approach to discover feasible control strategies. However, the typical model-free approach to reinforcement learning requires a significant amount of interaction between the flow environment and the RL agent during training, and this high training cost impedes its development and application. In this work, we propose a model-based reinforcement learning (MBRL) approach by incorporating a novel reduced-order model as a surrogate for the full environment. The model consists of a physics-augmented autoencoder, which compresses high-dimensional CFD flowfield snapshots into a three-dimensional latent space, and a latent dynamics model that is trained to accurately predict the long-time dynamics of trajectories in the latent space in response to action sequences. The accuracy and robustness of the model are demonstrated in the scenario of a pitching airfoil within a highly disturbed environment. Additionally, an application to a vertical-axis wind turbine in a disturbance-free environment is discussed in the Appendix. Based on the model trained in the pitching airfoil problem, we realize an MBRL strategy to mitigate lift variation during gust-airfoil encounters. We demonstrate that the policy learned in the reduced-order environment translates to an effective control strategy in the full CFD environment.
引用
收藏
页数:21
相关论文
共 31 条
[11]   Reducing the dimensionality of data with neural networks [J].
Hinton, G. E. ;
Salakhutdinov, R. R. .
SCIENCE, 2006, 313 (5786) :504-507
[12]   Physics and Modeling of Large Flow Disturbances: Discrete Gust Encounters for Modern Air Vehicles [J].
Jones, Anya R. ;
Cetiner, Oksan ;
Smith, Marilyn J. .
ANNUAL REVIEW OF FLUID MECHANICS, 2022, 54 :469-493
[13]   Closed-Loop Control of Lift for Longitudinal Gust Suppression at Low Reynolds Numbers [J].
Kerstens, Wesley ;
Pfeiffer, Jens ;
Williams, David ;
King, Rudibert ;
Colonius, Tim .
AIAA JOURNAL, 2011, 49 (08) :1721-1728
[14]   Optimal blade pitch control for enhanced vertical-axis wind turbine performance [J].
Le Fouest, Sebastien ;
Mulleners, Karen .
NATURE COMMUNICATIONS, 2024, 15 (01)
[15]   State representation learning for control: An overview [J].
Lesort, Timothee ;
Diaz-Rodriguez, Natalia ;
Goudou, Jean-Franois ;
Filliat, David .
NEURAL NETWORKS, 2018, 108 :379-392
[16]   Turbulence control in plane Couette flow using low-dimensional neural ODE-based models and deep reinforcement learning [J].
Linot, Alec J. ;
Zeng, Kevin ;
Graham, Michael D. .
INTERNATIONAL JOURNAL OF HEAT AND FLUID FLOW, 2023, 101
[17]   Stabilized neural ordinary differential equations for long-time forecasting of dynamical systems [J].
Linot, Alec J. ;
Burby, Joshua W. ;
Tang, Qi ;
Balaprakash, Prasanna ;
Graham, Michael D. ;
Maulik, Romit .
JOURNAL OF COMPUTATIONAL PHYSICS, 2023, 474
[18]   Model-based Reinforcement Learning: A Survey [J].
Moerland, Thomas M. ;
Broekens, Joost ;
Plaat, Aske ;
Jonker, Catholijn M. .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2023, 16 (01) :1-118
[19]   Nonlinear mode decomposition with convolutional neural networks for fluid dynamics [J].
Murata, Takaaki ;
Fukami, Kai ;
Fukagata, Koji .
JOURNAL OF FLUID MECHANICS, 2020, 882
[20]   Synchronisation through learning for two self-propelled swimmers [J].
Novati, Guido ;
Verma, Siddhartha ;
Alexeev, Dmitry ;
Rossinelli, Diego ;
van Rees, Wim M. ;
Koumoutsakos, Petros .
BIOINSPIRATION & BIOMIMETICS, 2017, 12 (03)