Model-Based Reinforcement Learning for Control of Strongly Disturbed Unsteady Aerodynamic Flows

被引：2

作者：

Liu, Zhecheng ^{[1
]}

Beckers, Diederik ^{[2
]}

Eldredge, Jeff D. ^{[1
]}

机构：

[1] Univ Calif Los Angeles, Dept Mech & Aerosp Engn, Los Angeles, CA 90095 USA

[2] CALTECH, Grad Aerosp Labs, Pasadena, CA 91125 USA

来源：

AIAA JOURNAL | 2025年

基金：

美国国家科学基金会;

关键词：

Representation Learning; Convolutional Neural Network; Pitching Airfoil; Fluid Dynamics; Vortex Structure; Aerodynamic Performance; Proper Orthogonal Decomposition; Reinforcement Learning; Data-Driven Model; Active Flow Control; NEURAL-NETWORKS; DECOMPOSITION;

D O I：

10.2514/1.J064790

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

The intrinsic high dimension of fluid dynamics is an inherent challenge to control of aerodynamic flows, and this is further complicated by a flow's nonlinear response to strong disturbances. Deep reinforcement learning, which takes advantage of the exploratory aspects of reinforcement learning (RL) and the rich nonlinearity of a deep neural network, provides a promising approach to discover feasible control strategies. However, the typical model-free approach to reinforcement learning requires a significant amount of interaction between the flow environment and the RL agent during training, and this high training cost impedes its development and application. In this work, we propose a model-based reinforcement learning (MBRL) approach by incorporating a novel reduced-order model as a surrogate for the full environment. The model consists of a physics-augmented autoencoder, which compresses high-dimensional CFD flowfield snapshots into a three-dimensional latent space, and a latent dynamics model that is trained to accurately predict the long-time dynamics of trajectories in the latent space in response to action sequences. The accuracy and robustness of the model are demonstrated in the scenario of a pitching airfoil within a highly disturbed environment. Additionally, an application to a vertical-axis wind turbine in a disturbance-free environment is discussed in the Appendix. Based on the model trained in the pitching airfoil problem, we realize an MBRL strategy to mitigate lift variation during gust-airfoil encounters. We demonstrate that the policy learned in the reduced-order environment translates to an effective control strategy in the full CFD environment.

引用

页数：21

共 31 条

[11] Reducing the dimensionality of data with neural networks [J].

Hinton, G. E. ;

Salakhutdinov, R. R. .

SCIENCE, 2006, 313 (5786) :504-507

[12] Physics and Modeling of Large Flow Disturbances: Discrete Gust Encounters for Modern Air Vehicles [J].

Jones, Anya R. ;

Cetiner, Oksan ;

Smith, Marilyn J. .

ANNUAL REVIEW OF FLUID MECHANICS, 2022, 54 :469-493

[13] Closed-Loop Control of Lift for Longitudinal Gust Suppression at Low Reynolds Numbers [J].

Kerstens, Wesley ;

Pfeiffer, Jens ;

Williams, David ;

King, Rudibert ;

Colonius, Tim .

AIAA JOURNAL, 2011, 49 (08) :1721-1728

[14] Optimal blade pitch control for enhanced vertical-axis wind turbine performance [J].

Le Fouest, Sebastien ;

Mulleners, Karen .

NATURE COMMUNICATIONS, 2024, 15 (01)

[15] State representation learning for control: An overview [J].

Lesort, Timothee ;

Diaz-Rodriguez, Natalia ;

Goudou, Jean-Franois ;

Filliat, David .

NEURAL NETWORKS, 2018, 108 :379-392

[16] Turbulence control in plane Couette flow using low-dimensional neural ODE-based models and deep reinforcement learning [J].

Linot, Alec J. ;

Zeng, Kevin ;

Graham, Michael D. .

INTERNATIONAL JOURNAL OF HEAT AND FLUID FLOW, 2023, 101

[17] Stabilized neural ordinary differential equations for long-time forecasting of dynamical systems [J].

Linot, Alec J. ;

Burby, Joshua W. ;

Tang, Qi ;

Balaprakash, Prasanna ;

Graham, Michael D. ;

Maulik, Romit .

JOURNAL OF COMPUTATIONAL PHYSICS, 2023, 474

[18] Model-based Reinforcement Learning: A Survey [J].

Moerland, Thomas M. ;

Broekens, Joost ;

Plaat, Aske ;

Jonker, Catholijn M. .

FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2023, 16 (01) :1-118

[19] Nonlinear mode decomposition with convolutional neural networks for fluid dynamics [J].

Murata, Takaaki ;

Fukami, Kai ;

Fukagata, Koji .

JOURNAL OF FLUID MECHANICS, 2020, 882

[20] Synchronisation through learning for two self-propelled swimmers [J].

Novati, Guido ;

Verma, Siddhartha ;

Alexeev, Dmitry ;

Rossinelli, Diego ;

van Rees, Wim M. ;

Koumoutsakos, Petros .

BIOINSPIRATION & BIOMIMETICS, 2017, 12 (03)

← 1 2 3 4 →