Model-Based Reinforcement Learning for Control of Strongly Disturbed Unsteady Aerodynamic Flows

被引：2

作者：

Liu, Zhecheng ^{[1
]}

Beckers, Diederik ^{[2
]}

Eldredge, Jeff D. ^{[1
]}

机构：

[1] Univ Calif Los Angeles, Dept Mech & Aerosp Engn, Los Angeles, CA 90095 USA

[2] CALTECH, Grad Aerosp Labs, Pasadena, CA 91125 USA

来源：

AIAA JOURNAL | 2025年

基金：

美国国家科学基金会;

关键词：

Representation Learning; Convolutional Neural Network; Pitching Airfoil; Fluid Dynamics; Vortex Structure; Aerodynamic Performance; Proper Orthogonal Decomposition; Reinforcement Learning; Data-Driven Model; Active Flow Control; NEURAL-NETWORKS; DECOMPOSITION;

D O I：

10.2514/1.J064790

中图分类号：

V [航空、航天];

学科分类号：

08 ; 0825 ;

摘要：

The intrinsic high dimension of fluid dynamics is an inherent challenge to control of aerodynamic flows, and this is further complicated by a flow's nonlinear response to strong disturbances. Deep reinforcement learning, which takes advantage of the exploratory aspects of reinforcement learning (RL) and the rich nonlinearity of a deep neural network, provides a promising approach to discover feasible control strategies. However, the typical model-free approach to reinforcement learning requires a significant amount of interaction between the flow environment and the RL agent during training, and this high training cost impedes its development and application. In this work, we propose a model-based reinforcement learning (MBRL) approach by incorporating a novel reduced-order model as a surrogate for the full environment. The model consists of a physics-augmented autoencoder, which compresses high-dimensional CFD flowfield snapshots into a three-dimensional latent space, and a latent dynamics model that is trained to accurately predict the long-time dynamics of trajectories in the latent space in response to action sequences. The accuracy and robustness of the model are demonstrated in the scenario of a pitching airfoil within a highly disturbed environment. Additionally, an application to a vertical-axis wind turbine in a disturbance-free environment is discussed in the Appendix. Based on the model trained in the pitching airfoil problem, we realize an MBRL strategy to mitigate lift variation during gust-airfoil encounters. We demonstrate that the policy learned in the reduced-order environment translates to an effective control strategy in the full CFD environment.

引用

页数：21

共 31 条

[1] Feedback control of unstable steady states of flow past a flat plate using reduced-order estimators [J].

Ahuja, S. ;

Rowley, C. W. .

JOURNAL OF FLUID MECHANICS, 2010, 645 :447-478

[2] Deep reinforcement learning of airfoil pitch control in a highly disturbed environment using partial observations [J].

Beckers, Diederik ;

Eldredge, Jeff D. .

PHYSICAL REVIEW FLUIDS, 2024, 9 (09)

[3] THE PROPER ORTHOGONAL DECOMPOSITION IN THE ANALYSIS OF TURBULENT FLOWS [J].

BERKOOZ, G ;

HOLMES, P ;

LUMLEY, JL .

ANNUAL REVIEW OF FLUID MECHANICS, 1993, 25 :539-575

[4] Machine Learning for Fluid Mechanics [J].

Brunton, Steven L. ;

Noack, Bernd R. ;

Koumoutsakos, Petros .

ANNUAL REVIEW OF FLUID MECHANICS, VOL 52, 2020, 52 :477-508

[5] Discovering governing equations from data by sparse identification of nonlinear dynamical systems [J].

Brunton, Steven L. ;

Proctor, Joshua L. ;

Kutz, J. Nathan .

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2016, 113 (15) :3932-3937

[6] A method of immersed layers on Cartesian grids, with application to incompressible flows [J].

Eldredge, Jeff D. .

JOURNAL OF COMPUTATIONAL PHYSICS, 2022, 448

[7] Reinforcement learning for bluff body active flow control in experiments and simulations [J].

Fan, Dixia ;

Yang, Liu ;

Wang, Zhicheng ;

Triantafyllou, Michael S. ;

Karniadakis, George Em .

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2020, 117 (42) :26091-26098

[8]

Fujimoto S, 2018, PR MACH LEARN RES, V80

[9] Grasping extreme aerodynamics on a low-dimensional manifold [J].

Fukami, Kai ;

Taira, Kunihiko .

NATURE COMMUNICATIONS, 2023, 14 (01)

[10]

Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]

← 1 2 3 4 →