Model-Based Reinforcement Learning for Control of Strongly Disturbed Unsteady Aerodynamic Flows

被引:2
作者
Liu, Zhecheng [1 ]
Beckers, Diederik [2 ]
Eldredge, Jeff D. [1 ]
机构
[1] Univ Calif Los Angeles, Dept Mech & Aerosp Engn, Los Angeles, CA 90095 USA
[2] CALTECH, Grad Aerosp Labs, Pasadena, CA 91125 USA
基金
美国国家科学基金会;
关键词
Representation Learning; Convolutional Neural Network; Pitching Airfoil; Fluid Dynamics; Vortex Structure; Aerodynamic Performance; Proper Orthogonal Decomposition; Reinforcement Learning; Data-Driven Model; Active Flow Control; NEURAL-NETWORKS; DECOMPOSITION;
D O I
10.2514/1.J064790
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
The intrinsic high dimension of fluid dynamics is an inherent challenge to control of aerodynamic flows, and this is further complicated by a flow's nonlinear response to strong disturbances. Deep reinforcement learning, which takes advantage of the exploratory aspects of reinforcement learning (RL) and the rich nonlinearity of a deep neural network, provides a promising approach to discover feasible control strategies. However, the typical model-free approach to reinforcement learning requires a significant amount of interaction between the flow environment and the RL agent during training, and this high training cost impedes its development and application. In this work, we propose a model-based reinforcement learning (MBRL) approach by incorporating a novel reduced-order model as a surrogate for the full environment. The model consists of a physics-augmented autoencoder, which compresses high-dimensional CFD flowfield snapshots into a three-dimensional latent space, and a latent dynamics model that is trained to accurately predict the long-time dynamics of trajectories in the latent space in response to action sequences. The accuracy and robustness of the model are demonstrated in the scenario of a pitching airfoil within a highly disturbed environment. Additionally, an application to a vertical-axis wind turbine in a disturbance-free environment is discussed in the Appendix. Based on the model trained in the pitching airfoil problem, we realize an MBRL strategy to mitigate lift variation during gust-airfoil encounters. We demonstrate that the policy learned in the reduced-order environment translates to an effective control strategy in the full CFD environment.
引用
收藏
页数:21
相关论文
共 31 条
[1]   Feedback control of unstable steady states of flow past a flat plate using reduced-order estimators [J].
Ahuja, S. ;
Rowley, C. W. .
JOURNAL OF FLUID MECHANICS, 2010, 645 :447-478
[2]   Deep reinforcement learning of airfoil pitch control in a highly disturbed environment using partial observations [J].
Beckers, Diederik ;
Eldredge, Jeff D. .
PHYSICAL REVIEW FLUIDS, 2024, 9 (09)
[3]   THE PROPER ORTHOGONAL DECOMPOSITION IN THE ANALYSIS OF TURBULENT FLOWS [J].
BERKOOZ, G ;
HOLMES, P ;
LUMLEY, JL .
ANNUAL REVIEW OF FLUID MECHANICS, 1993, 25 :539-575
[4]   Machine Learning for Fluid Mechanics [J].
Brunton, Steven L. ;
Noack, Bernd R. ;
Koumoutsakos, Petros .
ANNUAL REVIEW OF FLUID MECHANICS, VOL 52, 2020, 52 :477-508
[5]   Discovering governing equations from data by sparse identification of nonlinear dynamical systems [J].
Brunton, Steven L. ;
Proctor, Joshua L. ;
Kutz, J. Nathan .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2016, 113 (15) :3932-3937
[6]   A method of immersed layers on Cartesian grids, with application to incompressible flows [J].
Eldredge, Jeff D. .
JOURNAL OF COMPUTATIONAL PHYSICS, 2022, 448
[7]   Reinforcement learning for bluff body active flow control in experiments and simulations [J].
Fan, Dixia ;
Yang, Liu ;
Wang, Zhicheng ;
Triantafyllou, Michael S. ;
Karniadakis, George Em .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2020, 117 (42) :26091-26098
[8]  
Fujimoto S, 2018, PR MACH LEARN RES, V80
[9]   Grasping extreme aerodynamics on a low-dimensional manifold [J].
Fukami, Kai ;
Taira, Kunihiko .
NATURE COMMUNICATIONS, 2023, 14 (01)
[10]  
Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]