Deep-reinforcement-learning-based self-organization of freely undulatory swimmers

被引:16
作者
Yu, Huiyang [1 ]
Liu, Bo [1 ]
Wang, Chengyun [1 ]
Liu, Xuechao [1 ]
Lu, Xi-Yun [1 ]
Huang, Haibo [1 ]
机构
[1] Univ Sci & Technol China, Dept Modern Mech, Hefei 230026, Anhui, Peoples R China
关键词
SIMULATIONS;
D O I
10.1103/PhysRevE.105.045105
中图分类号
O35 [流体力学]; O53 [等离子体物理学];
学科分类号
070204 ; 080103 ; 080704 ;
摘要
It is fascinating that fish groups spontaneously form different formations. The collective locomotions of two and multiple undulatory self-propelled foils swimming in a fluid are numerically studied and the deep reinforcement learning (DRL) is applied to control the locomotion. We explored whether typical patterns emerge spontaneously under the driven two DRL strategies. One strategy is that only the following fish gets hydrodynamic advantages. The other is that all individuals in the group take advantage of the interaction. In the DRL strategy, we use swimming efficiency as the reward function, and the visual information is included. We also investigated the effect of involving hydrodynamic force information, which is an analogy to that detected by the lateral line of fish. Each fish can adjust its undulatory phase to achieve the goal. Under the two strategies, collective patterns with different characteristics, i.e., the staggered-following, tandem-following phalanx and compact modes emerge. They are consistent with the results in the literature. The hydrodynamic mechanism of the above high-efficiency collective traveling modes is analyzed by the vortex-body interaction and thrust. We also found that the time sequence feature and hydrodynamic information in the DRL are essential to improve the performance of collective swimming. Our research can reasonably explain the controversial issue observed in the relevant experiments. The paper may be helpful for the design of bionic fish.
引用
收藏
页数:9
相关论文
共 44 条
[1]  
[Anonymous], US, DOI [10.1103/PhysRevE.105.045105, DOI 10.1103/PHYSREVE.105.045105]
[2]   Simple phalanx pattern leads to energy saving in cohesive fish schooling [J].
Ashraf, Intesaaf ;
Bradshaw, Hanae ;
Thanh-Tung Ha ;
Halloy, Jose ;
Godoy-Diana, Ramiro ;
Thiria, Benjamin .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2017, 114 (36) :9599-9604
[3]   Exploiting locality and translational invariance to design effective deep reinforcement learning control of the 1-dimensional unstable falling liquid film [J].
Belus, Vincent ;
Rabault, Jean ;
Viquerat, Jonathan ;
Che, Zhizhao ;
Hachem, Elie ;
Reglade, Ulysse .
AIP ADVANCES, 2019, 9 (12)
[4]   Propulsive performance of unsteady tandem hydrofoils in an in-line configuration [J].
Boschitsch, Birgitt M. ;
Dewey, Peter A. ;
Smits, Alexander J. .
PHYSICS OF FLUIDS, 2014, 26 (05)
[5]  
Braylan A., 2015, WORKSHOPS 20 9 AAAI
[6]   Machine Learning for Fluid Mechanics [J].
Brunton, Steven L. ;
Noack, Bernd R. ;
Koumoutsakos, Petros .
ANNUAL REVIEW OF FLUID MECHANICS, VOL 52, 2020, 52 :477-508
[7]   A comprehensive survey of multiagent reinforcement learning [J].
Busoniu, Lucian ;
Babuska, Robert ;
De Schutter, Bart .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2008, 38 (02) :156-172
[8]  
Carling J, 1998, J EXP BIOL, V201, P3143
[9]   Lattice Boltzmann method for fluid flows [J].
Chen, S ;
Doolen, GD .
ANNUAL REVIEW OF FLUID MECHANICS, 1998, 30 :329-364
[10]   The hydrodynamic advantages of synchronized swimming in a rectangular pattern [J].
Daghooghi, Mohsen ;
Borazjani, Iman .
BIOINSPIRATION & BIOMIMETICS, 2015, 10 (05)