Learning efficient navigation in vortical flow fields

Cited by: 47

Authors:

Gunnarson, Peter [1]

Mandralis, Ioannis [1]

Novati, Guido [2]

Koumoutsakos, Petros [2,3]

Dabiri, John O. [1,4]

Affiliations:

[1] CALTECH, Grad Aerosp Labs, 1200 E Calif Blvd, Pasadena, CA 91125 USA

[2] Swiss Fed Inst Technol, Computat Sci & Engn Lab, CH-8093 Zurich, Switzerland

[3] Harvard Univ, John A Paulson Sch Engn & Appl Sci, 150 Western Ave, Boston, MA 02134 USA

[4] CALTECH, Mech & Civil Engn, 1200 E Calif Blvd, Pasadena, CA 91125 USA

Source:

NATURE COMMUNICATIONS | 2021 / Vol. 12 / Issue 01

Funding:

U.S. National Science Foundation

DOI:

10.1038/s41467-021-27015-y

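Chinese Library Classification (CLC):

O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]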

Discipline classification codes:

07 ; 0710 ; 09 ;

Abstract:

Efficient point-to-point navigation in the presence of a background flow field is important for robotic applications such as ocean surveying. In such applications, robots may only have knowledge of their immediate surroundings or be faced with time-varying currents, which limits the use of optimal control techniques. Here, we apply a recently introduced reinforcement learning algorithm to discover time-efficient navigation policies to steer a fixed-speed swimmer through unsteady two-dimensional flow fields. The algorithm entails inputting environmental cues into a deep neural network that determines the swimmer's actions, and deploying Remember and Forget Experience Replay. We find that the resulting swimmers successfully exploit the background flow to reach the target, but that this success depends on the sensed environmental cue. Surprisingly, a velocity sensing approach significantly outperformed a bio-mimetic vorticity sensing approach, and achieved a near-100% success rate in reaching the target locations while approaching the time-efficiency of optimal navigation trajectories.

Navigation and trajectory planning in environments with background flow, relevant for robotics, are challenging when only information about the local surroundings is available. The authors propose a reinforcement learning approach for time-efficient navigation of a swimmer through unsteady two-dimensional flows.

Pages: 7

