Learning efficient navigation in vortical flow fields

被引:47
作者
Gunnarson, Peter [1 ]
Mandralis, Ioannis [1 ]
Novati, Guido [2 ]
Koumoutsakos, Petros [2 ,3 ]
Dabiri, John O. [1 ,4 ]
机构
[1] CALTECH, Grad Aerosp Labs, 1200 E Calif Blvd, Pasadena, CA 91125 USA
[2] Swiss Fed Inst Technol, Computat Sci & Engn Lab, CH-8093 Zurich, Switzerland
[3] Harvard Univ, John A Paulson Sch Engn & Appl Sci, 150 Western Ave, Boston, MA 02134 USA
[4] CALTECH, Mech & Civil Engn, 1200 E Calif Blvd, Pasadena, CA 91125 USA
基金
美国国家科学基金会;
关键词
D O I
10.1038/s41467-021-27015-y
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Efficient point-to-point navigation in the presence of a background flow field is important for robotic applications such as ocean surveying. In such applications, robots may only have knowledge of their immediate surroundings or be faced with time-varying currents, which limits the use of optimal control techniques. Here, we apply a recently introduced Reinforcement Learning algorithm to discover time-efficient navigation policies to steer a fixed-speed swimmer through unsteady two-dimensional flow fields. The algorithm entails inputting environmental cues into a deep neural network that determines the swimmer's actions, and deploying Remember and Forget Experience Replay. We find that the resulting swimmers successfully exploit the background flow to reach the target, but that this success depends on the sensed environmental cue. Surprisingly, a velocity sensing approach significantly outperformed a bio-mimetic vorticity sensing approach, and achieved a near 100% success rate in reaching the target locations while approaching the time-efficiency of optimal navigation trajectories. Navigation and trajectory planning in environments with background flow, relevant for robotics, are challenging provided information only on local surrounding. The authors propose a reinforcement learning approach for time-efficient navigation of a swimmer through unsteady two-dimensional flows.
引用
收藏
页数:7
相关论文
共 34 条
  • [1] Training bioinspired sensors to classify flows
    Alsalman, Mohamad
    Colvert, Brendan
    Kanso, Eva
    [J]. BIOINSPIRATION & BIOMIMETICS, 2019, 14 (01)
  • [2] Autonomous navigation of stratospheric balloons using reinforcement learning
    Bellemare, Marc G.
    Candido, Salvatore
    Castro, Pablo Samuel
    Gong, Jun
    Machado, Marlos C.
    Moitra, Subhodeep
    Ponda, Sameera S.
    Wang, Ziyu
    [J]. NATURE, 2020, 588 (7836) : 77 - +
  • [3] Zermelo's problem: Optimal point-to-point navigation in 2D turbulent flows using reinforcement learning
    Biferale, L.
    Bonaccorso, F.
    Buzzicotti, M.
    Di Leoni, P. Clark
    Gustavsson, K.
    [J]. CHAOS, 2019, 29 (10)
  • [4] Optimal Control of Point-to-Point Navigation in Turbulent Time Dependent Flows Using Reinforcement Learning
    Buzzicotti, Michele
    Biferale, Luca
    Bonaccorso, Fabio
    Di Leoni, Patricio Clark
    Gustavsson, Kristian
    [J]. AIXIA 2020 - ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 12414 : 223 - 234
  • [5] Macro- to fine-scale spatial and temporal distributions and dynamics of phytoplankton and their environmental driving forces in a small montane lake in southern California, USA
    Caron, David A.
    Stauffer, Beth
    Moorthi, Stefanie
    Singh, Amarjeet
    Batalin, Maxim
    Graham, Eric A.
    Hansen, Mark
    Kaiser, William J.
    Das, Jnaneshwar
    Pereira, Arvind
    Dhariwal, Amit
    Zhang, Bin
    Oberg, Carl
    Sukhatme, Gaurav S.
    [J]. LIMNOLOGY AND OCEANOGRAPHY, 2008, 53 (05) : 2333 - 2349
  • [6] Flow Navigation by Smart Microswimmers via Reinforcement Learning
    Colabrese, Simona
    Gustavsson, Kristian
    Celani, Antonio
    Biferale, Luca
    [J]. PHYSICAL REVIEW LETTERS, 2017, 118 (15)
  • [7] Seal whiskers detect water movements
    Dehnhardt, G
    Mauck, B
    Bleckmann, H
    [J]. NATURE, 1998, 394 (6690) : 235 - 236
  • [8] Multi-AUV control and adaptive sampling in Monterey Bay
    Fiorelli, Edward
    Leonard, Naomi Ehrich
    Bhatta, Pradeep
    Paley, Derek A.
    Bachmayer, Ralf
    Fratantoni, David M.
    [J]. IEEE JOURNAL OF OCEANIC ENGINEERING, 2006, 31 (04) : 935 - 948
  • [9] REINFORCEMENT LEARNING AND WAVELET ADAPTED VORTEX METHODS FOR SIMULATIONS OF SELF-PROPELLED SWIMMERS
    Gazzola, Mattia
    Hejazialhosseini, Babak
    Koumoutsakos, Petros
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2014, 36 (03) : B622 - B639
  • [10] UAV Path Planning for Structure Inspection in Windy Environments
    Guerrero, Jose Alfredo
    Bestaoui, Yasmina
    [J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2013, 69 (1-4) : 297 - 311