Autonomous navigation of stratospheric balloons using reinforcement learning

被引:0
|
作者
Marc G. Bellemare
Salvatore Candido
Pablo Samuel Castro
Jun Gong
Marlos C. Machado
Subhodeep Moitra
Sameera S. Ponda
Ziyu Wang
机构
[1] Brain Team,
[2] Google Research,undefined
[3] Brain Team,undefined
[4] Google Research,undefined
[5] Loon,undefined
来源
Nature | 2020年 / 588卷
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Efficiently navigating a superpressure balloon in the stratosphere1 requires the integration of a multitude of cues, such as wind speed and solar elevation, and the process is complicated by forecast errors and sparse wind measurements. Coupled with the need to make decisions in real time, these factors rule out the use of conventional control techniques2,3. Here we describe the use of reinforcement learning4,5 to create a high-performing flight controller. Our algorithm uses data augmentation6,7 and a self-correcting design to overcome the key technical challenge of reinforcement learning from imperfect data, which has proved to be a major obstacle to its application to physical systems8. We deployed our controller to station Loon superpressure balloons at multiple locations across the globe, including a 39-day controlled experiment over the Pacific Ocean. Analyses show that the controller outperforms Loon’s previous algorithm and is robust to the natural diversity in stratospheric winds. These results demonstrate that reinforcement learning is an effective solution to real-world autonomous control problems in which neither conventional methods nor human intervention suffice, offering clues about what may be needed to create artificially intelligent agents that continuously interact with real, dynamic environments.
引用
收藏
页码:77 / 82
页数:5
相关论文
共 50 条
  • [1] Autonomous navigation of stratospheric balloons using reinforcement learning
    Bellemare, Marc G.
    Candido, Salvatore
    Castro, Pablo Samuel
    Gong, Jun
    Machado, Marlos C.
    Moitra, Subhodeep
    Ponda, Sameera S.
    Wang, Ziyu
    NATURE, 2020, 588 (7836) : 77 - +
  • [2] Autonomous vehicle navigation using evolutionary reinforcement learning
    Stafylopatis, A
    Blekas, K
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1998, 108 (02) : 306 - 318
  • [3] Navigation of autonomous vehicles in unknown environments using reinforcement learning
    Martinez-Marin, Tomas
    Rodriguez, Rafael
    2007 IEEE INTELLIGENT VEHICLES SYMPOSIUM, VOLS 1-3, 2007, : 964 - +
  • [4] Autonomous Navigation and Control of a Quadrotor Using Deep Reinforcement Learning
    Mokhtar, Mohamed
    El-Badawy, Ayman
    2023 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS, ICUAS, 2023, : 1045 - 1052
  • [5] Autonomous Visual Navigation using Deep Reinforcement Learning: An Overview
    Ejaz, Muhammad Mudassir
    Tang, Tong Boon
    Lu, Cheng-Kai
    2019 17TH IEEE STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT (SCORED), 2019, : 294 - 299
  • [6] Reinforcement Learning for Autonomous UAV Navigation Using Function Approximation
    Huy Xuan Pham
    Hung Manh La
    Feil-Seifer, David
    Luan Van Nguyen
    2018 IEEE INTERNATIONAL SYMPOSIUM ON SAFETY, SECURITY, AND RESCUE ROBOTICS (SSRR), 2018,
  • [7] A fast learning approach for autonomous navigation using a deep reinforcement learning method
    Ejaz, Muhammad Mudassir
    Tang, Tong Boon
    Lu, Cheng-Kai
    ELECTRONICS LETTERS, 2021, 57 (02) : 50 - 53
  • [8] Benchmarking Reinforcement Learning Techniques for Autonomous Navigation
    Xu, Zifan
    Liu, Bo
    Xiao, Xuesu
    Nair, Anirudh
    Stone, Peter
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 9224 - 9230
  • [9] Neural inverse reinforcement learning in autonomous navigation
    Xia, Chen
    El Kamel, Abdelkader
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2016, 84 : 1 - 14
  • [10] Autonomous UAV Visual Navigation Using an Improved Deep Reinforcement Learning
    Samma, Hussein
    El-Ferik, Sami
    IEEE ACCESS, 2024, 12 : 79967 - 79977