Autonomous navigation of stratospheric balloons using reinforcement learning

被引：0

作者：

Marc G. Bellemare

Salvatore Candido

Pablo Samuel Castro

Jun Gong

Marlos C. Machado

Subhodeep Moitra

Sameera S. Ponda

Ziyu Wang

机构：

[1] Brain Team,

[2] Google Research,undefined

[3] Brain Team,undefined

[4] Google Research,undefined

[5] Loon,undefined

来源：

Nature | 2020年 / 588卷

关键词：

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Efficiently navigating a superpressure balloon in the stratosphere1 requires the integration of a multitude of cues, such as wind speed and solar elevation, and the process is complicated by forecast errors and sparse wind measurements. Coupled with the need to make decisions in real time, these factors rule out the use of conventional control techniques2,3. Here we describe the use of reinforcement learning4,5 to create a high-performing flight controller. Our algorithm uses data augmentation6,7 and a self-correcting design to overcome the key technical challenge of reinforcement learning from imperfect data, which has proved to be a major obstacle to its application to physical systems8. We deployed our controller to station Loon superpressure balloons at multiple locations across the globe, including a 39-day controlled experiment over the Pacific Ocean. Analyses show that the controller outperforms Loon’s previous algorithm and is robust to the natural diversity in stratospheric winds. These results demonstrate that reinforcement learning is an effective solution to real-world autonomous control problems in which neither conventional methods nor human intervention suffice, offering clues about what may be needed to create artificially intelligent agents that continuously interact with real, dynamic environments.

引用

页码：77 / 82

页数：5

共 50 条

[1] Autonomous navigation of stratospheric balloons using reinforcement learning
Bellemare, Marc G.
Candido, Salvatore
Castro, Pablo Samuel
Gong, Jun
Machado, Marlos C.
Moitra, Subhodeep
Ponda, Sameera S.
Wang, Ziyu
NATURE, 2020, 588 (7836) : 77 - +
[2] Autonomous vehicle navigation using evolutionary reinforcement learning
Stafylopatis, A
Blekas, K
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 1998, 108 (02) : 306 - 318
[3] Navigation of autonomous vehicles in unknown environments using reinforcement learning
Martinez-Marin, Tomas
Rodriguez, Rafael
2007 IEEE INTELLIGENT VEHICLES SYMPOSIUM, VOLS 1-3, 2007, : 964 - +
[4] Autonomous Navigation and Control of a Quadrotor Using Deep Reinforcement Learning
Mokhtar, Mohamed
El-Badawy, Ayman
2023 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS, ICUAS, 2023, : 1045 - 1052
[5] Autonomous Visual Navigation using Deep Reinforcement Learning: An Overview
Ejaz, Muhammad Mudassir
Tang, Tong Boon
Lu, Cheng-Kai
2019 17TH IEEE STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT (SCORED), 2019, : 294 - 299
[6] Reinforcement Learning for Autonomous UAV Navigation Using Function Approximation
Huy Xuan Pham
Hung Manh La
Feil-Seifer, David
Luan Van Nguyen
2018 IEEE INTERNATIONAL SYMPOSIUM ON SAFETY, SECURITY, AND RESCUE ROBOTICS (SSRR), 2018,
[7] A fast learning approach for autonomous navigation using a deep reinforcement learning method
Ejaz, Muhammad Mudassir
Tang, Tong Boon
Lu, Cheng-Kai
ELECTRONICS LETTERS, 2021, 57 (02) : 50 - 53
[8] Benchmarking Reinforcement Learning Techniques for Autonomous Navigation
Xu, Zifan
Liu, Bo
Xiao, Xuesu
Nair, Anirudh
Stone, Peter
2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 9224 - 9230
[9] Neural inverse reinforcement learning in autonomous navigation
Xia, Chen
El Kamel, Abdelkader
ROBOTICS AND AUTONOMOUS SYSTEMS, 2016, 84 : 1 - 14
[10] Autonomous UAV Visual Navigation Using an Improved Deep Reinforcement Learning
Samma, Hussein
El-Ferik, Sami
IEEE ACCESS, 2024, 12 : 79967 - 79977

← 1 2 3 4 5 →