UAV Navigation in 3D Urban Environments with Curriculum-based Deep Reinforcement Learning

被引:1
作者
de Carvalho, Kevin Braathen [1 ]
de Oliveira, Iure Rosa L. [1 ]
Brandao, Alexandre S. [1 ,2 ]
机构
[1] Univ Fed Vicosa, Grad Program Comp Sci, Nucl Specializat Robot, Dept Elect Engn, BR-36570900 Vicosa, MG, Brazil
[2] Univ Fed Vicosa, Dept Elect Engn, Vicosa, MG, Brazil
来源
2023 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS, ICUAS | 2023年
关键词
Deep Reinforcement Learning; Curriculum Learning; Urban Environments; 3D Navigation;
D O I
10.1109/ICUAS57906.2023.10156524
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Unmanned Aerial Vehicles (UAVs) are widely used in various applications, from inspection and surveillance to transportation and delivery. Navigating UAVs in complex 3D environments is a challenging task that requires robust and efficient decision-making algorithms. This paper presents a novel approach to UAV navigation in 3D environments using a Curriculum-based Deep Reinforcement Learning (DRL) approach. The proposed method utilizes a deep neural network to model the UAV's decision-making process and to learn a mapping from the state space to the action space. The learning process is guided by a reinforcement signal that reflects the performance of the UAV in terms of reaching its target while avoiding obstacles and with energy efficiency. Simulation results show that the proposed method has a positive trade off when compared to the baseline algorithm. The proposed method was able to perform well in environments with a state space size of 22 millions, allowing the usage in big environments or in maps with high resolution. The results demonstrate the potential of DRL for enabling UAVs to operate effectively in complex environments.
引用
收藏
页码:1249 / 1255
页数:7
相关论文
共 24 条
[1]   Artificial Intelligence in Advanced Manufacturing: Current Status and Future Outlook [J].
Arinez, Jorge F. ;
Chang, Qing ;
Gao, Robert X. ;
Xu, Chengying ;
Zhang, Jianjing .
JOURNAL OF MANUFACTURING SCIENCE AND ENGINEERING-TRANSACTIONS OF THE ASME, 2020, 142 (11)
[2]   A Path-Following Controller for a UAV-UGV Formation Performing the Final Step of Last-Mile-Delivery [J].
Bacheti, Vinicius Pacheco ;
Brandao, Alexandre Santos ;
Sarcinelli-Filho, Mario .
IEEE ACCESS, 2021, 9 :142218-142231
[3]  
Bouhamed O., 2020, IEEE INT SYMP CIRC S, DOI [10.1109/iscas45731.2020.9181245, DOI 10.1109/iscas45731.2020.9181245]
[4]   Q-Learning: Theory and Applications [J].
Clifton, Jesse ;
Laber, Eric .
ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, VOL 7, 2020, 2020, 7 :279-301
[5]   Q-learning based Path Planning Method for UAVs using Priority Shifting [J].
de Carvalho, Kevin B. ;
de Oliveira, Iure Rosa L. ;
Villa, Daniel K. D. ;
Caldeira, Alexandre G. ;
Sarcinelli-Filho, Mario ;
Brandao, Alexandre S. .
2022 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS (ICUAS), 2022, :421-426
[6]   A 3D Q-Learning Algorithm for Offline UAV Path Planning with Priority Shifting Rewards [J].
de Carvalho, Kevin Braathen ;
Batista, Hiago B. ;
de Oliveira, Iure L. ;
Brandao, Alexandre S. .
2022 LATIN AMERICAN ROBOTICS SYMPOSIUM (LARS), 2022 BRAZILIAN SYMPOSIUM ON ROBOTICS (SBR), AND 2022 WORKSHOP ON ROBOTICS IN EDUCATION (WRE), 2022, :169-174
[7]   Gestures-teleoperation of a heterogeneous multi-robot system [J].
de Carvalho, Kevin Braathen ;
Villa, Daniel Khede Dourado ;
Sarcinelli-Filho, Mario ;
Brandao, Alexandre Santos .
INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2022, 118 (5-6) :1999-2015
[8]   Cooperative Load Transportation With Two Quadrotors Using Adaptive Control [J].
Dourado Villa, Daniel Khede ;
Brandao, Alexandre Santos ;
Carelli, Ricardo ;
Sarcinelli-Filho, Mario .
IEEE ACCESS, 2021, 9 :129148-129160
[9]   The Unmanned Aerial Vehicle Benchmark: Object Detection and Tracking [J].
Du, Dawei ;
Qi, Yuankai ;
Yu, Hongyang ;
Yang, Yifan ;
Duan, Kaiwen ;
Li, Guorong ;
Zhang, Weigang ;
Huang, Qingming ;
Tian, Qi .
COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 :375-391
[10]  
Erdelj M, 2017, IEEE PERVAS COMPUT, V16, P24, DOI 10.1109/MPRV.2017.11