Real-time route planning of unmanned aerial vehicles based on improved soft actor-critic algorithm

被引:3
作者
Zhou, Yuxiang [1 ]
Shu, Jiansheng [1 ]
Zheng, Xiaolong [1 ]
Hao, Hui [1 ]
Song, Huan [1 ]
机构
[1] Xian Res Inst High Technol, Xian, Peoples R China
基金
英国科研创新办公室;
关键词
deep reinforcement learning; unmanned aerial vehicles (UAV); 2D path planning; local route planning; SAC algorithm;
D O I
10.3389/fnbot.2022.1025817
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the application and development of UAV technology and navigation and positioning technology, higher requirements are put forward for UAV maneuvering obstacle avoidance ability and real-time route planning. In this paper, for the problem of real-time UAV route planning in the unknown environment, we combine the ideas of artificial potential field method to modify the state observation and reward function, which solves the problem of sparse rewards of reinforcement learning algorithm, improves the convergence speed of the algorithm, and improves the generalization of the algorithm by step-by-step training based on the ideas of curriculum learning and transfer learning according to the difficulty of the task. The simulation results show that the improved SAC algorithm has fast convergence speed, good timeliness and strong generalization, and can better complete the UAV route planning task.
引用
收藏
页数:15
相关论文
共 21 条
[1]   Double Critic Deep Reinforcement Learning for Mapless 3D Navigation of Unmanned Aerial Vehicles [J].
Bedin Grando, Ricardo ;
de Jesus, Junior Costa ;
Kich, Victor Augusto ;
Kolling, Alisson Henrique ;
Jorge Drews-Jr, Paulo Lilles .
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2022, 104 (02)
[2]  
Bengio Yoshua., 2009, P INT C MACH LEARN, DOI 10.1145/1553374.1553380
[3]   Soft Actor-Critic for Navigation of Mobile Robots [J].
de Jesus, Junior Costa ;
Kich, Victor Augusto ;
Kolling, Alisson Henrique ;
Grando, Ricardo Bedin ;
Cuadros, Marco Antonio de Souza Leite ;
Gamarra, Daniel Fernando Tello .
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2021, 102 (02)
[4]  
Dijkstra EW., 1959, NUMER MATH, V1, P269, DOI [10.1007/BF01386390, DOI 10.1007/BF01386390]
[5]  
Fujimoto S, 2018, PR MACH LEARN RES, V80
[6]   Deep Reinforcement Learning for Indoor Mobile Robot Path Planning [J].
Gao, Junli ;
Ye, Weijie ;
Guo, Jing ;
Li, Zhongjuan .
SENSORS, 2020, 20 (19) :1-15
[7]  
Haarnoja T, 2018, PR MACH LEARN RES, V80
[8]   Mobile robot path planning with surrounding point set and path improvement [J].
Han, Jihee ;
Seo, Yoonho .
APPLIED SOFT COMPUTING, 2017, 57 :35-47
[9]   A FORMAL BASIS FOR HEURISTIC DETERMINATION OF MINIMUM COST PATHS [J].
HART, PE ;
NILSSON, NJ ;
RAPHAEL, B .
IEEE TRANSACTIONS ON SYSTEMS SCIENCE AND CYBERNETICS, 1968, SSC4 (02) :100-+
[10]   Explainable Deep Reinforcement Learning for UAV autonomous path planning [J].
He, Lei ;
Aouf, Nabil ;
Song, Bifeng .
AEROSPACE SCIENCE AND TECHNOLOGY, 2021, 118