UAV online path planning technology based on deep reinforcement learning

Cited by: 4

Authors:

Fan, Jiaxuan [1]

Wang, Zhenya [1]

Ren, Jinlei [1]

Lu, Ying [1]

Liu, Yiheng [2]

Affiliations:

[1] China Academy of Launch Vehicle Technology, Research and Development Center, Beijing, People's Republic of China

[2] Beihang University, School of Automation Science and Electrical Engineering, Beijing, People's Republic of China

Source:

2020 Chinese Automation Congress (CAC 2020) | 2020

Keywords:

path planning; interfered fluid dynamical system (IFDS); unmanned aerial vehicle (UAV); deep reinforcement learning; Twin Delayed Deep Deterministic Policy Gradient (TD3);

DOI:

10.1109/CAC51589.2020.9327752

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

This paper proposes a method for planning three-dimensional paths for an unmanned aerial vehicle (UAV) in complex airspace, based on the interfered fluid dynamical system (IFDS) and deep reinforcement learning. First, a model of the UAV under various constraints and a mathematical expression of the threat zone are established. Second, to address the slow computation and the difficulty of reaching a globally optimal solution in existing methods, an intelligent 3D path planning method based on IFDS is proposed, in which deep reinforcement learning is used to solve for the IFDS coefficient. Simulation results show that the path planned by the proposed method avoids the threat zone effectively, and that the path is smooth, suitable for the UAV, and fuel-saving.
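To make the IFDS idea in the abstract concrete, the following is a minimal sketch and not the authors' implementation. It assumes the basic repulsion-only IFDS form for a single spherical threat zone: a straight-line flow toward the goal is modified by a perturbation matrix built from the obstacle's surface normal. The function name `ifds_velocity`, its parameters, and the fixed value of `rho` are hypothetical; in the approach the abstract describes, a coefficient such as `rho` would be supplied by the trained reinforcement learning policy rather than set by hand.

```python
import numpy as np

def ifds_velocity(p, goal, center, radius, rho=1.0, speed=1.0):
    """Illustrative IFDS-style velocity for one spherical threat zone.

    Assumption: repulsion-only perturbation matrix; `rho` is a placeholder
    for a coefficient that a learned policy could output.
    """
    p, goal, center = (np.asarray(v, dtype=float) for v in (p, goal, center))

    # Initial flow: constant-speed straight-line flow toward the goal.
    to_goal = goal - p
    u = speed * to_goal / np.linalg.norm(to_goal)

    # Sphere function: equals 1 on the surface and grows with distance.
    rel = p - center
    gamma = np.dot(rel, rel) / radius**2

    # Outward surface normal, taken as the gradient of the sphere function.
    n = 2.0 * rel / radius**2

    # Perturbation matrix: removes the normal component of the flow near the
    # surface and fades to the identity far away; rho controls the reach.
    M = np.eye(3) - np.outer(n, n) / (gamma ** (1.0 / rho) * np.dot(n, n))
    return M @ u
```

On the surface of the sphere the returned velocity has no component along the normal, so the flow slides around the threat zone instead of entering it; far from the sphere it is nearly identical to the unperturbed flow. A path would be obtained by integrating this velocity step by step from the start point.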

Citation

Pages: 5382-5386

Page count: 5

References: 11 in total

[1] Bortoff SA, 2000, P AMER CONTR CONF, P364, DOI 10.1109/ACC.2000.878915

[2] Fujimoto S, 2018, PR MACH LEARN RES, V80

[3] Han-Pang Huang, 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566), P2813

[4] Lillicrap Timothy P., 2015, INT C LEARNING REPRE

[5] Deep learning based trajectory optimization for UAV aerial refueling docking under bow wave [J]. Liu, Yiheng; Wang, Honglun; Su, Zikang; Fan, Jiaxuan. AEROSPACE SCIENCE AND TECHNOLOGY, 2018, 80: 392-402

[6] Human-level control through deep reinforcement learning [J]. Mnih, Volodymyr; Kavukcuoglu, Koray; Silver, David; Rusu, Andrei A.; Veness, Joel; Bellemare, Marc G.; Graves, Alex; Riedmiller, Martin; Fidjeland, Andreas K.; Ostrovski, Georg; Petersen, Stig; Beattie, Charles; Sadik, Amir; Antonoglou, Ioannis; King, Helen; Kumaran, Dharshan; Wierstra, Daan; Legg, Shane; Hassabis, Demis. NATURE, 2015, 518 (7540): 529-533

[7] Tan JH, 2015, 2015 IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, P2592, DOI 10.1109/ICInfA.2015.7279722

[8] Timothy WM, 2000, TRAJECTORY PLANNING, V12, P123

[9] Three-dimensional path planning for unmanned aerial vehicle based on interfered fluid dynamical system [J]. Wang Honglun; Lyu Wentao; Peng, Yao; Xiao, Liang; Chang, Liu. CHINESE JOURNAL OF AERONAUTICS, 2015, 28 (01): 229-239

[10] Path planning for solar-powered UAV in urban environment [J]. Wu, Jianfa; Wang, Honglun; Li, Na; Yao, Peng; Huang, Yu; Yang, Hemeng. NEUROCOMPUTING, 2018, 275: 2055-2065
