Robot path planning based on deep reinforcement learning

Cited by: 13

Authors:

Long, Yinxin [1]

He, Huajin [1]

Affiliations:

[1] Wuhan University of Technology, School of Automation, Wuhan, People's Republic of China

Source:

2020 IEEE CONFERENCE ON TELECOMMUNICATIONS, OPTICS AND COMPUTER SCIENCE (TOCS) | 2020

Keywords:

mobile robot; deep reinforcement learning; obstacle avoidance; optimal path;

DOI:

10.1109/TOCS50858.2020.9339752

Chinese Library Classification number:

TP301 [Theory, methods]

Discipline classification code:

081202

Abstract:

Q-learning, a reinforcement learning algorithm based on the Markov decision process, can achieve good path planning results for a mobile robot through continuous trial and error. However, Q-learning requires a very large Q-value table, which easily leads to the curse of dimensionality in decision-making and makes it difficult to find a good path in complex environments. Combining deep learning with reinforcement learning, and using the perceptual strengths of deep learning to address the decision-making problem of reinforcement learning, can remedy this deficiency of Q-learning. Path planning with deep reinforcement learning is simulated in MATLAB. The simulation results show that deep reinforcement learning can effectively achieve obstacle avoidance and plan a collision-free optimal path for the robot from the starting point to the end point.
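To make the abstract's point about the Q-value table concrete, the following is a minimal sketch of tabular Q-learning on a small grid map. It is not the authors' method or code (the paper reports MATLAB simulations of deep reinforcement learning); the grid layout, reward values, hyperparameters, and the names `step`, `train`, and `greedy_path` are all assumptions chosen for illustration.

```python
import random

MOVES = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


def step(grid, goal, state, action):
    """Apply one move; walls and obstacles leave the robot in place."""
    r, c = state[0] + MOVES[action][0], state[1] + MOVES[action][1]
    if not (0 <= r < len(grid) and 0 <= c < len(grid[0])) or grid[r][c] == 1:
        return state, -5.0, False   # collision penalty (assumed value)
    if (r, c) == goal:
        return (r, c), 10.0, True   # goal reward (assumed value)
    return (r, c), -1.0, False      # step cost favors short paths


def train(grid, start, goal, episodes=2000, alpha=0.5, gamma=0.9,
          epsilon=0.1, seed=0):
    """Tabular Q-learning with an epsilon-greedy behavior policy."""
    rng = random.Random(seed)
    # One list of action values per cell: the table has states x actions entries.
    q = {(r, c): [0.0] * len(MOVES)
         for r in range(len(grid)) for c in range(len(grid[0]))}
    for _ in range(episodes):
        state = start
        for _ in range(200):        # cap on episode length (assumed value)
            if rng.random() < epsilon:
                action = rng.randrange(len(MOVES))
            else:
                action = max(range(len(MOVES)), key=lambda a: q[state][a])
            nxt, reward, done = step(grid, goal, state, action)
            target = reward if done else reward + gamma * max(q[nxt])
            q[state][action] += alpha * (target - q[state][action])
            state = nxt
            if done:
                break
    return q


def greedy_path(q, grid, start, goal, max_steps=50):
    """Follow the highest-valued action from each cell until the goal."""
    path, state = [start], start
    for _ in range(max_steps):
        if state == goal:
            break
        action = max(range(len(MOVES)), key=lambda a: q[state][a])
        state, _, _ = step(grid, goal, state, action)
        path.append(state)
    return path


# Hypothetical 5x5 map: 0 is free space, 1 is an obstacle.
GRID = [
    [0, 0, 0, 0, 0],
    [0, 1, 1, 0, 0],
    [0, 0, 0, 1, 0],
    [0, 1, 0, 0, 0],
    [0, 0, 0, 1, 0],
]
q_table = train(GRID, (0, 0), (4, 4))
path = greedy_path(q_table, GRID, (0, 0), (4, 4))
print(len(q_table) * len(MOVES), path)
```

Even this toy map needs one table entry per cell and action. A finer grid or extra state variables such as heading or sensor readings multiply that count, which is the growth the abstract refers to and the reason a neural network is used to approximate the Q-values instead of storing them.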


Pages: 151-154

Page count: 4
