A Reinforcement Learning Method of Obstacle Avoidance for Industrial Mobile Vehicles in Unknown Environments Using Neural Network

被引：12

作者：

Xia, Chen ^{[1
]}

El Kamel, A. ^{[1
]}

机构：

[1] Ecole Cent Lille, CNRS, UMR 8219, LAGIS, F-59650 Villeneuve Dascq, France

来源：

PROCEEDINGS OF THE 21ST INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT 2014 | 2015年

关键词：

Neural network; obstacle avoidance; Q-learning; reinforcement learning; unpredicted environments;

D O I：

10.2991/978-94-6239-102-4_136

中图分类号：

F [经济];

学科分类号：

02 ;

摘要：

This paper presents a reinforcement learning method for a mobile vehicle to navigate autonomously in an unknown environment. Q-learning algorithm is a model-free reinforcement learning technique and is applied to realize the robot self-learning ability. The state-action Q-values are traditionally stored in a Q table, which will decrease the learning speed when large storage memory is needed. The neural network has a strong ability to deal with large-scale state spaces. Therefore, the neural network is introduced to work with Q-learning to ensure the self-learning efficiency of avoiding obstacles for industrial vehicles in unpredictable environments. Experiment results show that an autonomous mobile vehicle using the proposed method can successfully navigate to the target place without colliding with obstacles, and hence prove the self-learning ability of navigation in an unknown environment.

引用

页码：671 / 675

页数：5

共 15 条

[1]

[Anonymous], P INT C CONTR AUT CO

[2]

[Anonymous], 1996, REINFORCEMENT LEARNI

[3]

[Anonymous], BACK PROPAGATION ALG

[4]

[Anonymous], 2003, Cognitive Systems Research, vol, DOI [10.1016/S1389-0417(03)00007-X, DOI 10.1016/S1389-0417(03)00007-X]

[5]

[Anonymous], 2010, Intelligence artificielle: Avec plus de 500 exercices

[6]

Filliat D., 2003, Cognitive Systems Research, V4, P243, DOI DOI 10.1016/S1389-0417(03)00008-1

[7] Dynamic motion planning for mobile robots using potential field method [J].

Ge, SS ;

Cui, YJ .

AUTONOMOUS ROBOTS, 2002, 13 (03) :207-222

[8]

Hu YR, 2004, IEEE INT CONF ROBOT, P4350

[9]

Huang BQ, 2005, PROCEEDINGS OF 2005 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-9, P85

[10] Reinforcement based mobile robot navigation in dynamic environment [J].

Jaradat, Mohammad Abdel Kareem ;

Al-Rousan, Mohammad ;

Quadan, Lara .

ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2011, 27 (01) :135-149

← 1 2 →