Using Deep Reinforcement Learning for Navigation in Simulated Hallways

被引：2

作者：

Leao, Goncalo ^{[1
,2
]}

Almeida, Filipe ^{[1
]}

Trigo, Emanuel ^{[1
]}

Ferreira, Henrique ^{[1
]}

Sousa, Armando ^{[1
,2
]}

Reis, Luis Paulo ^{[1
,3
]}

机构：

[1] Univ Porto, FEUP Fac Engn, Porto, Portugal

[2] INESC TEC INESC Technol & Sci, Porto, Portugal

[3] LIACC Artificial Intelligence & Comp Sci Lab, Porto, Portugal

来源：

2023 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS, ICARSC | 2023年

关键词：

Deep Q-Network; Intelligent Robotics; Navigation; Reinforcement Learning; Simulation;

D O I：

10.1109/ICARSC58346.2023.10129605

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Reinforcement Learning (RL) is a well-suited paradigm to train robots since it does not require any previous information or database to train an agent. This paper explores using Deep Reinforcement Learning (DRL) to train a robot to navigate in maps containing different sorts of obstacles and which emulate hallways. Training and testing were performed using the Flatland 2D simulator and a Deep Q-Network (DQN) provided by OpenAI gym. Different sets of maps were used for training and testing. The experiments illustrate how well the robot is able to navigate in maps distinct from the ones used for training by learning new behaviours (namely following walls) and highlight the key challenges when solving this task using DRL, including the appropriate definition of the state space and reward function, as well as of the stopping criteria during training.

引用

页码：207 / 213

页数：7

共 15 条

[1]

[Anonymous], 2016, Openai gym

[2] Deep Reinforcement Learning A brief survey [J].

Arulkumaran, Kai ;

Deisenroth, Marc Peter ;

Brundage, Miles ;

Bharath, Anil Anthony .

IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) :26-38

[3]

Cahill A., 2010, CATASTROPHIC FORGETT

[4] A deep reinforcement learning based method for real-time path planning and dynamic obstacle avoidance [J].

Chen, Pengzhan ;

Pei, Jiean ;

Lu, Weiqing ;

Li, Mingzhen .

NEUROCOMPUTING, 2022, 497 :64-75

[5]

Minguez J, 2016, SPRINGER HANDBOOK OF ROBOTICS, P1177

[6] Human-level control through deep reinforcement learning [J].

Mnih, Volodymyr ;

Kavukcuoglu, Koray ;

Silver, David ;

Rusu, Andrei A. ;

Veness, Joel ;

Bellemare, Marc G. ;

Graves, Alex ;

Riedmiller, Martin ;

Fidjeland, Andreas K. ;

Ostrovski, Georg ;

Petersen, Stig ;

Beattie, Charles ;

Sadik, Amir ;

Antonoglou, Ioannis ;

King, Helen ;

Kumaran, Dharshan ;

Wierstra, Daan ;

Legg, Shane ;

Hassabis, Demis .

NATURE, 2015, 518 (7540) :529-533

[7] Learning Navigation Policies for Mobile Robots in Deep Reinforcement Learning with Random Network Distillation [J].

Pan, Lifan ;

Li, Anyi ;

Ma, Jun ;

Ji, Jianmin .

2021 5TH INTERNATIONAL CONFERENCE ON INNOVATION IN ARTIFICIAL INTELLIGENCE (ICIAI 2021), 2021, :151-157

[8]

Rodrigues Nelson., 2022, Iberian Robotics Conference (ROBOT), page, P274

[9]

Tai L, 2017, IEEE INT C INT ROBOT, P31

[10]

Ventuzelos V., 2022, TEACHING ROS12 REINF, P586

← 1 2 →