Mobile Robot Navigation Using Deep Reinforcement Learning

被引:24
作者
Lee, Min-Fan Ricky [1 ,2 ]
Yusuf, Sharfiden Hassen [1 ]
机构
[1] Natl Taiwan Univ Sci & Technol, Grad Inst Automat & Control, Taipei 106335, Taiwan
[2] Natl Taiwan Univ Sci & Technol, Ctr Cyber Phys Syst Innovat, Taipei 106335, Taiwan
关键词
autonomous navigation; collision avoidance; reinforcement learning; mobile robots; LOCALIZATION; SLAM;
D O I
10.3390/pr10122748
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
Learning how to navigate autonomously in an unknown indoor environment without colliding with static and dynamic obstacles is important for mobile robots. The conventional mobile robot navigation system does not have the ability to learn autonomously. Unlike conventional approaches, this paper proposes an end-to-end approach that uses deep reinforcement learning for autonomous mobile robot navigation in an unknown environment. Two types of deep Q-learning agents, such as deep Q-network and double deep Q-network agents are proposed to enable the mobile robot to autonomously learn about collision avoidance and navigation capabilities in an unknown environment. For autonomous mobile robot navigation in an unknown environment, the process of detecting the target object is first carried out using a deep neural network model, and then the process of navigation to the target object is followed using the deep Q-network or double deep Q-network algorithm. The simulation results show that the mobile robot can autonomously navigate, recognize, and reach the target object location in an unknown environment without colliding with static and dynamic obstacles. Similar results are obtained in real-world experiments, but only with static obstacles. The DDQN agent outperforms the DQN agent in reaching the target object location in the test simulation by 5.06%.
引用
收藏
页数:22
相关论文
共 47 条
[1]   Learning dexterous in-hand manipulation [J].
Andrychowicz, Marcin ;
Baker, Bowen ;
Chociej, Maciek ;
Jozefowicz, Rafal ;
McGrew, Bob ;
Pachocki, Jakub ;
Petron, Arthur ;
Plappert, Matthias ;
Powell, Glenn ;
Ray, Alex ;
Schneider, Jonas ;
Sidor, Szymon ;
Tobin, Josh ;
Welinder, Peter ;
Weng, Lilian ;
Zaremba, Wojciech .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2020, 39 (01) :3-20
[2]  
Aref Mohammad M., 2016, 2016 IEEE International Conference on Automation Science and Engineering (CASE), P515, DOI 10.1109/COASE.2016.7743449
[3]   Consistency of the EKF-SLAM algorithm [J].
Bailey, Tim ;
Nieto, Juan ;
Guivant, Jose ;
Stevens, Michael ;
Nebot, Eduardo .
2006 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-12, 2006, :3562-+
[4]  
Dhariwal C., 2017, GITHUB REPOS
[5]   Simultaneous localization and mapping: Part I [J].
Durrant-Whyte, Hugh ;
Bailey, Tim .
IEEE ROBOTICS & AUTOMATION MAGAZINE, 2006, 13 (02) :99-108
[6]   SONAR-BASED REAL-WORLD MAPPING AND NAVIGATION [J].
ELFES, A .
IEEE JOURNAL OF ROBOTICS AND AUTOMATION, 1987, 3 (03) :249-265
[7]  
Garulli A, 2005, IEEE DECIS CONTR P, P2041
[8]   Combining Hector SLAM and Artificial Potential Field for Autonomous Navigation Inside a Greenhouse [J].
Harik, El Houssein Chouaib ;
Korsaeth, Audun .
ROBOTICS, 2018, 7 (02)
[9]  
Hasselt H. V., 2010, Advances in Neural Information Processing Systems, V23, P2613
[10]  
Heess N, 2017, ARXIV