Path Planning of Autonomous Mobile Robot in Comprehensive Unknown Environment Using Deep Reinforcement Learning

被引：24

作者：

Bai, Zekun ^{[1
]}

Pang, Hui ^{[1
]}

He, Zhaonian ^{[1
]}

Zhao, Bin ^{[1
]}

Wang, Tong ^{[1
]}

机构：

[1] Xian Univ Technol, Sch Mech & Precis Instrument Engn, Xian 710048, Peoples R China

来源：

IEEE INTERNET OF THINGS JOURNAL | 2024年 / 11卷 / 12期

基金：

中国国家自然科学基金;

关键词：

Autonomous mobile robot (AMR); deep reinforcement learning (DRL); double deep Q network (DDQN); path planning; path smooth; A-ASTERISK; ALGORITHM; NETWORK;

D O I：

10.1109/JIOT.2024.3379361

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In real-world situations, some unavoidable problems like significant dependence on environment information, long inference time and weak anti-disturbance ability are often involved in path planning of autonomous mobile robot (AMR) under unknown environments. To solve these issues, this article proposes an improved deep reinforcement learning-based path-planning algorithm to find out an optimized path for a class of AMRs. First, the path planning of AMR is described as a Markov decision process framework, and the double deep Q network (DDQN) is utilized to obtain the optimal adaptive solutions of AMR's path planning. Second, a comprehensive reward function integrated with heuristic function is designed to navigate the AMR into the target area. Afterwards, an optimized deep neural network with an adaptive epsilon-greedy action selection policy is designed to deal with the tradeoff between exploration and exploitation, thus further to improve the global searching capability and the convergence performance for the AMR path planning. Moreover, Bezier curve theory is utilized to smooth the planned path. Finally, the comparative simulations are carried out to validate our proposed path-planning algorithm. The results show that, compared with DQN, A*, RRT, and APF algorithms, our improved DDQN algorithm can produce safer and shorter global paths in comprehensive unknown environments. Meanwhile, the IDDQN algorithm has strong adaptability to random disturbances in unknown environments.

引用

页码：22153 / 22166

页数：14

共 48 条

[1] Multi-objective path planning of an autonomous mobile robot using hybrid PSO-MFB optimization algorithm [J].

Ajeil, Fatin H. ;

Ibraheem, Ibraheem Kasim ;

Sahib, Mouayad A. ;

Humaidi, Amjad J. .

APPLIED SOFT COMPUTING, 2020, 89

[2] Courier routing and assignment for food delivery service using reinforcement learning [J].

Bozanta, Aysun ;

Cevik, Mucahit ;

Kavaklioglu, Can ;

Kavuk, Eray M. ;

Tosun, Ayse ;

Sonuc, Sibel B. ;

Duranel, Alper ;

Basar, Ayse .

COMPUTERS & INDUSTRIAL ENGINEERING, 2022, 164

[3]

Chen G., 2024, Mech. Syst. Signal Process., V206

[4] Path Planning Based on Deep Reinforcement Learning for Autonomous Underwater Vehicles Under Ocean Current Disturbance [J].

Chu, Zhenzhong ;

Wang, Fulun ;

Lei, Tingjun ;

Luo, Chaomin .

IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (01) :108-120

[5] An improved RRT* algorithm for robot path planning based on path expansion heuristic sampling [J].

Ding, Jun ;

Zhou, Yinxuan ;

Huang, Xia ;

Song, Kun ;

Lu, Shiqing ;

Wang, Lusheng .

JOURNAL OF COMPUTATIONAL SCIENCE, 2023, 67

[6] An improved A* algorithm for the industrial robot path planning with high success rate and short length [J].

Fu, Bing ;

Chen, Lin ;

Zhou, Yuntao ;

Zheng, Dong ;

Wei, Zhiqi ;

Dai, Jun ;

Pan, Haihong .

ROBOTICS AND AUTONOMOUS SYSTEMS, 2018, 106 :26-37

[7] Deep reinforcement learning for adaptive path planning and control of an autonomous underwater vehicle [J].

Hadi, Behnaz ;

Khosravi, Alireza ;

Sarhadi, Pouria .

APPLIED OCEAN RESEARCH, 2022, 129

[8] Path planning for asteroid hopping rovers with pre-trained deep reinforcement learning architectures [J].

Jiang, Jianxun ;

Zeng, Xiangyuan ;

Guzzetti, Davide ;

You, Yuyang .

ACTA ASTRONAUTICA, 2020, 171 :265-279

[9] Optimal Constant Acceleration Motion Primitives [J].

Klancar, Gregor ;

Blazic, Saso .

IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (09) :8502-8511

[10] Experimental evaluation of autonomous map-based Spot navigation in confined environments [J].

Koval, Anton ;

Karlsson, Samuel ;

Lulea, George Nikolakopoulos .

BIOMIMETIC INTELLIGENCE AND ROBOTICS, 2022, 2 (01)

← 1 2 3 4 5 →