Path Planning for Mobile Robots Based on TPR-DDPG

Cited by: 6
Authors
Zhao, Yaping [1]
Wang, Xiuqing [1, 2, 3]
Wang, Ruiyi [1]
Yang, Yunpeng [1]
Lv, Feng [1]
Affiliations
[1] Hebei Normal Univ, Coll Comp & Cyber Secur, Shijiazhuang 050024, Hebei, Peoples R China
[2] Hebei Prov Key Lab Network & Informat Secur, Shijiazhuang, Hebei, Peoples R China
[3] Hebei Prov Engn Res Ctr Supply Chain Big Data Ana, Shijiazhuang, Hebei, Peoples R China
Keywords
path planning; deep deterministic policy gradient (DDPG); policy network; value network; mobile robots
DOI
10.1109/IJCNN52387.2021.9533570
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Path planning is one of the key research topics in robotics. Reinforcement learning (RL) and deep learning (DL) have attracted growing attention because of RL's generality and self-learning ability and DL's strong learning ability. The deep deterministic policy gradient (DDPG) algorithm, which combines the architectures of deep Q-learning (DQN), deterministic policy gradient (DPG), and Actor-Critic (AC), differs from traditional RL methods and is suitable for continuous action spaces. A TPR-DDPG-based path planning algorithm for mobile robots is therefore proposed. In the algorithm, the state is preprocessed by several normalization methods, and complete reward functions are designed so that agents reach the target point quickly along optimal paths in complex environments. A BatchNorm layer is added to the policy network to ensure the stability of the algorithm. Finally, experiments in which agents successfully reach the target points along paths generated by the improved DDPG validate the effectiveness of the proposed algorithm.
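The abstract mentions state normalization and reward design without giving details. The following is a minimal illustrative sketch of what those two steps could look like, not the authors' method: the state layout (distance to target, heading error), the function names, the goal radius, and all reward constants are assumptions made up for this example.

```python
import math


def min_max_normalize(value, low, high):
    """Scale value from [low, high] to [0, 1], clipping out-of-range inputs."""
    if high <= low:
        raise ValueError("high must be greater than low")
    scaled = (value - low) / (high - low)
    return min(1.0, max(0.0, scaled))


def normalize_state(distance, heading_error, max_distance):
    """Hypothetical state: distance to target and heading error in radians."""
    return (
        min_max_normalize(distance, 0.0, max_distance),
        heading_error / math.pi,  # maps [-pi, pi] to [-1, 1]
    )


def reward(prev_distance, distance, collided, goal_radius=0.2):
    """Hypothetical shaped reward: terminal bonus or penalty plus a progress term."""
    if collided:
        return -100.0  # assumed collision penalty
    if distance <= goal_radius:
        return 100.0  # assumed bonus for reaching the target
    return 10.0 * (prev_distance - distance)  # positive when moving closer
```

For example, `normalize_state(5.0, math.pi / 2, 10.0)` maps both components into a bounded range before they are fed to a network, and `reward(2.0, 1.5, False)` is positive because the agent moved closer to the target.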
Pages: 8