FUSION SPARSE AND SHAPING REWARD FUNCTION IN SOFT ACTOR-CRITIC DEEP REINFORCEMENT LEARNING FOR MOBILE ROBOT NAVIGATION

被引:0
作者
Abu Bakar, Mohamad Hafiz [1 ]
Shamsudin, Abu Ubaidah [1 ]
Soomro, Zubair Adil [1 ]
Tadokoro, Satoshi [2 ]
Salaan, C. J. [3 ]
机构
[1] Univ Tun Hussein Onn Malaysia, Fac Elect & Elect Engn, Batu Pahat 86400, Johor, Malaysia
[2] Tohoku Univ, 2 Chome 1-1 Katahira,Aoba Ward, Sendai, Miyagi 9808577, Japan
[3] MSU Iligan Inst Technol, Dept Elect Engn & Technol, Andres Bonifacio Ave, Lanao Del Norte 9200, Philippines
来源
JOURNAL OF THE KOREAN SOCIETY OF RADIOLOGY | 2024年 / 86卷 / 02期
关键词
Soft Actor Critic Deep Reinforcement Learning (SAC DRL); Deep Reinforcement Learning; Mobile robot navigation; Reward function; Sparse reward; Shaping reward;
D O I
暂无
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Nowadays, the advancement in autonomous robots is the latest influenced by the development of a world surrounded by new technologies. Deep Reinforcement Learning (DRL) allows systems to operate automatically, so the robot will learn the next movement based on the interaction with the environment. Moreover, since robots require continuous action, Soft Actor Critic Deep Reinforcement Learning (SAC DRL) is considered the latest DRL approach solution. SAC is used because its ability to control continuous action to produce more accurate movements. SAC fundamental is robust against unpredictability, but some weaknesses have been identified, particularly in the exploration process for accuracy learning with faster maturity. To address this issue, the study identified a solution using a reward function appropriate for the system to guide in the learning process. This research proposes several types of reward functions based on sparse and shaping reward in SAC method to investigate the effectiveness of mobile robot learning. Finally, the experiment shows that using fusion sparse and shaping rewards in the SAC DRL successfully navigates to the target position and can also increase accuracy based on the average error result of 4.99%.
引用
收藏
页码:37 / 49
页数:13
相关论文
共 50 条
  • [31] Density estimation based soft actor-critic: deep reinforcement learning for static output feedback control with measurement noise
    Wang, Ran
    Tian, Ye
    Kashima, Kenji
    ADVANCED ROBOTICS, 2024, 38 (06) : 398 - 409
  • [32] Mobile Robot Navigation based on Deep Reinforcement Learning
    Ruan, Xiaogang
    Ren, Dingqi
    Zhu, Xiaoqing
    Huang, Jing
    PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 6174 - 6178
  • [33] Deep Reinforcement Learning for the Improvement of Robot Manipulation Skills Under Sparse Reward
    He, Maochang
    Cheng, Hao
    Duan, Feng
    Sun, Zhe
    Li, Si Ning
    Yokoi, Hiroshi
    Zhu, Chi
    2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 5508 - 5513
  • [34] Demonstration Guided Actor-Critic Deep Reinforcement Learning for Fast Teaching of Robots in Dynamic Environments
    Gong, Liang
    Sun, Te
    Li, Xudong
    Lin, Ke
    Diaz-Rodriguez, Natalia
    Filliat, David
    Zhang, Zhengfeng
    Zhang, Junping
    IFAC PAPERSONLINE, 2020, 53 (05): : 271 - 278
  • [35] Adaptive and Efficient Resource Allocation in Cloud Datacenters Using Actor-Critic Deep Reinforcement Learning
    Chen, Zheyi
    Hu, Jia
    Min, Geyong
    Luo, Chunbo
    El-Ghazawi, Tarek
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (08) : 1911 - 1923
  • [36] High robustness energy management strategy of hybrid electric vehicle based on improved soft actor-critic deep reinforcement learning
    Sun, Wenjing
    Zou, Yuan
    Zhang, Xudong
    Guo, Ningyuan
    Zhang, Bin
    Du, Guodong
    ENERGY, 2022, 258
  • [37] Multi-objective optimization approach for permanent magnet machine viaimproved soft actor-critic based on deep reinforcement learning
    Wang, Chen
    Dong, Tianyu
    Chen, Lei
    Zhu, Guixiang
    Chen, Yihan
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 264
  • [38] Modular deep reinforcement learning from reward and punishment for robot navigation
    Wang, Jiexin
    Elfwing, Stefan
    Uchibe, Eiji
    NEURAL NETWORKS, 2021, 135 : 115 - 126
  • [39] A soft actor-critic reinforcement learning framework for optimal energy management in electric vehicles with hybrid storage
    Mazzi, Yahia
    Ben Sassi, Hicham
    Errahimi, Fatima
    Es-Sbai, Najia
    JOURNAL OF ENERGY STORAGE, 2024, 99
  • [40] Navigation of Mobile Robots Based on Deep Reinforcement Learning: Reward Function Optimization and Knowledge Transfer
    Li, Weijie
    Yue, Ming
    Shangguan, Jinyong
    Jin, Ye
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2023, 21 (02) : 563 - 574