FUSION SPARSE AND SHAPING REWARD FUNCTION IN SOFT ACTOR-CRITIC DEEP REINFORCEMENT LEARNING FOR MOBILE ROBOT NAVIGATION

被引:0
作者
Abu Bakar, Mohamad Hafiz [1 ]
Shamsudin, Abu Ubaidah [1 ]
Soomro, Zubair Adil [1 ]
Tadokoro, Satoshi [2 ]
Salaan, C. J. [3 ]
机构
[1] Univ Tun Hussein Onn Malaysia, Fac Elect & Elect Engn, Batu Pahat 86400, Johor, Malaysia
[2] Tohoku Univ, 2 Chome 1-1 Katahira,Aoba Ward, Sendai, Miyagi 9808577, Japan
[3] MSU Iligan Inst Technol, Dept Elect Engn & Technol, Andres Bonifacio Ave, Lanao Del Norte 9200, Philippines
来源
JOURNAL OF THE KOREAN SOCIETY OF RADIOLOGY | 2024年 / 86卷 / 02期
关键词
Soft Actor Critic Deep Reinforcement Learning (SAC DRL); Deep Reinforcement Learning; Mobile robot navigation; Reward function; Sparse reward; Shaping reward;
D O I
暂无
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Nowadays, the advancement in autonomous robots is the latest influenced by the development of a world surrounded by new technologies. Deep Reinforcement Learning (DRL) allows systems to operate automatically, so the robot will learn the next movement based on the interaction with the environment. Moreover, since robots require continuous action, Soft Actor Critic Deep Reinforcement Learning (SAC DRL) is considered the latest DRL approach solution. SAC is used because its ability to control continuous action to produce more accurate movements. SAC fundamental is robust against unpredictability, but some weaknesses have been identified, particularly in the exploration process for accuracy learning with faster maturity. To address this issue, the study identified a solution using a reward function appropriate for the system to guide in the learning process. This research proposes several types of reward functions based on sparse and shaping reward in SAC method to investigate the effectiveness of mobile robot learning. Finally, the experiment shows that using fusion sparse and shaping rewards in the SAC DRL successfully navigates to the target position and can also increase accuracy based on the average error result of 4.99%.
引用
收藏
页码:37 / 49
页数:13
相关论文
共 50 条
  • [41] Navigation of Mobile Robots Based on Deep Reinforcement Learning: Reward Function Optimization and Knowledge Transfer
    Weijie Li
    Ming Yue
    Jinyong Shangguan
    Ye Jin
    International Journal of Control, Automation and Systems, 2023, 21 : 563 - 574
  • [42] BAYESIAN OPTIMIZATION OF HYPER-PARAMETERS AND REWARD FUNCTION IN DEEP REINFORCEMENT LEARNING: APPLICATION TO BEHAVIOR LEARNING OF MOBILE ROBOT
    Nishimura, Takuto
    Sota, Ryosuke
    Horiuchi, Tadashi
    International Journal of Innovative Computing, Information and Control, 2025, 21 (02): : 469 - 480
  • [43] Continuous Control with Deep Reinforcement Learning for Mobile Robot Navigation
    Xiang, Jiaqi
    Li, Qingdong
    Dong, Xiwang
    Ren, Zhang
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 1501 - 1506
  • [44] A Brief Survey: Deep Reinforcement Learning in Mobile Robot Navigation
    Jiang, Haoge
    Wang, Han
    Yau, Wei-Yun
    Wan, Kong-Wah
    PROCEEDINGS OF THE 15TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2020), 2020, : 592 - 597
  • [45] An actor-critic framework based on deep reinforcement learning for addressing flexible job shop scheduling problems
    Zhao, Cong
    Deng, Na
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2024, 21 (01) : 1445 - 1471
  • [46] An effective deep actor-critic reinforcement learning method for solving the flexible job shop scheduling problem
    Wan L.
    Cui X.
    Zhao H.
    Li C.
    Wang Z.
    Neural Computing and Applications, 2024, 36 (20) : 11877 - 11899
  • [47] CBNAV: Costmap Based Approach to Deep Reinforcement Learning Mobile Robot Navigation
    Tomasi Junior, Darci Luiz
    Todt, Eduardo
    2021 LATIN AMERICAN ROBOTICS SYMPOSIUM / 2021 BRAZILIAN SYMPOSIUM ON ROBOTICS / 2021 WORKSHOP OF ROBOTICS IN EDUCATION (LARS-SBR-WRE 2021), 2021, : 324 - 329
  • [48] Sensor-based Mobile Robot Navigation via Deep Reinforcement Learning
    Han, Seungho-Ho
    Choi, Ho-Jin
    Benz, Philipp
    Loaiciga, Jorge
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2018, : 147 - 154
  • [49] A Novel Augmentative Backward Reward Function with Deep Reinforcement Learning for Autonomous UAV Navigation
    Chansuparp, Manit
    Jitkajornwanich, Kulsawasd
    APPLIED ARTIFICIAL INTELLIGENCE, 2022, 36 (01)
  • [50] Experimental Research on Deep Reinforcement Learning in Autonomous navigation of Mobile Robot
    Yue, Pengyu
    Xin, Jing
    Zhao, Huan
    Liu, Ding
    Shan, Mao
    Zhang, Jian
    PROCEEDINGS OF THE 2019 14TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2019), 2019, : 1612 - 1616