FUSION SPARSE AND SHAPING REWARD FUNCTION IN SOFT ACTOR-CRITIC DEEP REINFORCEMENT LEARNING FOR MOBILE ROBOT NAVIGATION

被引:0
|
作者
Abu Bakar, Mohamad Hafiz [1 ]
Shamsudin, Abu Ubaidah [1 ]
Soomro, Zubair Adil [1 ]
Tadokoro, Satoshi [2 ]
Salaan, C. J. [3 ]
机构
[1] Univ Tun Hussein Onn Malaysia, Fac Elect & Elect Engn, Batu Pahat 86400, Johor, Malaysia
[2] Tohoku Univ, 2 Chome 1-1 Katahira,Aoba Ward, Sendai, Miyagi 9808577, Japan
[3] MSU Iligan Inst Technol, Dept Elect Engn & Technol, Andres Bonifacio Ave, Lanao Del Norte 9200, Philippines
来源
JOURNAL OF THE KOREAN SOCIETY OF RADIOLOGY | 2024年 / 86卷 / 02期
关键词
Soft Actor Critic Deep Reinforcement Learning (SAC DRL); Deep Reinforcement Learning; Mobile robot navigation; Reward function; Sparse reward; Shaping reward;
D O I
暂无
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Nowadays, the advancement in autonomous robots is the latest influenced by the development of a world surrounded by new technologies. Deep Reinforcement Learning (DRL) allows systems to operate automatically, so the robot will learn the next movement based on the interaction with the environment. Moreover, since robots require continuous action, Soft Actor Critic Deep Reinforcement Learning (SAC DRL) is considered the latest DRL approach solution. SAC is used because its ability to control continuous action to produce more accurate movements. SAC fundamental is robust against unpredictability, but some weaknesses have been identified, particularly in the exploration process for accuracy learning with faster maturity. To address this issue, the study identified a solution using a reward function appropriate for the system to guide in the learning process. This research proposes several types of reward functions based on sparse and shaping reward in SAC method to investigate the effectiveness of mobile robot learning. Finally, the experiment shows that using fusion sparse and shaping rewards in the SAC DRL successfully navigates to the target position and can also increase accuracy based on the average error result of 4.99%.
引用
收藏
页码:37 / 49
页数:13
相关论文
共 50 条
  • [1] FUSION SPARSE AND SHAPING REWARD FUNCTION IN SOFT ACTOR-CRITIC DEEP REINFORCEMENT LEARNING FOR MOBILE ROBOT NAVIGATION
    Bakar, Mohamad Hafiz Abu
    Shamsudin, Abu Ubaidah
    Soomro, Zubair Adil
    Tadokoro, Satoshi
    Salaan, C. J.
    JURNAL TEKNOLOGI-SCIENCES & ENGINEERING, 2024, 86 (02): : 37 - 49
  • [2] COMMON-SENSICAL INCENTIVE REWARD IN DEEP ACTOR-CRITIC REINFORCEMENT LEARNING FOR MOBILE ROBOT NAVIGATION
    Sendari, Siti
    Muladi
    Ardiyansyah, Firman
    Setumin, Samsul
    Mokhtar, Norrima Binti
    Lin, Hsien-, I
    Hartono, Pitoyo
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2024, 20 (02): : 373 - 389
  • [3] Reward Shaping-Based Actor-Critic Deep Reinforcement Learning for Residential Energy Management
    Lu, Renzhi
    Jiang, Zhenyu
    Wu, Huaming
    Ding, Yuemin
    Wang, Dong
    Zhang, Hai-Tao
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (03) : 2662 - 2673
  • [4] Soft Actor-Critic for Navigation of Mobile Robots
    Junior Costa de Jesus
    Victor Augusto Kich
    Alisson Henrique Kolling
    Ricardo Bedin Grando
    Marco Antonio de Souza Leite Cuadros
    Daniel Fernando Tello Gamarra
    Journal of Intelligent & Robotic Systems, 2021, 102
  • [5] Soft Actor-Critic for Navigation of Mobile Robots
    de Jesus, Junior Costa
    Kich, Victor Augusto
    Kolling, Alisson Henrique
    Grando, Ricardo Bedin
    Cuadros, Marco Antonio de Souza Leite
    Gamarra, Daniel Fernando Tello
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2021, 102 (02)
  • [6] Integrated Actor-Critic for Deep Reinforcement Learning
    Zheng, Jiaohao
    Kurt, Mehmet Necip
    Wang, Xiaodong
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT IV, 2021, 12894 : 505 - 518
  • [7] Deep Actor-Critic Reinforcement Learning for Anomaly Detection
    Zhong, Chen
    Gursoy, M. Cenk
    Velipasalar, Senem
    2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,
  • [8] ACTOR-CRITIC DEEP REINFORCEMENT LEARNING FOR DYNAMIC MULTICHANNEL ACCESS
    Zhong, Chen
    Lu, Ziyang
    Gursoy, M. Cenk
    Velipasalar, Senem
    2018 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2018), 2018, : 599 - 603
  • [9] Mapless Navigation for Mobile Robots Based on Improved Soft Actor-Critic Algorithm
    Yang, Binglin
    Wang, Hongwei
    Xia, Hao
    39TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION, YAC 2024, 2024, : 755 - 761
  • [10] A soft actor-critic reinforcement learning algorithm for network intrusion detection
    Li, Zhengfa
    Huang, Chuanhe
    Deng, Shuhua
    Qiu, Wanyu
    Gao, Xieping
    COMPUTERS & SECURITY, 2023, 135