FUSION SPARSE AND SHAPING REWARD FUNCTION IN SOFT ACTOR-CRITIC DEEP REINFORCEMENT LEARNING FOR MOBILE ROBOT NAVIGATION

被引：0

作者：

Abu Bakar, Mohamad Hafiz ^{[1
]}

Shamsudin, Abu Ubaidah ^{[1
]}

Soomro, Zubair Adil ^{[1
]}

Tadokoro, Satoshi ^{[2
]}

Salaan, C. J. ^{[3
]}

机构：

[1] Univ Tun Hussein Onn Malaysia, Fac Elect & Elect Engn, Batu Pahat 86400, Johor, Malaysia

[2] Tohoku Univ, 2 Chome 1-1 Katahira,Aoba Ward, Sendai, Miyagi 9808577, Japan

[3] MSU Iligan Inst Technol, Dept Elect Engn & Technol, Andres Bonifacio Ave, Lanao Del Norte 9200, Philippines

来源：

JOURNAL OF THE KOREAN SOCIETY OF RADIOLOGY | 2024年 / 86卷 / 02期

关键词：

Soft Actor Critic Deep Reinforcement Learning (SAC DRL); Deep Reinforcement Learning; Mobile robot navigation; Reward function; Sparse reward; Shaping reward;

D O I：

暂无

中图分类号：

R8 [特种医学]; R445 [影像诊断学];

学科分类号：

1002 ; 100207 ; 1009 ;

摘要：

Nowadays, the advancement in autonomous robots is the latest influenced by the development of a world surrounded by new technologies. Deep Reinforcement Learning (DRL) allows systems to operate automatically, so the robot will learn the next movement based on the interaction with the environment. Moreover, since robots require continuous action, Soft Actor Critic Deep Reinforcement Learning (SAC DRL) is considered the latest DRL approach solution. SAC is used because its ability to control continuous action to produce more accurate movements. SAC fundamental is robust against unpredictability, but some weaknesses have been identified, particularly in the exploration process for accuracy learning with faster maturity. To address this issue, the study identified a solution using a reward function appropriate for the system to guide in the learning process. This research proposes several types of reward functions based on sparse and shaping reward in SAC method to investigate the effectiveness of mobile robot learning. Finally, the experiment shows that using fusion sparse and shaping rewards in the SAC DRL successfully navigates to the target position and can also increase accuracy based on the average error result of 4.99%.

引用

页码：37 / 49

页数：13

共 50 条

[1] FUSION SPARSE AND SHAPING REWARD FUNCTION IN SOFT ACTOR-CRITIC DEEP REINFORCEMENT LEARNING FOR MOBILE ROBOT NAVIGATION
Bakar, Mohamad Hafiz Abu
Shamsudin, Abu Ubaidah
Soomro, Zubair Adil
Tadokoro, Satoshi
Salaan, C. J.
JURNAL TEKNOLOGI-SCIENCES & ENGINEERING, 2024, 86 (02): : 37 - 49
[2] COMMON-SENSICAL INCENTIVE REWARD IN DEEP ACTOR-CRITIC REINFORCEMENT LEARNING FOR MOBILE ROBOT NAVIGATION
Sendari, Siti
Muladi
Ardiyansyah, Firman
Setumin, Samsul
Mokhtar, Norrima Binti
Lin, Hsien-, I
Hartono, Pitoyo
INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2024, 20 (02): : 373 - 389
[3] Reward Shaping-Based Actor-Critic Deep Reinforcement Learning for Residential Energy Management
Lu, Renzhi
Jiang, Zhenyu
Wu, Huaming
Ding, Yuemin
Wang, Dong
Zhang, Hai-Tao
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (03) : 2662 - 2673
[4] Soft Actor-Critic for Navigation of Mobile Robots
Junior Costa de Jesus
Victor Augusto Kich
Alisson Henrique Kolling
Ricardo Bedin Grando
Marco Antonio de Souza Leite Cuadros
Daniel Fernando Tello Gamarra
Journal of Intelligent & Robotic Systems, 2021, 102
[5] Soft Actor-Critic for Navigation of Mobile Robots
de Jesus, Junior Costa
Kich, Victor Augusto
Kolling, Alisson Henrique
Grando, Ricardo Bedin
Cuadros, Marco Antonio de Souza Leite
Gamarra, Daniel Fernando Tello
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2021, 102 (02)
[6] Integrated Actor-Critic for Deep Reinforcement Learning
Zheng, Jiaohao
Kurt, Mehmet Necip
Wang, Xiaodong
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT IV, 2021, 12894 : 505 - 518
[7] Deep Actor-Critic Reinforcement Learning for Anomaly Detection
Zhong, Chen
Gursoy, M. Cenk
Velipasalar, Senem
2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2019,
[8] ACTOR-CRITIC DEEP REINFORCEMENT LEARNING FOR DYNAMIC MULTICHANNEL ACCESS
Zhong, Chen
Lu, Ziyang
Gursoy, M. Cenk
Velipasalar, Senem
2018 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2018), 2018, : 599 - 603
[9] Mapless Navigation for Mobile Robots Based on Improved Soft Actor-Critic Algorithm
Yang, Binglin
Wang, Hongwei
Xia, Hao
39TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION, YAC 2024, 2024, : 755 - 761
[10] A soft actor-critic reinforcement learning algorithm for network intrusion detection
Li, Zhengfa
Huang, Chuanhe
Deng, Shuhua
Qiu, Wanyu
Gao, Xieping
COMPUTERS & SECURITY, 2023, 135

← 1 2 3 4 5 →