FUSION SPARSE AND SHAPING REWARD FUNCTION IN SOFT ACTOR-CRITIC DEEP REINFORCEMENT LEARNING FOR MOBILE ROBOT NAVIGATION

被引：0

作者：

Abu Bakar, Mohamad Hafiz ^{[1
]}

Shamsudin, Abu Ubaidah ^{[1
]}

Soomro, Zubair Adil ^{[1
]}

Tadokoro, Satoshi ^{[2
]}

Salaan, C. J. ^{[3
]}

机构：

[1] Univ Tun Hussein Onn Malaysia, Fac Elect & Elect Engn, Batu Pahat 86400, Johor, Malaysia

[2] Tohoku Univ, 2 Chome 1-1 Katahira,Aoba Ward, Sendai, Miyagi 9808577, Japan

[3] MSU Iligan Inst Technol, Dept Elect Engn & Technol, Andres Bonifacio Ave, Lanao Del Norte 9200, Philippines

来源：

JOURNAL OF THE KOREAN SOCIETY OF RADIOLOGY | 2024年 / 86卷 / 02期

关键词：

Soft Actor Critic Deep Reinforcement Learning (SAC DRL); Deep Reinforcement Learning; Mobile robot navigation; Reward function; Sparse reward; Shaping reward;

D O I：

暂无

中图分类号：

R8 [特种医学]; R445 [影像诊断学];

学科分类号：

1002 ; 100207 ; 1009 ;

摘要：

Nowadays, the advancement in autonomous robots is the latest influenced by the development of a world surrounded by new technologies. Deep Reinforcement Learning (DRL) allows systems to operate automatically, so the robot will learn the next movement based on the interaction with the environment. Moreover, since robots require continuous action, Soft Actor Critic Deep Reinforcement Learning (SAC DRL) is considered the latest DRL approach solution. SAC is used because its ability to control continuous action to produce more accurate movements. SAC fundamental is robust against unpredictability, but some weaknesses have been identified, particularly in the exploration process for accuracy learning with faster maturity. To address this issue, the study identified a solution using a reward function appropriate for the system to guide in the learning process. This research proposes several types of reward functions based on sparse and shaping reward in SAC method to investigate the effectiveness of mobile robot learning. Finally, the experiment shows that using fusion sparse and shaping rewards in the SAC DRL successfully navigates to the target position and can also increase accuracy based on the average error result of 4.99%.

引用

页码：37 / 49

页数：13

共 50 条

[31] Density estimation based soft actor-critic: deep reinforcement learning for static output feedback control with measurement noise
Wang, Ran
Tian, Ye
Kashima, Kenji
ADVANCED ROBOTICS, 2024, 38 (06) : 398 - 409
[32] Mobile Robot Navigation based on Deep Reinforcement Learning
Ruan, Xiaogang
Ren, Dingqi
Zhu, Xiaoqing
Huang, Jing
PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 6174 - 6178
[33] Deep Reinforcement Learning for the Improvement of Robot Manipulation Skills Under Sparse Reward
He, Maochang
Cheng, Hao
Duan, Feng
Sun, Zhe
Li, Si Ning
Yokoi, Hiroshi
Zhu, Chi
2022 34TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2022, : 5508 - 5513
[34] Demonstration Guided Actor-Critic Deep Reinforcement Learning for Fast Teaching of Robots in Dynamic Environments
Gong, Liang
Sun, Te
Li, Xudong
Lin, Ke
Diaz-Rodriguez, Natalia
Filliat, David
Zhang, Zhengfeng
Zhang, Junping
IFAC PAPERSONLINE, 2020, 53 (05): : 271 - 278
[35] Adaptive and Efficient Resource Allocation in Cloud Datacenters Using Actor-Critic Deep Reinforcement Learning
Chen, Zheyi
Hu, Jia
Min, Geyong
Luo, Chunbo
El-Ghazawi, Tarek
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 33 (08) : 1911 - 1923
[36] High robustness energy management strategy of hybrid electric vehicle based on improved soft actor-critic deep reinforcement learning
Sun, Wenjing
Zou, Yuan
Zhang, Xudong
Guo, Ningyuan
Zhang, Bin
Du, Guodong
ENERGY, 2022, 258
[37] Multi-objective optimization approach for permanent magnet machine viaimproved soft actor-critic based on deep reinforcement learning
Wang, Chen
Dong, Tianyu
Chen, Lei
Zhu, Guixiang
Chen, Yihan
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 264
[38] Modular deep reinforcement learning from reward and punishment for robot navigation
Wang, Jiexin
Elfwing, Stefan
Uchibe, Eiji
NEURAL NETWORKS, 2021, 135 : 115 - 126
[39] A soft actor-critic reinforcement learning framework for optimal energy management in electric vehicles with hybrid storage
Mazzi, Yahia
Ben Sassi, Hicham
Errahimi, Fatima
Es-Sbai, Najia
JOURNAL OF ENERGY STORAGE, 2024, 99
[40] Navigation of Mobile Robots Based on Deep Reinforcement Learning: Reward Function Optimization and Knowledge Transfer
Li, Weijie
Yue, Ming
Shangguan, Jinyong
Jin, Ye
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2023, 21 (02) : 563 - 574

← 1 2 3 4 5 →