FUSION SPARSE AND SHAPING REWARD FUNCTION IN SOFT ACTOR-CRITIC DEEP REINFORCEMENT LEARNING FOR MOBILE ROBOT NAVIGATION

被引：0

作者：

Abu Bakar, Mohamad Hafiz ^{[1
]}

Shamsudin, Abu Ubaidah ^{[1
]}

Soomro, Zubair Adil ^{[1
]}

Tadokoro, Satoshi ^{[2
]}

Salaan, C. J. ^{[3
]}

机构：

[1] Univ Tun Hussein Onn Malaysia, Fac Elect & Elect Engn, Batu Pahat 86400, Johor, Malaysia

[2] Tohoku Univ, 2 Chome 1-1 Katahira,Aoba Ward, Sendai, Miyagi 9808577, Japan

[3] MSU Iligan Inst Technol, Dept Elect Engn & Technol, Andres Bonifacio Ave, Lanao Del Norte 9200, Philippines

来源：

JOURNAL OF THE KOREAN SOCIETY OF RADIOLOGY | 2024年 / 86卷 / 02期

关键词：

Soft Actor Critic Deep Reinforcement Learning (SAC DRL); Deep Reinforcement Learning; Mobile robot navigation; Reward function; Sparse reward; Shaping reward;

D O I：

暂无

中图分类号：

R8 [特种医学]; R445 [影像诊断学];

学科分类号：

1002 ; 100207 ; 1009 ;

摘要：

Nowadays, the advancement in autonomous robots is the latest influenced by the development of a world surrounded by new technologies. Deep Reinforcement Learning (DRL) allows systems to operate automatically, so the robot will learn the next movement based on the interaction with the environment. Moreover, since robots require continuous action, Soft Actor Critic Deep Reinforcement Learning (SAC DRL) is considered the latest DRL approach solution. SAC is used because its ability to control continuous action to produce more accurate movements. SAC fundamental is robust against unpredictability, but some weaknesses have been identified, particularly in the exploration process for accuracy learning with faster maturity. To address this issue, the study identified a solution using a reward function appropriate for the system to guide in the learning process. This research proposes several types of reward functions based on sparse and shaping reward in SAC method to investigate the effectiveness of mobile robot learning. Finally, the experiment shows that using fusion sparse and shaping rewards in the SAC DRL successfully navigates to the target position and can also increase accuracy based on the average error result of 4.99%.

引用

页码：37 / 49

页数：13

共 50 条

[41] Navigation of Mobile Robots Based on Deep Reinforcement Learning: Reward Function Optimization and Knowledge Transfer
Weijie Li
Ming Yue
Jinyong Shangguan
Ye Jin
International Journal of Control, Automation and Systems, 2023, 21 : 563 - 574
[42] BAYESIAN OPTIMIZATION OF HYPER-PARAMETERS AND REWARD FUNCTION IN DEEP REINFORCEMENT LEARNING: APPLICATION TO BEHAVIOR LEARNING OF MOBILE ROBOT
Nishimura, Takuto
Sota, Ryosuke
Horiuchi, Tadashi
International Journal of Innovative Computing, Information and Control, 2025, 21 (02): : 469 - 480
[43] Continuous Control with Deep Reinforcement Learning for Mobile Robot Navigation
Xiang, Jiaqi
Li, Qingdong
Dong, Xiwang
Ren, Zhang
2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 1501 - 1506
[44] A Brief Survey: Deep Reinforcement Learning in Mobile Robot Navigation
Jiang, Haoge
Wang, Han
Yau, Wei-Yun
Wan, Kong-Wah
PROCEEDINGS OF THE 15TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2020), 2020, : 592 - 597
[45] An actor-critic framework based on deep reinforcement learning for addressing flexible job shop scheduling problems
Zhao, Cong
Deng, Na
MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2024, 21 (01) : 1445 - 1471
[46] An effective deep actor-critic reinforcement learning method for solving the flexible job shop scheduling problem
Wan L.
Cui X.
Zhao H.
Li C.
Wang Z.
Neural Computing and Applications, 2024, 36 (20) : 11877 - 11899
[47] CBNAV: Costmap Based Approach to Deep Reinforcement Learning Mobile Robot Navigation
Tomasi Junior, Darci Luiz
Todt, Eduardo
2021 LATIN AMERICAN ROBOTICS SYMPOSIUM / 2021 BRAZILIAN SYMPOSIUM ON ROBOTICS / 2021 WORKSHOP OF ROBOTICS IN EDUCATION (LARS-SBR-WRE 2021), 2021, : 324 - 329
[48] Sensor-based Mobile Robot Navigation via Deep Reinforcement Learning
Han, Seungho-Ho
Choi, Ho-Jin
Benz, Philipp
Loaiciga, Jorge
2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2018, : 147 - 154
[49] A Novel Augmentative Backward Reward Function with Deep Reinforcement Learning for Autonomous UAV Navigation
Chansuparp, Manit
Jitkajornwanich, Kulsawasd
APPLIED ARTIFICIAL INTELLIGENCE, 2022, 36 (01)
[50] Experimental Research on Deep Reinforcement Learning in Autonomous navigation of Mobile Robot
Yue, Pengyu
Xin, Jing
Zhao, Huan
Liu, Ding
Shan, Mao
Zhang, Jian
PROCEEDINGS OF THE 2019 14TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2019), 2019, : 1612 - 1616

← 1 2 3 4 5 →