SOFT ACTOR-CRITIC ALGORITHM WITH ADAPTIVE NORMALIZATION

被引：0

作者：

Gao, Xiaonan ^{[1
]}

Wu, Ziyi ^{[1
]}

Zhu, Xianchao ^{[1
]}

Cai, Lei ^{[2
]}

机构：

[1] Henan Univ Technol, Sch Artificial Intelligence & Big Data, Zhengzhou 450001, Peoples R China

[2] Henan Inst Sci & Technol, Sch Artificial Intelligence, Xinxiang 453003, Peoples R China

来源：

JOURNAL OF NONLINEAR FUNCTIONAL ANALYSIS | 2025年 / 2025卷

基金：

中国国家自然科学基金;

关键词：

Adaptive normalization; Deep reinforcement learning; Reward mechanism; Soft actor-critic algorithm; GAME; GO;

D O I：

10.23952/jnfa.2025.6

中图分类号：

O29 [应用数学];

学科分类号：

070104 ;

摘要：

In recent years, breakthroughs were made in the field of deep reinforcement learning, but, their applications in the real world were seriously affected due to the instability of algorithms and the difficulty in ensuring convergence. As a typical algorithm in reinforcement learning, although the SAC algorithm enhances the robustness and agent's exploration ability by introducing the concept of maximum entropy, it still has the disadvantage of instability in the training process. In order to solve the problems, this paper proposes an Adaptive Normalization-based SAC (AN-SAC) algorithm. By introducing the adaptive normalized reward mechanism into the SAC algorithm, our method can dynamically adjust the normalized parameters of the reward during the training process so that the reward value has zero mean and unit variance. Thus it better adapts to the reward distribution and improves the performance and stability of the algorithm. Experimental results demonstrate that the performance and stability of the AN-SAC algorithm are significantly improved compared with the SAC algorithm.

引用

页数：10

共 50 条

[41] Optimal Scheduling of Regional Integrated Energy System Based on Advantage Learning Soft Actor-critic Algorithm and Transfer Learning
Luo W.
Zhang J.
He Y.
Gu T.
Nie X.
Fan L.
Yuan X.
Li B.
Dianwang Jishu/Power System Technology, 2023, 47 (04): : 1601 - 1611
[42] Integrated Actor-Critic for Deep Reinforcement Learning
Zheng, Jiaohao
Kurt, Mehmet Necip
Wang, Xiaodong
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT IV, 2021, 12894 : 505 - 518
[43] Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors
Duan, Jingliang
Guan, Yang
Li, Shengbo Eben
Ren, Yangang
Sun, Qi
Cheng, Bo
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (11) : 6584 - 6598
[44] Hybrid Actor-Critic Reinforcement Learning in Parameterized Action Space
Fan, Zhou
Su, Rui
Zhang, Weinan
Yu, Yong
PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2279 - 2285
[45] Memory-based soft actor-critic with prioritized experience replay for autonomous navigation
Wei, Zhigang
Xiao, Wendong
Yuan, Liang
Ran, Teng
Cui, Jianping
Lv, Kai
INTELLIGENT SERVICE ROBOTICS, 2024, 17 (03) : 621 - 630
[46] Application of Soft Actor-Critic algorithms in optimizing wastewater treatment with time delays integration
Mohammadi, Esmaeel
Ortiz-Arroyo, Daniel
Hansen, Aviaja Anna
Stokholm-Bjerregaard, Mikkel
Gros, Sebastien
Anand, Akhil S.
Durdevic, Petar
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 277
[47] A Proposed Priority Pushing and Grasping Strategy Based on an Improved Actor-Critic Algorithm
You, Tianya
Wu, Hao
Xu, Xiangrong
Petrovic, Petar B.
Rodic, Aleksandar
ELECTRONICS, 2022, 11 (13)
[48] Cooperative Resource Allocation Based on Soft Actor-Critic With Data Augmentation in Cellular Network
Qin, Yunhui
Zhang, Zhongshan
Wei, Huangfu
Zhang, Haijun
Long, Keping
IEEE WIRELESS COMMUNICATIONS LETTERS, 2023, 12 (03) : 396 - 400
[49] Large-scale Interactive Conversational Recommendation System using Actor-Critic Framework
Montazeralghaem, Ali
Allan, James
Thomas, Philip S.
15TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS 2021), 2021, : 220 - 229
[50] Intelligent control system for droplet volume in inkjet printing based on stochastic state transition soft actor-critic DRL algorithm
Yue, Xiao
Chen, Jiankui
Li, Yiqun
Li, Xin
Zhu, Hong
Yin, Zhouping
JOURNAL OF MANUFACTURING SYSTEMS, 2023, 68 : 455 - 464

← 1 2 3 4 5 →