SOFT ACTOR-CRITIC ALGORITHM WITH ADAPTIVE NORMALIZATION

被引：0

作者：

Gao, Xiaonan ^{[1
]}

Wu, Ziyi ^{[1
]}

Zhu, Xianchao ^{[1
]}

Cai, Lei ^{[2
]}

机构：

[1] Henan Univ Technol, Sch Artificial Intelligence & Big Data, Zhengzhou 450001, Peoples R China

[2] Henan Inst Sci & Technol, Sch Artificial Intelligence, Xinxiang 453003, Peoples R China

来源：

JOURNAL OF NONLINEAR FUNCTIONAL ANALYSIS | 2025年 / 2025卷

基金：

中国国家自然科学基金;

关键词：

Adaptive normalization; Deep reinforcement learning; Reward mechanism; Soft actor-critic algorithm; GAME; GO;

D O I：

10.23952/jnfa.2025.6

中图分类号：

O29 [应用数学];

学科分类号：

070104 ;

摘要：

In recent years, breakthroughs were made in the field of deep reinforcement learning, but, their applications in the real world were seriously affected due to the instability of algorithms and the difficulty in ensuring convergence. As a typical algorithm in reinforcement learning, although the SAC algorithm enhances the robustness and agent's exploration ability by introducing the concept of maximum entropy, it still has the disadvantage of instability in the training process. In order to solve the problems, this paper proposes an Adaptive Normalization-based SAC (AN-SAC) algorithm. By introducing the adaptive normalized reward mechanism into the SAC algorithm, our method can dynamically adjust the normalized parameters of the reward during the training process so that the reward value has zero mean and unit variance. Thus it better adapts to the reward distribution and improves the performance and stability of the algorithm. Experimental results demonstrate that the performance and stability of the AN-SAC algorithm are significantly improved compared with the SAC algorithm.

引用

页数：10

共 50 条

[21] Soft Actor-Critic Algorithm Featured Residential Demand Response Strategic Bidding for Load Aggregators
Zhang, Zhenyuan
Chen, Zihan
Lee, Wei-Jen
IEEE TRANSACTIONS ON INDUSTRY APPLICATIONS, 2022, 58 (04) : 4298 - 4308
[22] Multi-actor mechanism for actor-critic reinforcement learning
Li, Lin
Li, Yuze
Wei, Wei
Zhang, Yujia
Liang, Jiye
INFORMATION SCIENCES, 2023, 647
[23] Residential Demand Response Considered Strategic Bidding for Load Aggregators with Soft Actor-Critic Algorithm
Zhang, Zhenyuan
Chen, Zihan
Lee, Wei-Jen
2021 IEEE INDUSTRY APPLICATIONS SOCIETY ANNUAL MEETING (IAS), 2021,
[24] Evaluate, explain, and explore the state more exactly: an improved Actor-Critic algorithm for complex environment
Zha, ZhongYi
Wang, Bo
Tang, XueSong
NEURAL COMPUTING & APPLICATIONS, 2023, 35 (17) : 12271 - 12282
[25] Soft Actor-Critic with Inhibitory Networks for Retraining UAV Controllers Faster
Choi, Minkyu
Filter, Max
Alcedo, Kevin
Walker, Thayne T.
Rosenbluth, David
Ide, Jaime S.
2022 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS (ICUAS), 2022, : 1561 - 1570
[26] An Experience-Guided Deep Deterministic Actor-Critic Algorithm with Multi-Actor
Chen H.
Liu Q.
Yan Y.
He B.
Jiang Y.
Zhang L.
Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2019, 56 (08): : 1708 - 1720
[27] Soft Actor-Critic Algorithm-Based Energy Management Strategy for Plug-In Hybrid Electric Vehicle
Li, Tao
Cui, Wei
Cui, Naxin
WORLD ELECTRIC VEHICLE JOURNAL, 2022, 13 (10):
[28] Incorporating Actor-Critic in Monte Carlo tree search for symbolic regression
Lu, Qiang
Tao, Fan
Zhou, Shuo
Wang, Zhiguang
NEURAL COMPUTING & APPLICATIONS, 2021, 33 (14) : 8495 - 8511
[29] Off-Policy Actor-critic for Recommender Systems
Chen, Minmin
Xu, Can
Gatto, Vince
Jain, Devanshu
Kumar, Aviral
Chi, Ed
PROCEEDINGS OF THE 16TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2022, 2022, : 338 - 349
[30] CONTROLLED SENSING AND ANOMALY DETECTION VIA SOFT ACTOR-CRITIC REINFORCEMENT LEARNING
Zhong, Chen
Gursoy, M. Cenk
Velipasalar, Senem
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4198 - 4202

← 1 2 3 4 5 →