SOFT ACTOR-CRITIC ALGORITHM WITH ADAPTIVE NORMALIZATION

被引:0
作者
Gao, Xiaonan [1 ]
Wu, Ziyi [1 ]
Zhu, Xianchao [1 ]
Cai, Lei [2 ]
机构
[1] Henan Univ Technol, Sch Artificial Intelligence & Big Data, Zhengzhou 450001, Peoples R China
[2] Henan Inst Sci & Technol, Sch Artificial Intelligence, Xinxiang 453003, Peoples R China
来源
JOURNAL OF NONLINEAR FUNCTIONAL ANALYSIS | 2025年 / 2025卷
基金
中国国家自然科学基金;
关键词
Adaptive normalization; Deep reinforcement learning; Reward mechanism; Soft actor-critic algorithm; GAME; GO;
D O I
10.23952/jnfa.2025.6
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
In recent years, breakthroughs were made in the field of deep reinforcement learning, but, their applications in the real world were seriously affected due to the instability of algorithms and the difficulty in ensuring convergence. As a typical algorithm in reinforcement learning, although the SAC algorithm enhances the robustness and agent's exploration ability by introducing the concept of maximum entropy, it still has the disadvantage of instability in the training process. In order to solve the problems, this paper proposes an Adaptive Normalization-based SAC (AN-SAC) algorithm. By introducing the adaptive normalized reward mechanism into the SAC algorithm, our method can dynamically adjust the normalized parameters of the reward during the training process so that the reward value has zero mean and unit variance. Thus it better adapts to the reward distribution and improves the performance and stability of the algorithm. Experimental results demonstrate that the performance and stability of the AN-SAC algorithm are significantly improved compared with the SAC algorithm.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Soft Actor-Critic Algorithm Featured Residential Demand Response Strategic Bidding for Load Aggregators
    Zhang, Zhenyuan
    Chen, Zihan
    Lee, Wei-Jen
    IEEE TRANSACTIONS ON INDUSTRY APPLICATIONS, 2022, 58 (04) : 4298 - 4308
  • [22] Multi-actor mechanism for actor-critic reinforcement learning
    Li, Lin
    Li, Yuze
    Wei, Wei
    Zhang, Yujia
    Liang, Jiye
    INFORMATION SCIENCES, 2023, 647
  • [23] Residential Demand Response Considered Strategic Bidding for Load Aggregators with Soft Actor-Critic Algorithm
    Zhang, Zhenyuan
    Chen, Zihan
    Lee, Wei-Jen
    2021 IEEE INDUSTRY APPLICATIONS SOCIETY ANNUAL MEETING (IAS), 2021,
  • [24] Evaluate, explain, and explore the state more exactly: an improved Actor-Critic algorithm for complex environment
    Zha, ZhongYi
    Wang, Bo
    Tang, XueSong
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (17) : 12271 - 12282
  • [25] Soft Actor-Critic with Inhibitory Networks for Retraining UAV Controllers Faster
    Choi, Minkyu
    Filter, Max
    Alcedo, Kevin
    Walker, Thayne T.
    Rosenbluth, David
    Ide, Jaime S.
    2022 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS (ICUAS), 2022, : 1561 - 1570
  • [26] An Experience-Guided Deep Deterministic Actor-Critic Algorithm with Multi-Actor
    Chen H.
    Liu Q.
    Yan Y.
    He B.
    Jiang Y.
    Zhang L.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2019, 56 (08): : 1708 - 1720
  • [27] Soft Actor-Critic Algorithm-Based Energy Management Strategy for Plug-In Hybrid Electric Vehicle
    Li, Tao
    Cui, Wei
    Cui, Naxin
    WORLD ELECTRIC VEHICLE JOURNAL, 2022, 13 (10):
  • [28] Incorporating Actor-Critic in Monte Carlo tree search for symbolic regression
    Lu, Qiang
    Tao, Fan
    Zhou, Shuo
    Wang, Zhiguang
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (14) : 8495 - 8511
  • [29] Off-Policy Actor-critic for Recommender Systems
    Chen, Minmin
    Xu, Can
    Gatto, Vince
    Jain, Devanshu
    Kumar, Aviral
    Chi, Ed
    PROCEEDINGS OF THE 16TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2022, 2022, : 338 - 349
  • [30] CONTROLLED SENSING AND ANOMALY DETECTION VIA SOFT ACTOR-CRITIC REINFORCEMENT LEARNING
    Zhong, Chen
    Gursoy, M. Cenk
    Velipasalar, Senem
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4198 - 4202