SOFT ACTOR-CRITIC ALGORITHM WITH ADAPTIVE NORMALIZATION

Times cited: 0
Authors
Gao, Xiaonan [1]
Wu, Ziyi [1]
Zhu, Xianchao [1]
Cai, Lei [2]
Affiliations
[1] Henan Univ Technol, Sch Artificial Intelligence & Big Data, Zhengzhou 450001, Peoples R China
[2] Henan Inst Sci & Technol, Sch Artificial Intelligence, Xinxiang 453003, Peoples R China
Source
JOURNAL OF NONLINEAR FUNCTIONAL ANALYSIS | Year 2025 / Vol. 2025
Funding
National Natural Science Foundation of China;
Keywords
Adaptive normalization; Deep reinforcement learning; Reward mechanism; Soft actor-critic algorithm; GAME; GO;
DOI
10.23952/jnfa.2025.6
Chinese Library Classification number
O29 [Applied Mathematics];
Discipline classification code
070104 ;
Abstract
In recent years, deep reinforcement learning has seen major breakthroughs, but its real-world application has been seriously hampered by the instability of the algorithms and the difficulty of ensuring convergence. The soft actor-critic (SAC) algorithm, a typical reinforcement learning algorithm, improves robustness and the agent's exploration ability by introducing the concept of maximum entropy, yet it still suffers from instability during training. To address these problems, this paper proposes an Adaptive Normalization-based SAC (AN-SAC) algorithm. By introducing an adaptive normalized reward mechanism into SAC, the method dynamically adjusts the normalization parameters of the reward during training so that the reward has zero mean and unit variance. It thus adapts better to the reward distribution and improves the performance and stability of the algorithm. Experimental results demonstrate that the performance and stability of the AN-SAC algorithm are significantly improved compared with the SAC algorithm.
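The abstract does not give the AN-SAC update rule, so the following is only a minimal sketch of the general idea of adaptive reward normalization: keep running estimates of the reward mean and variance and rescale each reward toward zero mean and unit variance. The class name `RunningRewardNormalizer`, the use of Welford's online algorithm, and the `eps` parameter are all assumptions made for illustration, not the authors' method.

```python
import math


class RunningRewardNormalizer:
    """Hypothetical helper: running mean/variance of rewards via Welford's algorithm."""

    def __init__(self, eps: float = 1e-8):
        self.count = 0      # number of rewards seen so far
        self.mean = 0.0     # running mean of the rewards
        self.m2 = 0.0       # running sum of squared deviations from the mean
        self.eps = eps      # assumed small constant to avoid division by zero

    def update(self, reward: float) -> None:
        """Fold one new reward into the running statistics."""
        self.count += 1
        delta = reward - self.mean
        self.mean += delta / self.count
        self.m2 += delta * (reward - self.mean)

    @property
    def std(self) -> float:
        """Population standard deviation; 1.0 until two rewards have been seen."""
        if self.count < 2:
            return 1.0
        return math.sqrt(self.m2 / self.count)

    def normalize(self, reward: float) -> float:
        """Rescale a reward using the current running statistics."""
        return (reward - self.mean) / (self.std + self.eps)
```

In a SAC training loop, such a normalizer would typically be updated with each environment reward and applied before the reward is used in the critic target; whether AN-SAC normalizes at collection time or at sampling time is not stated in the abstract.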
Pages: 10