SOFT ACTOR-CRITIC ALGORITHM WITH ADAPTIVE NORMALIZATION

被引:0
作者
Gao, Xiaonan [1 ]
Wu, Ziyi [1 ]
Zhu, Xianchao [1 ]
Cai, Lei [2 ]
机构
[1] Henan Univ Technol, Sch Artificial Intelligence & Big Data, Zhengzhou 450001, Peoples R China
[2] Henan Inst Sci & Technol, Sch Artificial Intelligence, Xinxiang 453003, Peoples R China
来源
JOURNAL OF NONLINEAR FUNCTIONAL ANALYSIS | 2025年 / 2025卷
基金
中国国家自然科学基金;
关键词
Adaptive normalization; Deep reinforcement learning; Reward mechanism; Soft actor-critic algorithm; GAME; GO;
D O I
10.23952/jnfa.2025.6
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
In recent years, breakthroughs were made in the field of deep reinforcement learning, but, their applications in the real world were seriously affected due to the instability of algorithms and the difficulty in ensuring convergence. As a typical algorithm in reinforcement learning, although the SAC algorithm enhances the robustness and agent's exploration ability by introducing the concept of maximum entropy, it still has the disadvantage of instability in the training process. In order to solve the problems, this paper proposes an Adaptive Normalization-based SAC (AN-SAC) algorithm. By introducing the adaptive normalized reward mechanism into the SAC algorithm, our method can dynamically adjust the normalized parameters of the reward during the training process so that the reward value has zero mean and unit variance. Thus it better adapts to the reward distribution and improves the performance and stability of the algorithm. Experimental results demonstrate that the performance and stability of the AN-SAC algorithm are significantly improved compared with the SAC algorithm.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] An Adaptive Threshold for the Canny Edge With Actor-Critic Algorithm
    Choi, Keong-Hun
    Ha, Jong-Eun
    IEEE ACCESS, 2023, 11 : 67058 - 67069
  • [2] Optimal scheduling of virtual power plant based on Soft Actor-Critic algorithm
    Pan, Pengfei
    Song, Minggang
    Zou, Nan
    Qin, Junhan
    Li, Guangdi
    Ma, Hongyuan
    2024 6TH ASIA ENERGY AND ELECTRICAL ENGINEERING SYMPOSIUM, AEEES 2024, 2024, : 835 - 840
  • [3] The soft actor-critic algorithm for automatic mode-locked fiber lasers
    Li, Jin
    Chang, Kun
    Liu, Congcong
    Ning, Yu
    Ma, Yuansheng
    He, Jiangyong
    Liu, Yange
    Wang, Zhi
    OPTICAL FIBER TECHNOLOGY, 2023, 81
  • [4] A Novel Hierarchical Soft Actor-Critic Algorithm for Multi-Logistics Robots Task Allocation
    Tang, Hengliang
    Wang, Anqi
    Xue, Fei
    Yang, Jiaxin
    Cao, Yang
    IEEE ACCESS, 2021, 9 : 42568 - 42582
  • [5] A soft actor-critic reinforcement learning algorithm for network intrusion detection
    Li, Zhengfa
    Huang, Chuanhe
    Deng, Shuhua
    Qiu, Wanyu
    Gao, Xieping
    COMPUTERS & SECURITY, 2023, 135
  • [6] Soft Actor-Critic for Navigation of Mobile Robots
    de Jesus, Junior Costa
    Kich, Victor Augusto
    Kolling, Alisson Henrique
    Grando, Ricardo Bedin
    Cuadros, Marco Antonio de Souza Leite
    Gamarra, Daniel Fernando Tello
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2021, 102 (02)
  • [7] Soft Actor-Critic for Navigation of Mobile Robots
    Junior Costa de Jesus
    Victor Augusto Kich
    Alisson Henrique Kolling
    Ricardo Bedin Grando
    Marco Antonio de Souza Leite Cuadros
    Daniel Fernando Tello Gamarra
    Journal of Intelligent & Robotic Systems, 2021, 102
  • [8] Energy optimization management of microgrid using improved soft actor-critic algorithm
    Yu, Zhiwen
    Zheng, Wenjie
    Zeng, Kaiwen
    Zhao, Ruifeng
    Zhang, Yanxu
    Zeng, Mengdi
    INTERNATIONAL JOURNAL OF RENEWABLE ENERGY DEVELOPMENT-IJRED, 2024, 13 (02): : 329 - 339
  • [9] Mapless Navigation for Mobile Robots Based on Improved Soft Actor-Critic Algorithm
    Yang, Binglin
    Wang, Hongwei
    Xia, Hao
    39TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION, YAC 2024, 2024, : 755 - 761
  • [10] The Effect of Discounting Actor-loss in Actor-Critic Algorithm
    Yaputra, Jordi
    Suyanto, Suyanto
    2021 4TH INTERNATIONAL SEMINAR ON RESEARCH OF INFORMATION TECHNOLOGY AND INTELLIGENT SYSTEMS (ISRITI 2021), 2020,