Online regulation control of pulsed power loads via supercapacitor with deep reinforcement learning utilizing a long short-term memory network and attention mechanism

被引:1
作者
Shang, Chengya [1 ]
Fu, Lijun [1 ]
Xiao, Haipeng [1 ]
Lin, Yunfeng [1 ]
机构
[1] Naval Univ Engn, Natl Key Lab Electromagnet Energy, Wuhan 430033, Peoples R China
关键词
Attention mechanism; Deep reinforcement learning; Pulse power load; Ship power system; Supercapacitor regulation control; ENERGY-STORAGE; COORDINATION;
D O I
10.1016/j.est.2024.114080
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
The integration of pulse power loads (PPLs) presents substantial challenges to the stable operation of DC ship power systems (SPSs). However, current model-based control strategies, both offline and online, are susceptible to the impact of model inaccuracies or parameter uncertainties. This article proposes a novel deep reinforcement learning (DRL) method to address PPL online regulation problem in real-time for SPS. While considering the charging current's ramp-up limitation for supercapacitor-based energy storage systems (ESSs), the PPL online regulation model is formulated with the goal of the fast charging of supercapacitor, rapid regulation of bus voltage, and proportional distribution of generator load current. Then, a twin delayed deep deterministic policy gradient (TD3) combined with a bi-directional long short-term memory (Bi-LSTM) network and an attention mechanism (AM), referred to as Bi-LSTM-AM-TD3 algorithm, is applied to optimize the generator output voltage and ESS charging current. The proposed method can improve the feature extraction ability of agents from state data and enhance their control performance. A case study is analyzed based on historical operational dataset of a DC SPS. The numerical results indicate that the proposed method improves the reward by 8.43 %, 9.72 %, and 20.16 % compared to TD3, DDPG, and PI-based methods, respectively. Additionally, it shows a 5.75 % improvement in the reward and a 23.19 % reduction in convergence time compared to the agent without AM. The effectiveness of the proposed method under continuous PPL scenarios and migration scenarios is also validated. Finally, we test the algorithm's performance on a laboratory-scale platform.
引用
收藏
页数:19
相关论文
共 41 条
  • [1] Short-Term Photovoltaic Power Forecasting Based on Long Short Term Memory Neural Network and Attention Mechanism
    Zhou, Hangxia
    Zhang, Yujin
    Yang, Lingfan
    Liu, Qian
    Yan, Ke
    Du, Yang
    IEEE ACCESS, 2019, 7 : 78063 - 78074
  • [2] Attention meets long short-term memory: A deep learning network for traffic flow forecasting
    Fang, Weiwei
    Zhuo, Wenhao
    Yan, Jingwen
    Song, Youyi
    Jiang, Dazhi
    Zhou, Teng
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2022, 587
  • [3] Short-term wind power forecasting based on Attention Mechanism and Deep Learning
    Xiong, Bangru
    Lou, Lu
    Meng, Xinyu
    Wang, Xin
    Ma, Hui
    Wang, Zhengxia
    ELECTRIC POWER SYSTEMS RESEARCH, 2022, 206
  • [4] Energy Procurement and Retail Pricing for Electricity Retailers via Deep Reinforcement Learning with Long Short-term Memory
    Xu, Hongsheng
    Wen, Jinyu
    Hu, Qinran
    Shu, Jiao
    Lu, Jixiang
    Yang, Zhihong
    CSEE JOURNAL OF POWER AND ENERGY SYSTEMS, 2022, 8 (05): : 1338 - 1351
  • [5] Sentiment classification using attention mechanism and bidirectional long short-term memory network
    Wu, Peng
    Li, Xiaotong
    Ling, Chen
    Ding, Shengchun
    Shen, Si
    APPLIED SOFT COMPUTING, 2021, 112
  • [6] Shear Wave Velocity Prediction Based on the Long Short-Term Memory Network with Attention Mechanism
    Fu, Xingan
    Wei, Youhua
    Su, Yun
    Hu, Haixia
    APPLIED SCIENCES-BASEL, 2024, 14 (06):
  • [7] Enhancing Anomaly Detection for Cultural Heritage via Long Short-Term Memory with Attention Mechanism
    Wu, Yuhan
    Dong, Yabo
    Shan, Zeyang
    Meng, Xiyu
    He, Yang
    Jia, Ping
    Lu, Dongming
    ELECTRONICS, 2024, 13 (07)
  • [8] Knowledge Tracing with Contrastive Learning and Attention-Based Long Short-Term Memory Network
    Xu, Liancheng
    Guo, Lihua
    Wu, Xiaoqi
    Wang, Xinhua
    Guo, Lei
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT IV, ICIC 2024, 2024, 14865 : 25 - 36
  • [9] Short-term wind power forecasting with the integration of a deep error feedback learning and attention mechanism
    Hu Y.
    Zhu L.
    Li J.
    Li Y.
    Zeng Y.
    Zheng L.
    Shuai Z.
    Dianli Xitong Baohu yu Kongzhi/Power System Protection and Control, 2024, 52 (04): : 100 - 108
  • [10] Short-term wind power prediction framework using numerical weather predictions and residual convolutional long short-term memory attention network
    Xie, Chenlei
    Yang, Xuelei
    Chen, Tao
    Fang, Qiansheng
    Wang, Jie
    Shen, Yan
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133