Online regulation control of pulsed power loads via supercapacitor with deep reinforcement learning utilizing a long short-term memory network and attention mechanism

被引：1

作者：

Shang, Chengya ^{[1
]}

Fu, Lijun ^{[1
]}

Xiao, Haipeng ^{[1
]}

Lin, Yunfeng ^{[1
]}

机构：

[1] Naval Univ Engn, Natl Key Lab Electromagnet Energy, Wuhan 430033, Peoples R China

来源：

JOURNAL OF ENERGY STORAGE | 2024年 / 102卷

关键词：

Attention mechanism; Deep reinforcement learning; Pulse power load; Ship power system; Supercapacitor regulation control; ENERGY-STORAGE; COORDINATION;

D O I：

10.1016/j.est.2024.114080

中图分类号：

TE [石油、天然气工业]; TK [能源与动力工程];

学科分类号：

0807 ; 0820 ;

摘要：

The integration of pulse power loads (PPLs) presents substantial challenges to the stable operation of DC ship power systems (SPSs). However, current model-based control strategies, both offline and online, are susceptible to the impact of model inaccuracies or parameter uncertainties. This article proposes a novel deep reinforcement learning (DRL) method to address PPL online regulation problem in real-time for SPS. While considering the charging current's ramp-up limitation for supercapacitor-based energy storage systems (ESSs), the PPL online regulation model is formulated with the goal of the fast charging of supercapacitor, rapid regulation of bus voltage, and proportional distribution of generator load current. Then, a twin delayed deep deterministic policy gradient (TD3) combined with a bi-directional long short-term memory (Bi-LSTM) network and an attention mechanism (AM), referred to as Bi-LSTM-AM-TD3 algorithm, is applied to optimize the generator output voltage and ESS charging current. The proposed method can improve the feature extraction ability of agents from state data and enhance their control performance. A case study is analyzed based on historical operational dataset of a DC SPS. The numerical results indicate that the proposed method improves the reward by 8.43 %, 9.72 %, and 20.16 % compared to TD3, DDPG, and PI-based methods, respectively. Additionally, it shows a 5.75 % improvement in the reward and a 23.19 % reduction in convergence time compared to the agent without AM. The effectiveness of the proposed method under continuous PPL scenarios and migration scenarios is also validated. Finally, we test the algorithm's performance on a laboratory-scale platform.

引用

页数：19

共 41 条

[1] Short-Term Photovoltaic Power Forecasting Based on Long Short Term Memory Neural Network and Attention Mechanism
Zhou, Hangxia
Zhang, Yujin
Yang, Lingfan
Liu, Qian
Yan, Ke
Du, Yang
IEEE ACCESS, 2019, 7 : 78063 - 78074
[2] Attention meets long short-term memory: A deep learning network for traffic flow forecasting
Fang, Weiwei
Zhuo, Wenhao
Yan, Jingwen
Song, Youyi
Jiang, Dazhi
Zhou, Teng
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2022, 587
[3] Short-term wind power forecasting based on Attention Mechanism and Deep Learning
Xiong, Bangru
Lou, Lu
Meng, Xinyu
Wang, Xin
Ma, Hui
Wang, Zhengxia
ELECTRIC POWER SYSTEMS RESEARCH, 2022, 206
[4] Energy Procurement and Retail Pricing for Electricity Retailers via Deep Reinforcement Learning with Long Short-term Memory
Xu, Hongsheng
Wen, Jinyu
Hu, Qinran
Shu, Jiao
Lu, Jixiang
Yang, Zhihong
CSEE JOURNAL OF POWER AND ENERGY SYSTEMS, 2022, 8 (05): : 1338 - 1351
[5] Sentiment classification using attention mechanism and bidirectional long short-term memory network
Wu, Peng
Li, Xiaotong
Ling, Chen
Ding, Shengchun
Shen, Si
APPLIED SOFT COMPUTING, 2021, 112
[6] Shear Wave Velocity Prediction Based on the Long Short-Term Memory Network with Attention Mechanism
Fu, Xingan
Wei, Youhua
Su, Yun
Hu, Haixia
APPLIED SCIENCES-BASEL, 2024, 14 (06):
[7] Enhancing Anomaly Detection for Cultural Heritage via Long Short-Term Memory with Attention Mechanism
Wu, Yuhan
Dong, Yabo
Shan, Zeyang
Meng, Xiyu
He, Yang
Jia, Ping
Lu, Dongming
ELECTRONICS, 2024, 13 (07)
[8] Knowledge Tracing with Contrastive Learning and Attention-Based Long Short-Term Memory Network
Xu, Liancheng
Guo, Lihua
Wu, Xiaoqi
Wang, Xinhua
Guo, Lei
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT IV, ICIC 2024, 2024, 14865 : 25 - 36
[9] Short-term wind power forecasting with the integration of a deep error feedback learning and attention mechanism
Hu Y.
Zhu L.
Li J.
Li Y.
Zeng Y.
Zheng L.
Shuai Z.
Dianli Xitong Baohu yu Kongzhi/Power System Protection and Control, 2024, 52 (04): : 100 - 108
[10] Short-term wind power prediction framework using numerical weather predictions and residual convolutional long short-term memory attention network
Xie, Chenlei
Yang, Xuelei
Chen, Tao
Fang, Qiansheng
Wang, Jie
Shen, Yan
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133

← 1 2 3 4 5 →