Harvesting Optimal Operation Strategies from Historical Data for Solar Thermal Power Plants Using Reinforcement Learning

被引:0
作者
Zeng, Zhichen [1 ]
Ni, Dong [1 ]
机构
[1] Zhejiang Univ, Coll Control Sci & Engn, 38 Zheda Rd, Hangzhou, Peoples R China
来源
SOLARPACES 2020 - 26TH INTERNATIONAL CONFERENCE ON CONCENTRATING SOLAR POWER AND CHEMICAL ENERGY SYSTEMS | 2022年 / 2445卷
关键词
OPTIMIZATION;
D O I
10.1063/5.0085691
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
Operation strategy optimization for concentrating solar power (CSP) plants has been a long-studied topic in solar energy. In our work, an effective and systematic approach has been developed to harvest optimal operation strategies for CSP plants via reinforcement learning (RL). Our goal is to find the optimal strategy, which instructs the operation of different key operating variables of CSP plants in order to maximize daily power generation under a given solar irradiation series. Key variables are extracted through data mining techniques, and Deep Q-Network (DQN) is applied to find the optimal solution to the problem. A case study based on a 10MW central tower receiver solar thermal plant's operating data is carried out, which shows great improvement in power generation through our proposed method.
引用
收藏
页数:9
相关论文
共 17 条
  • [1] Principal component analysis
    Abdi, Herve
    Williams, Lynne J.
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2010, 2 (04): : 433 - 459
  • [2] Bertsekas D. P., 1995, DYNAMIC PROGRAMMING, V1
  • [3] A survey on control schemes for distributed solar collector fields. Part 1: Modeling and basic control approaches
    Camacho, E. F.
    Rubio, F. R.
    Berenguel, M.
    Valenzuela, L.
    [J]. SOLAR ENERGY, 2007, 81 (10) : 1240 - 1251
  • [4] Techno-Economic Analysis of a Central Receiver Power Plant with Different Levels of Storage Capacity and Hybridization
    Costa, Daniel L. C.
    Neto, Jose H. M.
    [J]. INTERNATIONAL CONFERENCE ON CONCENTRATING SOLAR POWER AND CHEMICAL ENERGY SYSTEMS (SOLARPACES 2017), 2018, 2033
  • [5] MARKOV APPROACH FOR PREDICTIVE MODELING AND PERFORMANCE EVALUATION OF A THERMAL POWER PLANT
    Gupta, Sorabh
    Tewari, P. C.
    [J]. INTERNATIONAL JOURNAL OF RELIABILITY QUALITY & SAFETY ENGINEERING, 2010, 17 (01) : 41 - 55
  • [6] Reinforcement learning: A survey
    Kaelbling, LP
    Littman, ML
    Moore, AW
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1996, 4 : 237 - 285
  • [7] A study of a kanban based assembly line feeding system through integration of simulation and particle swarm optimization
    Kundu, Kaustav
    Rossini, Matteo
    Portioli-Staudacher, Alberto
    [J]. INTERNATIONAL JOURNAL OF INDUSTRIAL ENGINEERING COMPUTATIONS, 2019, 10 (03) : 421 - 442
  • [8] Two-dimensional dynamic PCA for batch process monitoring
    Lu, NY
    Yao, Y
    Gao, FR
    Wang, FL
    [J]. AICHE JOURNAL, 2005, 51 (12) : 3300 - 3304
  • [9] Novel optimization design strategy for solar power tower plants
    Luo, Yan
    Lu, Tao
    Du, Xiaoze
    [J]. ENERGY CONVERSION AND MANAGEMENT, 2018, 177 : 682 - 692
  • [10] Human-level control through deep reinforcement learning
    Mnih, Volodymyr
    Kavukcuoglu, Koray
    Silver, David
    Rusu, Andrei A.
    Veness, Joel
    Bellemare, Marc G.
    Graves, Alex
    Riedmiller, Martin
    Fidjeland, Andreas K.
    Ostrovski, Georg
    Petersen, Stig
    Beattie, Charles
    Sadik, Amir
    Antonoglou, Ioannis
    King, Helen
    Kumaran, Dharshan
    Wierstra, Daan
    Legg, Shane
    Hassabis, Demis
    [J]. NATURE, 2015, 518 (7540) : 529 - 533