Optimal model-free adaptive control based on reinforcement Q-Learning for solar thermal collector fields

被引:7
作者
Pataro, Igor M. L. [1 ]
Cunha, Rita [2 ]
Gil, Juan D. [1 ]
Guzman, Jose L. [1 ]
Berenguel, Manuel [1 ]
Lemos, Joao M. [3 ]
机构
[1] Univ Almeria, Ctr Mixto CIESOL, ceiA3,Ctra Sacramento S-N, Almeria 04120, Spain
[2] Univ Lisbon, Inst Syst & Robot ISR Lisboa, LARSyS, Inst Super Tecn, Lisbon, Lisboa, Portugal
[3] Univ Lisbon, INESC ID, Inst Super Tecn, Lisbon, Portugal
关键词
Reinforcement learning; Q-Learning; Optimal control; Solar plant; ENERGY; VALIDATION; OPERATION; SYSTEMS;
D O I
10.1016/j.engappai.2023.106785
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This study addresses the challenge and related difficulties of controlling solar collector fields (SCFs) using high-complex models by proposing an adaptive optimal model-free controller based on the Reinforcement Q-Learning algorithm. The controller aims to achieve optimal performance using only plant measurements, particularly the working fluid temperature, flow rate, and reference trajectory. The proposed solution offers advantages over model-based controllers, as it can handle nonlinearities, time-varying model parameters, and computational costs associated with nonlinear models. Additionally, the Q-Learning algorithm requires a small amount of data, overcoming data-driven solutions presented in the literature till now for SCF systems. To ensure the controller performance, simulations are conducted using a validated SCF model based on actual data from the CIESOL thermal plant located in Almeria, Spain. The results demonstrate the effectiveness and usefulness of the model-free controller as the Q-Learning algorithm converges to the optimal gains of the Linear Quadratic Tracking (LQT) controller, exhibiting adaptive optimal response. Furthermore, the Q-Learning strategy outperforms the LQT controller in managing steady-state errors, particularly on sunny days with smooth solar irradiance variations. The average discrepancy in gains computed with reinforcement learning in a one-day campaign trial considering 18000 data is less than 15% compared to optimal LQT gains. This model-free approach proves its value and can easily be extended to different SCF systems without altering the general control formulation.
引用
收藏
页数:16
相关论文
共 64 条
[1]   Climate action: Prospects of solar energy in Africa [J].
Abdelrazik, Maryam K. ;
Abdelaziz, Sara E. ;
Hassan, Mariam F. ;
Hatem, Tarek M. .
ENERGY REPORTS, 2022, 8 :11363-11377
[2]   Application of deep Q-networks for model-free optimal control balancing between different HVAC systems [J].
Ahn, Ki Uhn ;
Park, Cheol Soo .
SCIENCE AND TECHNOLOGY FOR THE BUILT ENVIRONMENT, 2020, 26 (01) :61-74
[3]   A survey on control issues in renewable energy integration and microgrid [J].
Badal, Faisal R. ;
Das, Purnima ;
Sarker, Subrata K. ;
Das, Sajal K. .
PROTECTION AND CONTROL OF MODERN POWER SYSTEMS, 2019, 4 (01)
[4]  
Bara O., 2017, 2017 N AM POWER S, P1, DOI [10.1109/NAPS.2017.8107328, DOI 10.1109/NAPS.2017.8107328]
[5]   Development and validation of a detailed TRNSYS-Matlab model for large solar collector fields for district heating applications [J].
Bava, Federico ;
Furbo, Simon .
ENERGY, 2017, 135 :698-708
[6]   Optimal control analysis and Practical NMPC applied to refrigeration systems [J].
Bejarano, G. ;
Ortega, M. G. ;
Normey-Rico, J. E. ;
Rubio, F. R. .
ISA TRANSACTIONS, 2020, 107 :90-106
[7]   DYNAMIC PROGRAMMING [J].
BELLMAN, R .
SCIENCE, 1966, 153 (3731) :34-&
[8]   Optimal model-free control of solar thermal membrane distillation system [J].
Bendevis, Paul ;
Karam, Ayman ;
Laleg-Kirati, Taous-Meriem .
COMPUTERS & CHEMICAL ENGINEERING, 2020, 133
[9]   Analysis and Adaptation of Q-Learning Algorithm to Expert Controls of a Solar Domestic Hot Water System [J].
Bettoni, Davide ;
Soppelsa, Anton ;
Fedrizzi, Roberto ;
del Toro Matamoros, Raul Mario .
APPLIED SYSTEM INNOVATION, 2019, 2 (02) :1-25
[10]   Experimental evaluation of model-free reinforcement learning algorithms for continuous HVAC control [J].
Biemann, Marco ;
Scheller, Fabian ;
Liu, Xiufeng ;
Huang, Lizhen .
APPLIED ENERGY, 2021, 298