Deep reinforcement learning approach for MPPT control of partially shaded PV systems in Smart Grids

被引:55
作者
Avila, Luis [1 ]
De Paula, Mariano [2 ]
Trimboli, Maximiliano [3 ]
Carlucho, Ignacio [4 ]
机构
[1] CONICET UNSL, Lab Invest & Desarrollo Inteligencia Computac LID, Av Ejercito Andes 950,D5700HHW, San Luis, Argentina
[2] Consejo Nacl Invest Cient & Tecn, INTELYMEC, Ctr Invest Fis & Ingn, Ctr CIFICEN,UNICEN,CICpBA, Av Valle 5737, RA-7400 Olavarria, Argentina
[3] CONICET UNSL, Lab Control Automat LCA, Ruta Prov 55,D5730, San Luis, Argentina
[4] Louisiana State Univ, Dept Mech Engn, Baton Rouge, LA 70803 USA
关键词
MPPT; Deep RL; PV systems; OpenAI Gym; POWER POINT TRACKING; PARTICLE SWARM OPTIMIZATION; LEVEL CONTROL; FUZZY-LOGIC; PERTURB; ALGORITHM;
D O I
10.1016/j.asoc.2020.106711
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Photovoltaic systems (PV) are having an increased importance in modern smart grids systems. Usually, in order to maximize the energy output of the PV arrays a maximum power point tracking (MPPT) algorithm is used. However, once deployed, weather conditions such as clouds can cause shades in the PV arrays affecting the dynamics of each panel differently. These conditions directly affect the available energy output of the arrays and in turn make the MPPT task extremely difficult. For these reasons, under partial shading conditions, it is necessary to have algorithms that are able to learn and adapt online to the changing state of the system. In this work we propose the use of deep reinforcement learning (DRL) techniques to address the MPPT problem of a PV array under partial shading conditions. We develop a model free RL algorithm to maximize the efficiency in MPPT control. The agent's policy is parameterized by neural networks, which take the sensory information as input and directly output the control signal. Furthermore, a PV environment under shading conditions was developed in the open source OpenAI Gym platform and is made available in an open repository. Several tests are performed, using the developed simulated environment, to test the robustness of the proposed control strategies to different climate conditions. The obtained results show the feasibility of our proposal with a successful performance with fast responses and stable behaviors. The best results for the presented methodology show that the maximum operating power point achieved has a deviation less than 1% compared to the theoretical maximum power point. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:14
相关论文
共 51 条
[1]  
Abo-Al-Ez KM, 2020, GREEN ENERGY TECHNOL, P199, DOI 10.1007/978-3-030-05578-3_7
[2]   An improved perturb and observe (P&O) maximum power point tracking (MPPT) algorithm for higher efficiency [J].
Ahmed, Jubaer ;
Salam, Zainal .
APPLIED ENERGY, 2015, 150 :97-108
[3]  
[Anonymous], 2016, PROC INT C LEARNING
[4]   Particle Swarm Optimization Based Solar PV Array Reconfiguration of the Maximum Power Extraction Under Partial Shading Conditions [J].
Babu, Thanikanti Sudhakar ;
Ram, J. Prasanth ;
Dragicevic, Tomislav ;
Miyatake, Masafumi ;
Blaabjerg, Frede ;
Rajasekar, Natarajan .
IEEE TRANSACTIONS ON SUSTAINABLE ENERGY, 2018, 9 (01) :74-85
[5]   A new MPPT-based ANN for photovoltaic system under partial shading conditions [J].
Bouselham, Loubna ;
Hajji, Mohammed ;
Hajji, Bekkay ;
Bouali, Hicham .
8TH INTERNATIONAL CONFERENCE ON SUSTAINABILITY IN ENERGY AND BUILDINGS, SEB-16, 2017, 111 :934-943
[6]  
Brockman Greg, 2016, arXiv
[7]   Adaptive low-level control of autonomous underwater vehicles using deep reinforcement learning [J].
Carlucho, Ignacio ;
De Paula, Mariano ;
Wang, Sen ;
Petillot, Yvan ;
Acosta, Gerardo G. .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2018, 107 :71-86
[8]   Maximum Power Point Tracking of Photovoltaic System Based on Reinforcement Learning [J].
Chou, Kuan-Yu ;
Yang, Shu-Ting ;
Chen, Yon-Ping .
SENSORS, 2019, 19 (22)
[9]   Comparative and comprehensive review of maximum power point tracking methods for PV cells [J].
Danandeh, M. A. ;
Mousavi G., S. M. .
RENEWABLE & SUSTAINABLE ENERGY REVIEWS, 2018, 82 :2743-2767
[10]  
Degris T, 2012, P AMER CONTR CONF, P2177