A Novel Nonlinear Deep Reinforcement Learning Controller for DC-DC Power Buck Converters

Cited by: 88

Authors:

Gheisarnejad, Meysam [1]

Farsizadeh, Hamed [2]

Khooban, Mohammad Hassan [3]

Affiliations:

[1] Islamic Azad Univ, Najafabad Branch, Dept Elect Engn, Esfahan, Iran

[2] Shiraz Univ Technol, Shiraz 25529, Iran

[3] Aarhus Univ, Dept Engn, DIGIT, Finlandsgade 22, DK-8200 Aarhus, Denmark

Source:

IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS | 2021 / Vol. 68 / No. 08

Keywords:

Buck converters; Observers; Fuel cells; Mathematical model; Voltage control; Capacitors; Reinforcement learning; Buck converter; constant power load (CPL); deep deterministic policy gradient (DDPG); sliding mode (SM) observer; ultralocal model (ULM); SLIDING-MODE CONTROL; CONSTANT; SYSTEMS; STABILITY; LOADS;

DOI:

10.1109/TIE.2020.3005071

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Nonlinearities and unmodeled dynamics inevitably degrade the quality and reliability of power conversion and, as a result, pose major challenges to high-performance voltage stabilization of dc-dc buck converters. The stability of such power electronic equipment is further threatened when it feeds nonideal constant power loads (CPLs), because of the negative impedance characteristics these loads induce. In response to these challenges, advanced regulation and technological mechanisms for the converters need to be developed so that these interface systems can be implemented efficiently in a microgrid configuration. This article presents an intelligent proportional-integral (iPI) controller based on a sliding mode (SM) observer, formulated in the ultralocal model sense, to mitigate the destructive impedance instabilities of time-varying nonideal CPLs. In particular, an auxiliary deep deterministic policy gradient (DDPG) controller is adaptively developed to decrease the observer estimation error and further improve the dynamic characteristics of dc-dc buck converters. The DDPG design has two parts: (i) an actor network, which generates the policy commands, and (ii) a critic network, which evaluates the quality of the policy commands generated by the actor. The suggested strategy uses the DDPG-based control to handle what the iPI-based SM observer is unable to compensate for. In this application, the weight coefficients of the actor and critic networks are trained by gradient descent, based on the reward feedback of the voltage error. Finally, to investigate the merits and implementation feasibility of the suggested method, experimental results are presented for a laboratory prototype of the dc-dc buck converter feeding a time-varying CPL.
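The abstract gives no equations, so the following is only a minimal sketch of the iPI idea under stated assumptions, not the authors' implementation. It assumes the standard first-order ultralocal model dy/dt = F + alpha*u, in which F lumps together all unknown dynamics. The sliding mode observer is swapped for a plain finite-difference estimate of F, and the DDPG compensator is omitted entirely. The plant, the gains, and the names `simulate_ipi` and `plant_derivative` are hypothetical and chosen only for illustration.

```python
def simulate_ipi(v_ref=12.0, kp=50.0, ki=600.0, alpha=1.5, dt=1e-3, steps=1000):
    """Track v_ref with an iPI law built on the ultralocal model dy/dt = F + alpha*u."""

    def plant_derivative(y, u):
        # Hypothetical first-order plant (NOT a buck converter model).
        # The controller never uses this formula, only measurements of y.
        return -2.0 * y + 3.0 + 1.5 * u

    y = 0.0          # measured output (stand-in for the output voltage)
    y_prev = 0.0     # previous measurement
    u_prev = 0.0     # previous control input
    integral = 0.0   # running integral of the tracking error
    history = []
    for _ in range(steps):
        # Assumption: estimate the lumped term F by finite differences,
        # standing in for the sliding mode observer named in the abstract.
        f_hat = (y - y_prev) / dt - alpha * u_prev
        error = v_ref - y
        integral += error * dt
        # iPI law: cancel the estimate of F, then impose PI error dynamics.
        # The reference is constant, so its derivative term is zero.
        u = (-f_hat + kp * error + ki * integral) / alpha
        y_prev, u_prev = y, u
        # Forward-Euler step of the hypothetical plant.
        y = y + dt * plant_derivative(y, u)
        history.append(y)
    return history


final_voltage = simulate_ipi()[-1]
```

With these illustrative gains the tracking error decays within the simulated one second, so `final_voltage` ends up close to `v_ref`. In the scheme the abstract describes, the DDPG agent would supply an additional control term to cover whatever estimation error the observer leaves behind.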


Pages: 6849-6858

Page count: 10
