An Intelligent Non-Integer PID Controller-Based Deep Reinforcement Learning: Implementation and Experimental Results

Cited by: 71

Authors:

Gheisarnejad, Meysam [1]

Khooban, Mohammad Hassan [2]

Affiliations:

[1] Islamic Azad Univ, Dept Elect Engn, Najafabad Branch, Esfahan 1477893855, Iran

[2] Aarhus Univ, Dept Engn, DIGIT, DK-8200 Aarhus, Denmark

Source:

IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS | 2021 / Vol. 68 / No. 04

Keywords:

Mobile robots; Vehicle dynamics; Kinematics; Wheels; Mathematical model; Heuristic algorithms; Deep deterministic policy gradient (DDPG); dynamic controller; noninteger proportional integral derivative (PID) controller; wheeled mobile robot (WMR); SLIDING-MODE CONTROL; MOBILE ROBOT; DYNAMIC CONTROLLER; DESIGN; TRACKING; STABILIZATION; MANIPULATOR;

DOI:

10.1109/TIE.2020.2979561

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

In this article, a noninteger proportional integral derivative (PID)-type controller based on the deep deterministic policy gradient (DDPG) algorithm is developed for the tracking problem of a mobile robot. This robot system is a typical nonholonomic plant and is exposed to measurement noise and external disturbances. To realize the control methodology, two control mechanisms are established independently: a kinematic controller (designed from the kinematic model of the vehicle) and a dynamic controller (realized according to the physical specifications of the vehicle dynamics). In particular, an optimal noninteger PID controller is first designed as the primary dynamic controller for the tracking problem of a nonholonomic wheeled mobile robot (WMR). Then, a DDPG algorithm with the actor-critic framework is established as the supplementary dynamic controller, which aids tracking stabilization by adapting to uncertainties and disturbances. This supplementary control compensates for what the original controller is unable to handle. A prototype of the WMR was also used to investigate the applicability of the suggested controller on a real-time platform. Experimental results are presented to affirm the effectiveness of the suggested control methodology.
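As a rough illustration of the primary/supplementary structure described in the abstract, the sketch below implements a noninteger (fractional-order) PID law and adds an external correction term to its output. It is not the authors' implementation: the abstract does not state how the noninteger operators are discretized, what the gains are, or how the DDPG output enters the loop. The Grünwald-Letnikov approximation used here is a standard technique chosen for illustration, the additive `supplementary` argument is an assumption standing in for the actor network's action, and all names (`gl_weights`, `FractionalPID`, `memory`) are hypothetical.

```python
from collections import deque


def gl_weights(alpha, n):
    """Grünwald-Letnikov weights w_0..w_{n-1} for a derivative of order alpha.

    A negative alpha gives the weights of a fractional integral.
    """
    w = [1.0]
    for j in range(1, n):
        w.append(w[-1] * (1.0 - (alpha + 1.0) / j))
    return w


class FractionalPID:
    """PI^lam D^mu controller with a truncated error history (assumed design)."""

    def __init__(self, kp, ki, kd, lam, mu, dt, memory=200):
        self.kp, self.ki, self.kd = kp, ki, kd
        self.lam, self.mu, self.dt = lam, mu, dt
        self.w_int = gl_weights(-lam, memory)  # integral of order lam
        self.w_der = gl_weights(mu, memory)    # derivative of order mu
        self.errors = deque(maxlen=memory)     # newest sample first

    def _gl(self, weights, alpha):
        # Weighted sum over past errors, scaled by dt^(-alpha).
        acc = sum(w * e for w, e in zip(weights, self.errors))
        return acc / (self.dt ** alpha)

    def step(self, error, supplementary=0.0):
        """Return the control signal for the current tracking error.

        `supplementary` is a placeholder for a learned correction
        (assumed here to be simply added to the primary output).
        """
        self.errors.appendleft(error)
        u_primary = (self.kp * error
                     + self.ki * self._gl(self.w_int, -self.lam)
                     + self.kd * self._gl(self.w_der, self.mu))
        return u_primary + supplementary
```

With `lam = mu = 1` the weights reduce to rectangular integration and a backward difference, so the class behaves like an ordinary discrete PID; noninteger orders spread both operators over the stored error history, which is why a finite `memory` window is needed.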


Pages: 3609-3618

Page count: 10
