Double Q-PID algorithm for mobile robot control

被引:51
作者
Carlucho, Ignacio [1 ]
De Paula, Mariano [1 ]
Acosta, Gerardo G. [1 ]
机构
[1] CIFICEN UNICEN CICpBA CONICET, Ctr Invest Fis Ctr, INTELYMEC, RA-7400 Olavarria, Argentina
关键词
Reinforcement learning; Double Q-learning; Incremental learning; Double Q-PID; Mobile robots; Multi-platforms; AUTONOMOUS UNDERWATER VEHICLE; ADAPTIVE-CONTROL; OPTIMUM SETTINGS; PERFORMANCE; ATTITUDE; SYSTEMS; ZIEGLER;
D O I
10.1016/j.eswa.2019.06.066
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many expert systems have been developed for self-adaptive PID controllers of mobile robots. However, the high computational requirements of the expert systems layers, developed for the tuning of the PID controllers, still require previous expert knowledge and high efficiency in algorithmic and software execution for real-time applications. To address these problems, in this paper we propose an expert agent based system, based on a reinforcement learning agent, for self-adapting multiple low-level PID controllers in mobile robots. For the formulation of the artificial expert agent, we develop an incremental model-free algorithm version of the double Q-Learning algorithm for fast on-line adaptation of multiple low-level PID controllers. Fast learning and high on-line adaptability of the artificial expert agent is achieved by means of a proposed incremental active-learning exploration-exploitation procedure, for a non-uniform state space exploration, along with an experience replay mechanism for multiple value functions updates in the double Q-Iearning algorithm. A comprehensive comparative simulation study and experiments in a real mobile robot demonstrate the high performance of the proposed algorithm for a real-time simultaneous tuning of multiple adaptive low-level PID controllers of mobile robots in real world conditions. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页码:292 / 307
页数:16
相关论文
共 77 条
[1]   A PC-Based Simulation Platform for a Quadcopter System With Self-Tuning Fuzzy PID Controllers [J].
Abu Rmilah, Moayad H. Y. ;
Hassan, Mohsen A. ;
Bin Mardi, Noor Azizi .
COMPUTER APPLICATIONS IN ENGINEERING EDUCATION, 2016, 24 (06) :934-950
[2]   AN EXPERT PID CONTROLLER USES REFINED ZIEGLER AND NICHOLS RULES AND FUZZY-LOGIC IDEAS [J].
ACOSTA, GG ;
MAYOSKY, MA ;
CATALFO, JM .
APPLIED INTELLIGENCE, 1994, 4 (01) :53-66
[3]  
Ahmed SF, 2015, 2015 SECOND INTERNATIONAL CONFERENCE ON COMPUTING TECHNOLOGY AND INFORMATION MANAGEMENT (ICCTIM), P99, DOI 10.1109/ICCTIM.2015.7224600
[4]   A Reactive Method for Collision Avoidance in Industrial Environments [J].
Alejo, D. ;
Cobano, J. A. ;
Heredia, G. ;
Ollero, A. .
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2016, 84 (1-4) :745-758
[5]  
[Anonymous], 2013, P 13 IR C FUZZ SYST
[6]  
[Anonymous], P 2 MULT C REINF LEA
[7]  
[Anonymous], DESIGN SIMULATION SE
[8]  
[Anonymous], P AIAA INF AER
[9]  
[Anonymous], 2015, Nature, DOI [10.1038/nature14539, DOI 10.1038/NATURE14539]
[10]  
[Anonymous], IFAC P VOLUMES