Safe online optimization of motor speed synchronization control with incremental Q-learning

被引：0

作者：

Huang, Jianfeng ^{[1
]}

Lu, Guoqiang ^{[1
]}

Yao, Xudong ^{[1
]}

机构：

[1] Shantou Univ, Coll Engn, Shantou 515063, Peoples R China

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2024年 / 255卷

关键词：

Online controller tuning; Safe reinforcement learning; Q; -learning; Motor speed synchronization; PARTICLE SWARM OPTIMIZATION; ENERGY MANAGEMENT; REINFORCEMENT; DESIGN; STRATEGY;

D O I：

10.1016/j.eswa.2024.124622

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Reinforcement learning (RL) is promising for online controller optimization. However, its practical application has been hindered by safety issues. This paper proposes an algorithm named Incremental Q-learning (IQ) and applies it to the online optimization of motor speed synchronization control. IQ ensures safe learning by adopting so-called incremental action variables which represent incremental change rather than absolute magnitude, and dividing the one-round learning process in the classic Q-learning (in this paper referred to as Absolute Qlearning, AQ) into multiple consecutive ones with the Q table getting reset at the beginning of each round. Since the permitted interval of change is restricted to be very small, the agent can learn its way safely, steadily, and robustly towards the optimal policy. Simulation results show that IQ is advantageous to AQ in optimality, safety, and adaptability. IQ converges to better final performances with significantly smaller performance variance along the whole learning process, smaller torque trajectory deviation between consecutive episodes and adapts to unknown disturbances faster. It is of great potential for online controller optimization/tuning in practical engineering projects. Source code and demos are provided.

引用

页数：16

共 50 条

[31] Hierarchical control of traffic signals using Q-learning with tile coding
Abdoos, Monireh
Mozayani, Nasser
Bazzan, Ana L. C.
APPLIED INTELLIGENCE, 2014, 40 (02) : 201 - 213
[32] Discrete control model Q-learning for an energy storage system with a hydrogen unit of an autonomous hybrid power plant of a railway substation
Matrenin, Pavel
Ghulomzoda, Anvari
Safaraliev, Murodbek
Tavlintsev, Alexander
INTERNATIONAL JOURNAL OF HYDROGEN ENERGY, 2024, 93 : 704 - 714
[33] Q-learning whale optimization algorithm for test suite generation with constraints support
Hassan, Ali Abdullah
Abdullah, Salwani
Zamli, Kamal Z.
Razali, Rozilawati
NEURAL COMPUTING & APPLICATIONS, 2023, 35 (34) : 24069 - 24090
[34] Q-Learning Based on Particle Swarm Optimization for Positioning System of Underwater Vehicles
Gao Yan-zeng
Ye Jia-wei
Chen Yuan-ming
Liang Fu-ling
2009 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INTELLIGENT SYSTEMS, PROCEEDINGS, VOL 2, 2009, : 68 - 71
[35] Hyperparameter Optimization for the LSTM Method of AUV Model Identification Based on Q-Learning
Wang, Dianrui
Wan, Junhe
Shen, Yue
Qin, Ping
He, Bo
JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2022, 10 (08)
[36] Q-learning based vegetation evolution for numerical optimization and wireless sensor network coverage optimization
Zhong, Rui
Peng, Fei
Yu, Jun
Munetomo, Masaharu
ALEXANDRIA ENGINEERING JOURNAL, 2024, 87 : 148 - 163
[37] Comparative study of motor speed synchronization control for an integrated motor-transmission powertrain system
Huang, Jianfeng
Zhang, Jianlong
Yin, Chengliang
PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2020, 234 (04) : 1137 - 1152
[38] Q-learning improved golden jackal optimization algorithm and its application to reliability optimization of hydraulic system
Chen, Dongning
Wang, Haowen
Hu, Dongbo
Xian, Qinggui
Wu, Bingyu
SCIENTIFIC REPORTS, 2024, 14 (01):
[39] Q-learning based fault estimation and fault tolerant iterative learning control for MIMO systems
Wang, Rui
Zhuang, Zhihe
Tao, Hongfeng
Paszke, Wojciech
Stojanovic, Vladimir
ISA TRANSACTIONS, 2023, 142 : 123 - 135
[40] Quantized measurements in Q-learning based model-free optimal control
Tiistola, Sini
Ritala, Risto
Vilkko, Matti
IFAC PAPERSONLINE, 2020, 53 (02): : 1640 - 1645

← 1 2 3 4 5 →