Safe online optimization of motor speed synchronization control with incremental Q-learning

被引:0
|
作者
Huang, Jianfeng [1 ]
Lu, Guoqiang [1 ]
Yao, Xudong [1 ]
机构
[1] Shantou Univ, Coll Engn, Shantou 515063, Peoples R China
关键词
Online controller tuning; Safe reinforcement learning; Q; -learning; Motor speed synchronization; PARTICLE SWARM OPTIMIZATION; ENERGY MANAGEMENT; REINFORCEMENT; DESIGN; STRATEGY;
D O I
10.1016/j.eswa.2024.124622
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning (RL) is promising for online controller optimization. However, its practical application has been hindered by safety issues. This paper proposes an algorithm named Incremental Q-learning (IQ) and applies it to the online optimization of motor speed synchronization control. IQ ensures safe learning by adopting so-called incremental action variables which represent incremental change rather than absolute magnitude, and dividing the one-round learning process in the classic Q-learning (in this paper referred to as Absolute Qlearning, AQ) into multiple consecutive ones with the Q table getting reset at the beginning of each round. Since the permitted interval of change is restricted to be very small, the agent can learn its way safely, steadily, and robustly towards the optimal policy. Simulation results show that IQ is advantageous to AQ in optimality, safety, and adaptability. IQ converges to better final performances with significantly smaller performance variance along the whole learning process, smaller torque trajectory deviation between consecutive episodes and adapts to unknown disturbances faster. It is of great potential for online controller optimization/tuning in practical engineering projects. Source code and demos are provided.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Hierarchical control of traffic signals using Q-learning with tile coding
    Abdoos, Monireh
    Mozayani, Nasser
    Bazzan, Ana L. C.
    APPLIED INTELLIGENCE, 2014, 40 (02) : 201 - 213
  • [32] Discrete control model Q-learning for an energy storage system with a hydrogen unit of an autonomous hybrid power plant of a railway substation
    Matrenin, Pavel
    Ghulomzoda, Anvari
    Safaraliev, Murodbek
    Tavlintsev, Alexander
    INTERNATIONAL JOURNAL OF HYDROGEN ENERGY, 2024, 93 : 704 - 714
  • [33] Q-learning whale optimization algorithm for test suite generation with constraints support
    Hassan, Ali Abdullah
    Abdullah, Salwani
    Zamli, Kamal Z.
    Razali, Rozilawati
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (34) : 24069 - 24090
  • [34] Q-Learning Based on Particle Swarm Optimization for Positioning System of Underwater Vehicles
    Gao Yan-zeng
    Ye Jia-wei
    Chen Yuan-ming
    Liang Fu-ling
    2009 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INTELLIGENT SYSTEMS, PROCEEDINGS, VOL 2, 2009, : 68 - 71
  • [35] Hyperparameter Optimization for the LSTM Method of AUV Model Identification Based on Q-Learning
    Wang, Dianrui
    Wan, Junhe
    Shen, Yue
    Qin, Ping
    He, Bo
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2022, 10 (08)
  • [36] Q-learning based vegetation evolution for numerical optimization and wireless sensor network coverage optimization
    Zhong, Rui
    Peng, Fei
    Yu, Jun
    Munetomo, Masaharu
    ALEXANDRIA ENGINEERING JOURNAL, 2024, 87 : 148 - 163
  • [37] Comparative study of motor speed synchronization control for an integrated motor-transmission powertrain system
    Huang, Jianfeng
    Zhang, Jianlong
    Yin, Chengliang
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2020, 234 (04) : 1137 - 1152
  • [38] Q-learning improved golden jackal optimization algorithm and its application to reliability optimization of hydraulic system
    Chen, Dongning
    Wang, Haowen
    Hu, Dongbo
    Xian, Qinggui
    Wu, Bingyu
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [39] Q-learning based fault estimation and fault tolerant iterative learning control for MIMO systems
    Wang, Rui
    Zhuang, Zhihe
    Tao, Hongfeng
    Paszke, Wojciech
    Stojanovic, Vladimir
    ISA TRANSACTIONS, 2023, 142 : 123 - 135
  • [40] Quantized measurements in Q-learning based model-free optimal control
    Tiistola, Sini
    Ritala, Risto
    Vilkko, Matti
    IFAC PAPERSONLINE, 2020, 53 (02): : 1640 - 1645