Q-learning based adaptive Kalman filtering for partial model-free dynamic systems

被引:0
|
作者
Tang, Kun [1 ]
Luan, Xiaoli [1 ]
Ding, Feng [1 ]
Liu, Fei [1 ]
机构
[1] Jiangnan Univ, Inst Automat, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi, Peoples R China
基金
中国国家自然科学基金;
关键词
adaptive Kalman filtering; model information unknown; multi-innovation least squares; Q-learning; PARAMETER-ESTIMATION; ALGORITHM; STATE;
D O I
10.1002/acs.3764
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, we propose an adaptive Kalman filtering based on Q-learning for partial model-free dynamic systems. First, a cost function is defined to iteratively update the prior state value when the model parameters are unknown. Then, the observations in a period of time are utilized to improve the accuracy and updating speed of the prior state estimation by means of the multi-innovation least squares. Next, considering that the weight matrix in the cost function will change due to external noise noise and model mismatch, the innovation-based adaptive estimation algorithm is presented to adjust the weight matrix by using the covariance of the information sequence. Finally, the proposed algorithms are applied to estimate the water level of a quadruple water tank system.
引用
收藏
页码:954 / 967
页数:14
相关论文
共 50 条
  • [21] Q-learning for continuous-time linear systems: A model-free infinite horizon optimal control approach
    Vamvoudakis, Kyriakos G.
    SYSTEMS & CONTROL LETTERS, 2017, 100 : 14 - 20
  • [22] Model-free H∞ control design for unknown linear discrete-time systems via Q-learning with LMI
    Kim, J. -H.
    Lewis, F. L.
    AUTOMATICA, 2010, 46 (08) : 1320 - 1326
  • [23] Cooperative strategy based on adaptive Q-learning for robot soccer systems
    Hwang, KS
    Tan, SW
    Chen, CC
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2004, 12 (04) : 569 - 576
  • [24] Model-Free Perception-Based Control via Q-Learning with an Application to Heat-Seeking Missile Guidance
    Kovalik, Wade S.
    Zhai, Lijing
    Vamvoudakis, Kyriakos G.
    5TH IEEE CONFERENCE ON CONTROL TECHNOLOGY AND APPLICATIONS (IEEE CCTA 2021), 2021, : 1142 - 1147
  • [25] Data-driven model-free slip control of anti-lock braking systems using reinforcement Q-learning
    Radac, Mircea-Bogdan
    Precup, Radu-Emil
    NEUROCOMPUTING, 2018, 275 : 317 - 329
  • [26] Model based path planning using Q-Learning
    Sharma, Avinash
    Gupta, Kanika
    Kumar, Anirudha
    Sharma, Aishwarya
    Kumar, Rajesh
    2017 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), 2017, : 837 - 842
  • [27] An Adaptive and Near Parameter-Free BRKGA Using Q-Learning Method
    Chaves, Antonio Augusto
    Nogueira Lorena, Luiz Henrique
    2021 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC 2021), 2021, : 2331 - 2338
  • [28] Adaptive and Coordinated Traffic Signal Control Based on Q-Learning and MULTIBAND Model
    Lu, Shoufeng
    Liu, Ximin
    Dai, Shiqiang
    2008 IEEE CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS, VOLS 1 AND 2, 2008, : 446 - +
  • [29] Stackelberg games for model-free continuous-time stochastic systems based on adaptive dynamic programming
    Liu, Xikui
    Ge, Yingying
    Li, Yan
    APPLIED MATHEMATICS AND COMPUTATION, 2019, 363
  • [30] Maneuvering Target Tracking Using Q-learning Based Kalman Filter
    Bekhtaoui, Z.
    Meche, A.
    Dahmani, M.
    Meraim, K. Abed
    2017 5TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING - BOUMERDES (ICEE-B), 2017,