Q-learning based adaptive Kalman filtering for partial model-free dynamic systems

被引:0
|
作者
Tang, Kun [1 ]
Luan, Xiaoli [1 ]
Ding, Feng [1 ]
Liu, Fei [1 ]
机构
[1] Jiangnan Univ, Inst Automat, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi, Peoples R China
基金
中国国家自然科学基金;
关键词
adaptive Kalman filtering; model information unknown; multi-innovation least squares; Q-learning; PARAMETER-ESTIMATION; ALGORITHM; STATE;
D O I
10.1002/acs.3764
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, we propose an adaptive Kalman filtering based on Q-learning for partial model-free dynamic systems. First, a cost function is defined to iteratively update the prior state value when the model parameters are unknown. Then, the observations in a period of time are utilized to improve the accuracy and updating speed of the prior state estimation by means of the multi-innovation least squares. Next, considering that the weight matrix in the cost function will change due to external noise noise and model mismatch, the innovation-based adaptive estimation algorithm is presented to adjust the weight matrix by using the covariance of the information sequence. Finally, the proposed algorithms are applied to estimate the water level of a quadruple water tank system.
引用
收藏
页码:954 / 967
页数:14
相关论文
共 50 条
  • [41] Model-Free Optimal Tracking Control of Nonlinear Input-Affine Discrete-Time Systems via an Iterative Deterministic Q-Learning Algorithm
    Song, Shijie
    Zhu, Minglei
    Dai, Xiaolin
    Gong, Dawei
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (01) : 999 - 1012
  • [42] Model-free LQR design by Q-function learning
    Farjadnasab, Milad
    Babazadeh, Maryam
    AUTOMATICA, 2022, 137
  • [43] Q-learning-based Model-free Swing Up Control of an Inverted Pendulum
    Ghio, Alessio
    Ramos, Oscar E.
    PROCEEDINGS OF THE 2019 IEEE XXVI INTERNATIONAL CONFERENCE ON ELECTRONICS, ELECTRICAL ENGINEERING AND COMPUTING (INTERCON), 2019,
  • [44] Dynamic feature selection algorithm based on Q-learning mechanism
    Ruohao Xu
    Mengmeng Li
    Zhongliang Yang
    Lifang Yang
    Kangjia Qiao
    Zhigang Shang
    Applied Intelligence, 2021, 51 : 7233 - 7244
  • [45] Model-based Q-Learning for Humanoid Robots
    Le, Than D.
    Le, An T.
    Nguyen, Duy T.
    2017 18TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS (ICAR), 2017, : 608 - 613
  • [46] Model-free incremental adaptive dynamic programming based approximate robust optimal regulation
    Li, Cong
    Wang, Yongchao
    Liu, Fangzhou
    Liu, Qingchen
    Buss, Martin
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2022, 32 (05) : 2662 - 2682
  • [47] Dynamic feature selection algorithm based on Q-learning mechanism
    Xu, Ruohao
    Li, Mengmeng
    Yang, Zhongliang
    Yang, Lifang
    Qiao, Kangjia
    Shang, Zhigang
    APPLIED INTELLIGENCE, 2021, 51 (10) : 7233 - 7244
  • [48] Evaluation of Instance-Based Learning and Q-Learning Algorithms in Dynamic Environments
    Gupta, Anmol
    Roy, Partha Pratim
    Dutt, Varun
    IEEE ACCESS, 2021, 9 : 138775 - 138790
  • [49] A path planning approach for unmanned surface vehicles based on dynamic and fast Q-learning
    Hao, Bing
    Du, He
    Yan, Zheping
    OCEAN ENGINEERING, 2023, 270
  • [50] An adaptive backoff selection scheme based on Q-learning for CSMA/CA
    Zhichao Zheng
    Shengming Jiang
    Ruoyu Feng
    Lige Ge
    Chongchong Gu
    Wireless Networks, 2023, 29 : 1899 - 1909