Q-learning based adaptive Kalman filtering for partial model-free dynamic systems

Cited by: 0

Authors:

Tang, Kun [1]

Luan, Xiaoli [1]

Ding, Feng [1]

Liu, Fei [1]

Affiliations:

[1] Jiangnan Univ, Inst Automat, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi, Peoples R China

Source:

INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING | 2024 / Vol. 38 / No. 03

Funding:

National Natural Science Foundation of China;

Keywords:

adaptive Kalman filtering; model information unknown; multi-innovation least squares; Q-learning; PARAMETER-ESTIMATION; ALGORITHM; STATE;

DOI:

10.1002/acs.3764

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

In this article, we propose an adaptive Kalman filtering method based on Q-learning for partial model-free dynamic systems. First, a cost function is defined to iteratively update the prior state estimate when the model parameters are unknown. Then, observations collected over a period of time are used, by means of multi-innovation least squares, to improve the accuracy and update speed of the prior state estimation. Next, because the weight matrix in the cost function changes with external noise and model mismatch, an innovation-based adaptive estimation algorithm is presented that adjusts the weight matrix using the covariance of the information sequence. Finally, the proposed algorithms are applied to estimate the water level of a quadruple water tank system.
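
For orientation only, the general idea behind innovation-based adaptive estimation can be sketched as follows. This is not the algorithm of the article (which adapts a weight matrix inside a Q-learning cost function and uses multi-innovation least squares); it is a minimal textbook-style sketch that assumes a scalar linear model with a known transition coefficient `a` and direct measurement of the state. The function name `iae_kalman_1d` and all parameters are hypothetical. The measurement-noise variance is re-estimated by matching the sample variance of recent innovations to its theoretical value, prior variance plus R.

```python
import random
from collections import deque


def iae_kalman_1d(measurements, a=1.0, q=1e-4, r0=1.0, x0=0.0, p0=1.0,
                  window=200, r_floor=1e-6):
    """Scalar Kalman filter with a windowed, innovation-based estimate of R.

    Assumed model (illustrative only): x[k+1] = a*x[k] + w,  y[k] = x[k] + v,
    with Var(w) = q known and Var(v) = R unknown.
    """
    x, p, r = x0, p0, r0
    innovations = deque(maxlen=window)  # sliding window of recent innovations
    estimates, r_history = [], []
    for y in measurements:
        # Prediction step
        x_prior = a * x
        p_prior = a * p * a + q
        # Innovation: measurement minus predicted measurement
        nu = y - x_prior
        innovations.append(nu)
        # Theoretical innovation variance is p_prior + R, so once the window
        # is full, R is estimated as (sample innovation variance) - p_prior.
        if len(innovations) == window:
            c_nu = sum(v * v for v in innovations) / window
            r = max(c_nu - p_prior, r_floor)  # keep R strictly positive
        # Measurement update with the current R estimate
        gain = p_prior / (p_prior + r)
        x = x_prior + gain * nu
        p = (1.0 - gain) * p_prior
        estimates.append(x)
        r_history.append(r)
    return estimates, r_history


# Synthetic data: a constant level observed with noise of variance 0.25,
# filtered with a deliberately wrong initial guess r0 = 10.
rng = random.Random(0)
true_level, true_r = 5.0, 0.25
ys = [true_level + rng.gauss(0.0, true_r ** 0.5) for _ in range(2000)]
est, r_hist = iae_kalman_1d(ys, r0=10.0, window=200)
print(f"final level estimate: {est[-1]:.3f}, final R estimate: {r_hist[-1]:.3f}")
```

In this sketch a longer window gives a smoother but slower-reacting estimate of R; the floor `r_floor` guards against a negative variance when the sample innovation variance falls below the prior variance.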


Pages: 954-967

Number of pages: 14
