Q-learning based adaptive Kalman filtering for partial model-free dynamic systems

Cited by: 0

Authors:

Tang, Kun ^{[1]}

Luan, Xiaoli ^{[1]}

Ding, Feng ^{[1]}

Liu, Fei ^{[1]}

Affiliation:

[1] Jiangnan Univ, Inst Automat, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi, Peoples R China

Source:

INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING | 2024 / Vol. 38 / Issue 03

Funding:

National Natural Science Foundation of China;

Keywords:

adaptive Kalman filtering; model information unknown; multi-innovation least squares; Q-learning; PARAMETER-ESTIMATION; ALGORITHM; STATE;

DOI:

10.1002/acs.3764

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

In this article, we propose an adaptive Kalman filtering method based on Q-learning for partial model-free dynamic systems. First, a cost function is defined to iteratively update the prior state value when the model parameters are unknown. Then, the observations over a period of time are used to improve the accuracy and updating speed of the prior state estimation by means of multi-innovation least squares. Next, considering that the weight matrix in the cost function will change due to external noise and model mismatch, an innovation-based adaptive estimation algorithm is presented to adjust the weight matrix by using the covariance of the information sequence. Finally, the proposed algorithms are applied to estimate the water level of a quadruple water tank system.


Pages: 954-967

Page count: 14

50 related records in total

[21] Q-learning for continuous-time linear systems: A model-free infinite horizon optimal control approach
Vamvoudakis, Kyriakos G.
SYSTEMS & CONTROL LETTERS, 2017, 100 : 14 - 20
[22] Model-free H∞ control design for unknown linear discrete-time systems via Q-learning with LMI
Kim, J. -H.
Lewis, F. L.
AUTOMATICA, 2010, 46 (08) : 1320 - 1326
[23] Cooperative strategy based on adaptive Q-learning for robot soccer systems
Hwang, KS
Tan, SW
Chen, CC
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2004, 12 (04) : 569 - 576
[24] Model-Free Perception-Based Control via Q-Learning with an Application to Heat-Seeking Missile Guidance
Kovalik, Wade S.
Zhai, Lijing
Vamvoudakis, Kyriakos G.
5TH IEEE CONFERENCE ON CONTROL TECHNOLOGY AND APPLICATIONS (IEEE CCTA 2021), 2021, : 1142 - 1147
[25] Data-driven model-free slip control of anti-lock braking systems using reinforcement Q-learning
Radac, Mircea-Bogdan
Precup, Radu-Emil
NEUROCOMPUTING, 2018, 275 : 317 - 329
[26] Model based path planning using Q-Learning
Sharma, Avinash
Gupta, Kanika
Kumar, Anirudha
Sharma, Aishwarya
Kumar, Rajesh
2017 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), 2017, : 837 - 842
[27] An Adaptive and Near Parameter-Free BRKGA Using Q-Learning Method
Chaves, Antonio Augusto
Nogueira Lorena, Luiz Henrique
2021 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC 2021), 2021, : 2331 - 2338
[28] Adaptive and Coordinated Traffic Signal Control Based on Q-Learning and MULTIBAND Model
Lu, Shoufeng
Liu, Ximin
Dai, Shiqiang
2008 IEEE CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS, VOLS 1 AND 2, 2008, : 446 - +
[29] Stackelberg games for model-free continuous-time stochastic systems based on adaptive dynamic programming
Liu, Xikui
Ge, Yingying
Li, Yan
APPLIED MATHEMATICS AND COMPUTATION, 2019, 363
[30] Maneuvering Target Tracking Using Q-learning Based Kalman Filter
Bekhtaoui, Z.
Meche, A.
Dahmani, M.
Meraim, K. Abed
2017 5TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING - BOUMERDES (ICEE-B), 2017,
