Q-Learning Based Detector Design for State Estimation Under Non-gaussian Noises

Cited by: 0

Authors:

Luo, Yue [1]

Liu, Yun [1]

Yang, Wen [1]

Wang, Xiaofan [2,3]

Affiliations:

[1] East China Univ Sci & Technol, Key Lab Smart Mfg Energy Chem Proc, Shanghai 200237, Peoples R China

[2] Shanghai Univ, Sch Mechatron Engn & Automat, Shanghai 200444, Peoples R China

[3] Shanghai Inst Technol, Sch Elect & Elect Engn, Shanghai 201418, Peoples R China

Source:

CIRCUITS SYSTEMS AND SIGNAL PROCESSING | 2025

Funding:

National Natural Science Foundation of China;

Keywords:

Kalman filtering; State estimation; Non-gaussian noise; Attack detection; KL divergence; Q-learning;

DOI:

10.1007/s00034-025-03001-3

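Chinese Library Classification (CLC):

TM [Electrical engineering]; TN [Electronic and communication technology];

Discipline classification codes: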

0808 ; 0809 ;

Abstract:

This paper addresses the detection of false data injection (FDI) attacks in state estimation under non-Gaussian noise. During estimation, potential attacks and non-Gaussian noise can make the innovation non-Gaussian, rendering traditional attack detection methods ineffective. To tackle this issue, we propose a novel detection strategy that uses the Kullback-Leibler (KL) divergence as the detection metric, which adapts well to non-Gaussian scenarios. Furthermore, we adopt a Q-learning strategy to train the safety threshold of the detector and improve detection reliability. Python simulation experiments demonstrate that the designed detector has a negligible impact on estimation performance and provides effective detection performance against FDI attacks.
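
The idea of using KL divergence as a detection metric can be illustrated with a minimal sketch. This is not the paper's detector: it assumes a simple histogram-based KL estimate, Laplace-distributed samples standing in for non-Gaussian innovations, and a fixed, hand-picked threshold (the paper trains its threshold with Q-learning, which is not reproduced here). The names `kl_divergence_hist`, `detect`, and `threshold` are hypothetical.

```python
import numpy as np

def kl_divergence_hist(reference, window, bins=20, eps=1e-6):
    """Histogram estimate of KL(window || reference) on shared bin edges."""
    lo = min(reference.min(), window.min())
    hi = max(reference.max(), window.max())
    edges = np.linspace(lo, hi, bins + 1)
    p, _ = np.histogram(window, bins=edges)
    q, _ = np.histogram(reference, bins=edges)
    # A small pseudo-count keeps empty bins from producing log(0) or division by zero.
    p = (p + eps) / (p + eps).sum()
    q = (q + eps) / (q + eps).sum()
    return float(np.sum(p * np.log(p / q)))

def detect(reference, window, threshold):
    """Raise an alarm when the windowed innovations drift from the reference."""
    return kl_divergence_hist(reference, window) > threshold

rng = np.random.default_rng(0)
# Assumption: heavy-tailed Laplace samples stand in for attack-free innovations.
reference = rng.laplace(0.0, 1.0, size=5000)
clean = rng.laplace(0.0, 1.0, size=500)
# Assumption: the attack is modelled as a constant bias injected into the innovation.
attacked = rng.laplace(0.0, 1.0, size=500) + 3.0

threshold = 0.5  # hypothetical fixed value; the paper learns this with Q-learning

kl_clean = kl_divergence_hist(reference, clean)
kl_attacked = kl_divergence_hist(reference, attacked)
alarm_clean = detect(reference, clean, threshold)
alarm_attacked = detect(reference, attacked, threshold)
print(f"clean: KL={kl_clean:.3f}, alarm={alarm_clean}")
print(f"attacked: KL={kl_attacked:.3f}, alarm={alarm_attacked}")
```

Because the metric compares empirical distributions directly, the sketch makes no Gaussian assumption about the innovation. The threshold trades false alarms against missed detections, which is the quantity the abstract says is tuned by Q-learning.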

Citation

Pages: 4082 - 4100

Page count: 19

26 references in total

