Q-Learning Based Detector Design for State Estimation Under Non-gaussian Noises

Times cited: 0
Authors
Luo, Yue [1]
Liu, Yun [1]
Yang, Wen [1]
Wang, Xiaofan [2, 3]
Affiliations
[1] East China Univ Sci & Technol, Key Lab Smart Mfg Energy Chem Proc, Shanghai 200237, Peoples R China
[2] Shanghai Univ, Sch Mechatron Engn & Automat, Shanghai 200444, Peoples R China
[3] Shanghai Inst Technol, Sch Elect & Elect Engn, Shanghai 201418, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Kalman filtering; State estimation; Non-Gaussian noise; Attack detection; KL divergence; Q-learning
DOI
10.1007/s00034-025-03001-3
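Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic and communication technology]
Discipline classification code
0808; 0809
Abstract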
This paper addresses the problem of detecting false data injection (FDI) attacks on state estimation under non-Gaussian noise. During the estimation process, potential attacks and non-Gaussian noise can make the innovation non-Gaussian, rendering traditional attack detection methods ineffective. To tackle this issue, we propose a novel detection strategy that uses the Kullback-Leibler (KL) divergence as the detection metric, which adapts well to non-Gaussian scenarios. Furthermore, we adopt a Q-learning strategy to train the safety threshold of the detector and improve the reliability of detection. Python simulation experiments show that the designed detector has a negligible impact on estimation performance and provides effective detection performance against FDI attacks.
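The idea of a KL-divergence detector can be illustrated with a minimal sketch. It is not the authors' implementation: the histogram estimator, the Laplace-distributed innovations, the constant injected bias, and the hand-picked `threshold` are all assumptions made for illustration (the abstract states that the threshold is trained with Q-learning, which is not reproduced here).

```python
import numpy as np

def kl_divergence_hist(reference, window, bins=30, smoothing=1.0):
    """Histogram estimate of KL(window || reference) on a shared bin grid.

    Hypothetical helper: additive smoothing keeps empty bins from
    producing infinite log-ratios.
    """
    lo = min(reference.min(), window.min())
    hi = max(reference.max(), window.max())
    edges = np.linspace(lo, hi, bins + 1)
    p, _ = np.histogram(window, bins=edges)
    q, _ = np.histogram(reference, bins=edges)
    p = (p + smoothing) / (p + smoothing).sum()
    q = (q + smoothing) / (q + smoothing).sum()
    return float(np.sum(p * np.log(p / q)))

rng = np.random.default_rng(0)

# Assumption: heavy-tailed Laplace samples stand in for non-Gaussian innovations.
nominal = rng.laplace(0.0, 1.0, size=5000)         # attack-free reference data
clean = rng.laplace(0.0, 1.0, size=5000)           # new window, no attack
attacked = rng.laplace(0.0, 1.0, size=5000) + 2.0  # window with an injected bias

threshold = 0.5  # assumption: fixed by hand here, not learned

kl_clean = kl_divergence_hist(nominal, clean)
kl_attacked = kl_divergence_hist(nominal, attacked)

# Raise an alarm when the divergence from the nominal distribution is too large.
alarm_clean = kl_clean > threshold
alarm_attacked = kl_attacked > threshold
print(f"clean window:    KL={kl_clean:.3f}, alarm={alarm_clean}")
print(f"attacked window: KL={kl_attacked:.3f}, alarm={alarm_attacked}")
```

Because the metric compares whole distributions rather than assuming a Gaussian innovation, the same test applies regardless of the noise shape; only the choice of threshold has to be tuned.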
Pages: 4082-4100
Page count: 19