Heavy-Tailed Reinforcement Learning With Penalized Robust Estimator

被引:0
作者
Park, Hyeon-Jun [1 ]
Lee, Kyungjae [1 ]
机构
[1] Chung Ang Univ, Dept Artificial Intelligence, Seoul 06974, South Korea
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Noise measurement; Heavily-tailed distribution; Q-learning; Stochastic processes; Random variables; Object recognition; Markov decision processes; Reinforcement learning; heavy-tailed noise; regret analysis;
D O I
10.1109/ACCESS.2024.3424828
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider finite-horizon episodic reinforcement learning (RL) under heavy-tailed noises, where the p-th moment is bounded for any p is an element of (1,2]. In this setting, existing RL algorithms are limited by their requirement for prior knowledge about the bounded moment order of the noise distribution. This requirement hinders their practical application, as such prior information is rarely available in real-world scenarios. Our proposed method eliminates the need for this prior knowledge, enabling implementation in a wider range of scenarios. We introduce two RL algorithms, p-Heavy-UCRL and p-Heavy-Q-learning, designed for model-based and model-free RL settings, respectively. Without the need for prior knowledge, these algorithms demonstrate robustness to heavy-tailed noise and achieve nearly optimal regret bounds, up to logarithmic terms, with the same dependencies on dominating terms as existing algorithms. Finally, we show that our proposed algorithms have empirically comparable performance to existing algorithms in synthetic tabular scenario.
引用
收藏
页码:107800 / 107817
页数:18
相关论文
共 50 条
  • [1] Robust Heavy-Tailed Linear Bandits Algorithm
    Ma L.
    Zhao P.
    Zhou Z.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2023, 60 (06): : 1385 - 1395
  • [2] Delay-Optimal Scheduling for Heavy-Tailed and Light-Tailed Flows via Reinforcement Learning
    Guo, Mian
    Guan, Quansheng
    Chen, Weiqi
    Ji, Fei
    Peng, Zhiping
    PROCEEDINGS OF 2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS (ICCS 2018), 2018, : 292 - 296
  • [3] Robust sequential learning of feedforward neural networks in the presence of heavy-tailed noise
    Vukovic, Najdan
    Miljkovic, Zoran
    NEURAL NETWORKS, 2015, 63 : 31 - 47
  • [4] Robust Adaptive Filters and Smoothers for Linear Systems With Heavy-Tailed Multiplicative/Additive Noises
    Yu, Xingkai
    Qu, Zhi
    Jin, Gumin
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2024, 60 (05) : 6717 - 6733
  • [5] Performance Analysis of a Robust Wavelet Threshold for Heavy-tailed Noises
    Wei Guangfen
    Su Feng
    Jian Tao
    ADVANCES IN SCIENCE AND ENGINEERING, PTS 1 AND 2, 2011, 40-41 : 979 - +
  • [6] Study on the Robust Wavelet Threshold Technique for Heavy-tailed Noises
    Wei, Guangfen
    Su, Feng
    Jian, Tao
    JOURNAL OF COMPUTERS, 2011, 6 (06) : 1246 - 1253
  • [7] High Probability Convergence of Clipped Distributed Dual Averaging With Heavy-Tailed Noises
    Qin, Yanfu
    Lu, Kaihong
    Xu, Hang
    Chen, Xiangyong
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2025,
  • [8] Robust Rauch-Tung-Striebel Smoothing Framework for Heavy-Tailed and/or Skew Noises
    Huang, Yulong
    Zhang, Yonggang
    Zhao, Yuxin
    Mihaylova, Lyudmila
    Chambers, Jonathon A.
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2020, 56 (01) : 415 - 441
  • [9] A Novel Heavy-Tailed Mixture Distribution Based Robust Kalman Filter for Cooperative Localization
    Bai, Mingming
    Huang, Yulong
    Zhang, Yonggang
    Chen, Feng
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (05) : 3671 - 3681
  • [10] Robust Bayesian Recursive Ensemble Kalman Filter Under the Nonstationary Heavy-Tailed Noise
    Wang, Li
    Chen, Hui
    Lian, Feng
    Zhang, Wenxu
    Liu, Jiabin
    IEEE SENSORS JOURNAL, 2025, 25 (01) : 749 - 762