Heavy-Tailed Reinforcement Learning With Penalized Robust Estimator

被引:0
作者
Park, Hyeon-Jun [1 ]
Lee, Kyungjae [1 ]
机构
[1] Chung Ang Univ, Dept Artificial Intelligence, Seoul 06974, South Korea
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Noise measurement; Heavily-tailed distribution; Q-learning; Stochastic processes; Random variables; Object recognition; Markov decision processes; Reinforcement learning; heavy-tailed noise; regret analysis;
D O I
10.1109/ACCESS.2024.3424828
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider finite-horizon episodic reinforcement learning (RL) under heavy-tailed noises, where the p-th moment is bounded for any p is an element of (1,2]. In this setting, existing RL algorithms are limited by their requirement for prior knowledge about the bounded moment order of the noise distribution. This requirement hinders their practical application, as such prior information is rarely available in real-world scenarios. Our proposed method eliminates the need for this prior knowledge, enabling implementation in a wider range of scenarios. We introduce two RL algorithms, p-Heavy-UCRL and p-Heavy-Q-learning, designed for model-based and model-free RL settings, respectively. Without the need for prior knowledge, these algorithms demonstrate robustness to heavy-tailed noise and achieve nearly optimal regret bounds, up to logarithmic terms, with the same dependencies on dominating terms as existing algorithms. Finally, we show that our proposed algorithms have empirically comparable performance to existing algorithms in synthetic tabular scenario.
引用
收藏
页码:107800 / 107817
页数:18
相关论文
共 50 条
  • [21] Functional Convergence of Linear Processes with Heavy-Tailed Innovations
    Balan, Raluca
    Jakubowski, Adam
    Louhichi, Sana
    JOURNAL OF THEORETICAL PROBABILITY, 2016, 29 (02) : 491 - 526
  • [22] ASYMPTOTICS FOR SYSTEMIC RISK WITH DEPENDENT HEAVY-TAILED LOSSES
    Liu, Jiajun
    Yang, Yang
    ASTIN BULLETIN-THE JOURNAL OF THE INTERNATIONAL ACTUARIAL ASSOCIATION, 2021, 51 (02) : 571 - 605
  • [23] Principal Gaussian Overbound for Heavy-Tailed Error Bounding
    Yan, Penggao
    Zhong, Yihan
    Hsu, Li-Ta
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2025, 61 (01) : 829 - 852
  • [24] Robust particle filter algorithms for one-step randomly delayed measurements and heavy-tailed noise
    Wang, Zongyuan
    Zhou, Weidong
    ELECTRONICS LETTERS, 2020, 56 (17) : 899 - 901
  • [25] MMF-GSTIW-PMBM Adaptive Filter for Multiple Group Target Tracking With Heavy-Tailed Noise
    Xue, Xirui
    Huang, Shucai
    Wei, Daozhi
    IEEE SENSORS JOURNAL, 2023, 23 (17) : 19959 - 19973
  • [26] Reconstructed Variational Bayesian Kalman Filter Under Heavy-Tailed and Skewed Noises
    Wang, Ke
    Wu, Panlong
    Zhao, Baochen
    Kong, Lingqi
    He, Shan
    IEEE SIGNAL PROCESSING LETTERS, 2024, 31 : 2405 - 2409
  • [27] A Robust Gaussian Approximate Fixed-Interval Smoother for Nonlinear Systems With Heavy-Tailed Process and Measurement Noises
    Huang, Yulong
    Zhang, Yonggang
    Li, Ning
    Chambers, Jonathon
    IEEE SIGNAL PROCESSING LETTERS, 2016, 23 (04) : 468 - 472
  • [28] Denoising Heavy-Tailed Data Using a Novel Robust Non-Parametric Method Based on Quantile Regression
    Gorji, Ferdos
    Aminghafari, Mina
    FLUCTUATION AND NOISE LETTERS, 2017, 16 (03):
  • [29] Robust Variational Bayesian Adaptive Cubature Kalman Filtering Algorithm for Simultaneous Localization and Mapping with Heavy-Tailed Noise
    Zhang Z.
    Dong P.
    Tuo H.
    Liu G.
    Jia H.
    Journal of Shanghai Jiaotong University (Science), 2020, 25 (01) : 76 - 87
  • [30] Asymptotic estimates for systemic risk with dependent heavy-tailed losses
    Xu, Chenghao
    Peng, Jiangyan
    Zou, Lei
    LITHUANIAN MATHEMATICAL JOURNAL, 2025, 65 (01) : 134 - 150