Heavy-Tailed Reinforcement Learning With Penalized Robust Estimator

被引:0
作者
Park, Hyeon-Jun [1 ]
Lee, Kyungjae [1 ]
机构
[1] Chung Ang Univ, Dept Artificial Intelligence, Seoul 06974, South Korea
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Noise measurement; Heavily-tailed distribution; Q-learning; Stochastic processes; Random variables; Object recognition; Markov decision processes; Reinforcement learning; heavy-tailed noise; regret analysis;
D O I
10.1109/ACCESS.2024.3424828
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider finite-horizon episodic reinforcement learning (RL) under heavy-tailed noises, where the p-th moment is bounded for any p is an element of (1,2]. In this setting, existing RL algorithms are limited by their requirement for prior knowledge about the bounded moment order of the noise distribution. This requirement hinders their practical application, as such prior information is rarely available in real-world scenarios. Our proposed method eliminates the need for this prior knowledge, enabling implementation in a wider range of scenarios. We introduce two RL algorithms, p-Heavy-UCRL and p-Heavy-Q-learning, designed for model-based and model-free RL settings, respectively. Without the need for prior knowledge, these algorithms demonstrate robustness to heavy-tailed noise and achieve nearly optimal regret bounds, up to logarithmic terms, with the same dependencies on dominating terms as existing algorithms. Finally, we show that our proposed algorithms have empirically comparable performance to existing algorithms in synthetic tabular scenario.
引用
收藏
页码:107800 / 107817
页数:18
相关论文
共 50 条
  • [41] A sparse approach for high-dimensional data with heavy-tailed noise
    Ye, Yafen
    Shao, Yuanhai
    Li, Chunna
    [J]. ECONOMIC RESEARCH-EKONOMSKA ISTRAZIVANJA, 2022, 35 (01): : 2764 - 2780
  • [42] HEAVY-TAILED DISTRIBUTIONS IN FATAL TRAFFIC ACCIDENTS: ROLE OF HUMAN ACTIVITIES
    Tseng, Jie-Jun
    Lee, Ming-Jer
    Li, Sai-Ping
    [J]. INTERNATIONAL JOURNAL OF MODERN PHYSICS C, 2009, 20 (08): : 1281 - 1290
  • [43] Robust Student's t-Based Stochastic Cubature Filter for Nonlinear Systems With Heavy-Tailed Process and Measurement Noises
    Huang, Yulong
    Zhang, Yonggang
    [J]. IEEE ACCESS, 2017, 5 : 7964 - 7974
  • [44] Gaussian Particle Filtering for Nonlinear Systems With Heavy-Tailed Noises: A Progressive Transform-Based Approach
    Zhang, Wen-An
    Zhang, Jie
    Shi, Ling
    Yang, Xusheng
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2024, : 6934 - 6942
  • [45] Gaussian Particle Filtering for Nonlinear Systems With Heavy-Tailed Noises: A Progressive Transform-Based Approach
    Zhang, Wen-An
    Zhang, Jie
    Shi, Ling
    Yang, Xusheng
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (11) : 6934 - 6942
  • [46] Structured Signal Recovery From Non-Linear and Heavy-Tailed Measurements
    Goldstein, Larry
    Minsker, Stanislav
    Wei, Xiaohan
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2018, 64 (08) : 5513 - 5530
  • [47] Fuzzy value-at-risk and expected shortfall for portfolios with heavy-tailed returns
    Moussa, A. Mbairadjim
    Kamdem, J. Sadefo
    Terraza, M.
    [J]. ECONOMIC MODELLING, 2014, 39 : 247 - 256
  • [48] Sequential Fusion for Multirate Multisensor Systems With Heavy-Tailed Noises and Unreliable Measurements
    Yan, Liping
    Di, Chenying
    Wu, Q. M. Jonathan
    Xia, Yuanqing
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (01): : 523 - 532
  • [49] The eigenvalues of the sample covariance matrix of a multivariate heavy-tailed stochastic volatility model
    Janssen, Anja
    Mikosch, Thomas
    Rezapour, Mohsen
    Xie, Xiaolei
    [J]. BERNOULLI, 2018, 24 (02) : 1351 - 1393
  • [50] Dependency measures for the diagnosis of local faults in application to the heavy-tailed vibration signal
    Nowicki, Jakub
    Hebda-Sobkowicz, Justyna
    Zimroz, Radoslaw
    Wylomanska, Agnieszka
    [J]. APPLIED ACOUSTICS, 2021, 178