Exponential Hardness of Reinforcement Learning with Linear Function Approximation

被引:0
|
作者
Kane, Daniel [1 ]
Liu, Sihan [1 ]
Lovett, Shachar [1 ]
Mahajan, Gaurav [2 ]
Szepesvári, Csaba [3 ,4 ]
Weisz, Gellért [5 ]
机构
[1] University of California, San Diego, United States
[2] Yale University, United States
[3] DeepMind, London, United Kingdom
[4] University of Alberta, Edmonton, Canada
[5] University College London, London, United Kingdom
来源
Proceedings of Machine Learning Research | 2023年 / 195卷
关键词
Compendex;
D O I
36th Annual Conference on Learning Theory, COLT 2023
中图分类号
学科分类号
摘要
Reinforcement learning
引用
收藏
页码:1588 / 1617
相关论文
共 50 条
  • [1] Neural networks: A general framework for non-linear function approximation
    Institute for Economic Geography and GIScience, Vienna University of Economics and Business Administration, Nordbergstr. 15, Vienna A-1090, Austria
    1600, 521-533 (July 2006):
  • [2] A novel off policy Q(λ) algorithm based on linear function approximation
    Fu, Qi-Ming
    Liu, Quan
    Wang, Hui
    Xiao, Fei
    Yu, Jun
    Li, Jiao
    Jisuanji Xuebao/Chinese Journal of Computers, 2014, 37 (03): : 677 - 686
  • [3] Max product exponential approximation operators
    Bencsik, Attila L.
    Bede, Barnabás
    Noje, Dan
    Nobuhara, Hajime
    Hirota, Kaoru
    IEEE Int Symp Ind Electron, 1600, (542-547):
  • [4] USE OF AN EXPONENTIAL APPROXIMATION FOR HARMONIC ANALYSIS WITH A COMPUTER.
    Popov, P.A.
    Anuchin, A.N.
    Telecommunications and Radio Engineering (English translation of Elektrosvyaz and Radiotekhnika), 1980, 34-35 (12): : 64 - 66
  • [5] Exponential Function Generator.
    Schiffer, Viktor
    Elektronik Munchen, 1981, 30 (18): : 91 - 92
  • [6] A Novel Online Safe Reinforcement Learning with Control Barrier Function Technique for Autonomous vehicles
    Jabbari, Fatemeh
    Samsami, Reza
    Arefi, Mohammad Mehdi
    2024 10th International Conference on Control, Instrumentation and Automation, ICCIA 2024, 2024,
  • [7] Hardness and Approximation for the Star β -Hub Routing Cost Problem in Δβ -Metric Graphs
    Tsai, Meng-Shiou
    Hsieh, Sun-Yuan
    Hung, Ling-Ju
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2024, 14422 LNCS : 97 - 111
  • [8] Robust Control for Uncertain Discrete-Time Linear Systems Using Reinforcement Learning With Discount Factor
    Ding, Yuntian
    Yang, Yuxiao
    Yan, Zhilian
    Tai, Weipeng
    IAENG International Journal of Applied Mathematics, 2024, 54 (12) : 2783 - 2791
  • [9] Podracer architectures for scalable reinforcement learning
    DeepMind, United Kingdom
    arXiv,
  • [10] Learning to Box: Reinforcement Learning using Heuristic Three-step Curriculum Learning
    Rho, Heeseon
    Yu, Yeonguk
    Lee, Kyoobin
    International Conference on Control, Automation and Systems, 2022, 2022-November : 227 - 231