Exponential Hardness of Reinforcement Learning with Linear Function Approximation

被引：0

作者：

Kane, Daniel ^{[1
]}

Liu, Sihan ^{[1
]}

Lovett, Shachar ^{[1
]}

Mahajan, Gaurav ^{[2
]}

Szepesvári, Csaba ^{[3
,4
]}

Weisz, Gellért ^{[5
]}

机构：

[1] University of California, San Diego, United States

[2] Yale University, United States

[3] DeepMind, London, United Kingdom

[4] University of Alberta, Edmonton, Canada

[5] University College London, London, United Kingdom

来源：

Proceedings of Machine Learning Research | 2023年 / 195卷

关键词：

Compendex;

D O I：

36th Annual Conference on Learning Theory, COLT 2023

中图分类号：

学科分类号：

摘要：

Reinforcement learning

引用

页码：1588 / 1617

共 50 条

[1] Neural networks: A general framework for non-linear function approximation
Institute for Economic Geography and GIScience, Vienna University of Economics and Business Administration, Nordbergstr. 15, Vienna A-1090, Austria
1600, 521-533 (July 2006):
[2] A novel off policy Q(λ) algorithm based on linear function approximation
Fu, Qi-Ming
Liu, Quan
Wang, Hui
Xiao, Fei
Yu, Jun
Li, Jiao
Jisuanji Xuebao/Chinese Journal of Computers, 2014, 37 (03): : 677 - 686
[3] Max product exponential approximation operators
Bencsik, Attila L.
Bede, Barnabás
Noje, Dan
Nobuhara, Hajime
Hirota, Kaoru
IEEE Int Symp Ind Electron, 1600, (542-547):
[4] USE OF AN EXPONENTIAL APPROXIMATION FOR HARMONIC ANALYSIS WITH A COMPUTER.
Popov, P.A.
Anuchin, A.N.
Telecommunications and Radio Engineering (English translation of Elektrosvyaz and Radiotekhnika), 1980, 34-35 (12): : 64 - 66
[5] Exponential Function Generator.
Schiffer, Viktor
Elektronik Munchen, 1981, 30 (18): : 91 - 92
[6] A Novel Online Safe Reinforcement Learning with Control Barrier Function Technique for Autonomous vehicles
Jabbari, Fatemeh
Samsami, Reza
Arefi, Mohammad Mehdi
2024 10th International Conference on Control, Instrumentation and Automation, ICCIA 2024, 2024,
[7] Hardness and Approximation for the Star β -Hub Routing Cost Problem in Δβ -Metric Graphs
Tsai, Meng-Shiou
Hsieh, Sun-Yuan
Hung, Ling-Ju
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2024, 14422 LNCS : 97 - 111
[8] Robust Control for Uncertain Discrete-Time Linear Systems Using Reinforcement Learning With Discount Factor
Ding, Yuntian
Yang, Yuxiao
Yan, Zhilian
Tai, Weipeng
IAENG International Journal of Applied Mathematics, 2024, 54 (12) : 2783 - 2791
[9] Podracer architectures for scalable reinforcement learning
DeepMind, United Kingdom
arXiv,
[10] Learning to Box: Reinforcement Learning using Heuristic Three-step Curriculum Learning
Rho, Heeseon
Yu, Yeonguk
Lee, Kyoobin
International Conference on Control, Automation and Systems, 2022, 2022-November : 227 - 231

← 1 2 3 4 5 →