Exponential Hardness of Reinforcement Learning with Linear Function Approximation

被引：0

作者：

Kane, Daniel ^{[1
]}

Liu, Sihan ^{[1
]}

Lovett, Shachar ^{[1
]}

Mahajan, Gaurav ^{[2
]}

Szepesvári, Csaba ^{[3
,4
]}

Weisz, Gellért ^{[5
]}

机构：

[1] University of California, San Diego, United States

[2] Yale University, United States

[3] DeepMind, London, United Kingdom

[4] University of Alberta, Edmonton, Canada

[5] University College London, London, United Kingdom

来源：

Proceedings of Machine Learning Research | 2023年 / 195卷

关键词：

Compendex;

D O I：

36th Annual Conference on Learning Theory, COLT 2023

中图分类号：

学科分类号：

摘要：

Reinforcement learning

引用

页码：1588 / 1617

共 50 条

[21] An expansion for the sum of a product of an exponential and a Bessel function. II
Paris, Richard B.
arXiv, 2022,
[22] Adaptive evolution strategy with ensemble of mutations for Reinforcement Learning
Ajani, Oladayo S.
Mallipeddi, Rammohan
Knowledge-Based Systems, 2022, 245
[23] Hamilton-Jacobi Reachability in Reinforcement Learning: A Survey
Ganai, Milan
Gao, Sicun
Herbert, Sylvia L.
IEEE Open Journal of Control Systems, 2024, 3 : 310 - 324
[24] Finite mixture estimation algorithm for arbitrary function approximation
University of Ljubljana, Faculty of Mechanical Engineering, Aškerceva 6, 1000 Ljubljana, Slovenia
Stroj Vest, 2 (115-124):
[25] Approximation of the Probability Function of Adjacent Pulse Disturbances.
Larsen, Guenter
AEU. Archiv fur Elektronik und Ubertragungstechnik, 1979, 33 (10): : 403 - 406
[26] ON APPROXIMATION OF THE TRANSFER FUNCTION FOR A PULSE SHAPING CIRCUIT.
Filanovsky, I.M.
Stromsmoe, K.A.
Journal Water Pollution Control Federation, 1980, : 592 - 596
[27] Special Issue on Aerospace and Mechanical Applications of Reinforcement Learning and Adaptive Learning Based Control
How, Jonathan P.
Chowdhary, Girish
Walsh, Thomas
JOURNAL OF AEROSPACE INFORMATION SYSTEMS, 2014, 11 (09): : 541 - 541
[28] ADAPTIVE LINEAR FUNCTION ELIMINATION FILTER
PLOTKIN, E
WULICH, D
INTERNATIONAL JOURNAL OF ELECTRONICS, 1979, 47 (04) : 355 - 364
[29] Reinforcement Learning Based Techniques for Radar Anti-Jamming
Institute of Space Technology, Electrical Engineering Department, Islamabad, Pakistan
Proc. Int. Bhurban Conf. Appl. Sci. Technol., IBCAST, (1021-1025):
[30] Optimistic PAC Reinforcement Learning: the Instance-Dependent View
Tirinzoni, Andrea
Al-Marjani, Aymen
Kaufmann, Emilie
Proceedings of Machine Learning Research, 2023, 201 : 1460 - 1480

← 1 2 3 4 5 →