Optimistic PAC Reinforcement Learning: the Instance-Dependent View

Cited by: 0

Authors:

Tirinzoni, Andrea [1]

Al-Marjani, Aymen [2]

Kaufmann, Emilie [3]

Affiliations:

[1] Meta AI, United States

[2] UMPA, ENS Lyon, France

[3] Univ. Lille, CNRS, Inria, Centrale Lille, UMR 9189 - CRIStAL, France

Source:

Keywords:

Compendex;

DOI:

Not available

Chinese Library Classification (CLC) number:

Subject classification number:

Abstract:

Markov processes


Pages: 1460-1480