Provably efficient learning with typed parametric models

Cited by: 0
Authors
Brunskill, Emma [1]
Leffler, Bethany R. [1]
Li, Hong [1]
Littman, Michael L. [2]
Roy, Nicholas [2]
Affiliations
[1] Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02143, United States
[2] Department of Computer Science, Rutgers University, Piscataway, NJ 08854, United States
Keywords
Compendex;
DOI: not available
Abstract
Markov processes
Pages: 1955-1988
Related papers
50 in total
  • [11] Provably Efficient Imitation Learning from Observation Alone
    Sun, Wen
    Vemula, Anirudh
    Boots, Byron
    Bagnell, J. Andrew
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [12] Provably Efficient Adversarial Imitation Learning with Unknown Transitions
    Xu, Tian
    Li, Ziniu
    Yu, Yang
    Luo, Zhi-Quan
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 2367 - 2378
  • [13] Elliptic PDE learning is provably data-efficient
    Boulle, Nicolas
    Halikias, Diana
    Townsend, Alex
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2023, 120 (39)
  • [14] Provably Efficient Reinforcement Learning with Linear Function Approximation
    Jin, Chi
    Yang, Zhuoran
    Wang, Zhaoran
    Jordan, Michael I.
    MATHEMATICS OF OPERATIONS RESEARCH, 2023, 48 (03) : 1496 - 1521
  • [15] Provably Efficient Reinforcement Learning via Surprise Bound
    Zhu, Hanlin
    Wang, Ruosong
    Lee, Jason D.
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023, 206
  • [16] Stabilizing Q-learning with Linear Architectures for Provably Efficient Learning
    Zanette, Andrea
    Wainwright, Martin J.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022
  • [17] SaDe: Learning Models that Provably Satisfy Domain Constraints
    Goyal, Kshitij
    Dumancic, Sebastijan
    Blockeel, Hendrik
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT V, 2023, 13717 : 410 - 425
  • [18] Efficient shrinkage in parametric models
    Hansen, Bruce E.
    JOURNAL OF ECONOMETRICS, 2016, 190 (01) : 115 - 132
  • [19] Provably Feedback-Efficient Reinforcement Learning via Active Reward Learning
    Kong, Dingwen
    Yang, Lin F.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022
  • [20] Provably Efficient Q-Learning with Low Switching Cost
    Bai, Yu
    Xie, Tengyang
    Jiang, Nan
    Wang, Yu-Xiang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32