Hybrid Least-Squares Algorithms for Approximate Policy Evaluation

被引:0
|
作者
Johns, Jeff [1 ]
Petrik, Marek [1 ]
Mahadevan, Sridhar [1 ]
机构
[1] Univ Massachusetts, Dept Comp Sci, Amherst, MA 01003 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
引用
收藏
页码:9 / 9
页数:1
相关论文
共 50 条
  • [1] Hybrid least-squares algorithms for approximate policy evaluation
    Johns, Jeff
    Petrik, Marek
    Mahadevan, Sridhar
    MACHINE LEARNING, 2009, 76 (2-3) : 243 - 256
  • [2] Hybrid least-squares algorithms for approximate policy evaluation
    Jeff Johns
    Marek Petrik
    Sridhar Mahadevan
    Machine Learning, 2009, 76 : 243 - 256
  • [3] LEAST-SQUARES ALGORITHMS
    WAMPLER, RH
    AMERICAN STATISTICIAN, 1977, 31 (01): : 52 - 53
  • [4] Probabilistic Approximate Least-Squares
    Bartels, Simon
    Hennig, Philipp
    ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 51, 2016, 51 : 676 - 684
  • [5] FAST LEAST-SQUARES ALGORITHMS
    DAVIDON, WC
    AMERICAN JOURNAL OF PHYSICS, 1977, 45 (03) : 260 - 262
  • [6] Model-based least-squares policy evaluation
    Lu, F
    Schuurmans, D
    ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2003, 2671 : 342 - 352
  • [7] Least-squares policy iteration
    Lagoudakis, MG
    Parr, R
    JOURNAL OF MACHINE LEARNING RESEARCH, 2004, 4 (06) : 1107 - 1149
  • [8] Least-squares policy iteration algorithms for robotics: Online, continuous, and automatic
    Friedrich, Stefan R.
    Schreibauer, Michael
    Buss, Martin
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2019, 83 : 72 - 84
  • [9] EFFICIENT ALGORITHMS FOR LEAST-SQUARES RESTORATION
    AUYEUNG, C
    MERSEREAU, RM
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING IV, PTS 1-3, 1989, 1199 : 1534 - 1540
  • [10] ADAPTIVE LEAST-SQUARES ESCALATOR ALGORITHMS
    MENG, YC
    YANG, SK
    GONG, ZY
    INTERNATIONAL JOURNAL OF ELECTRONICS, 1989, 66 (06) : 857 - 864