Approximate Newton methods for policy search in Markov decision processes
Cited by: 0
Authors:
Furmston, Thomas [1]
Lever, Guy [1]
Barber, David [1]
Affiliations:
[1] Department of Computer Science, University College London, London, WC1E 6BT, United Kingdom