104 entries in total
[1]
Argall BD (2009) A survey of robot learning from demonstration. Robot Auton Syst 57:469-483
[2]
Arulkumaran K (2017) Deep reinforcement learning: a brief survey. IEEE Signal Process Mag 34:26-38
[3]
Banerjee B (2019) Team learning from human demonstration with coordination confidence. Knowl Eng Rev 34:e12
[4]
Bellman R (1952) On the theory of dynamic programming. Proc Natl Acad Sci USA 38:716-719
[5]
Bengio Y (2013) Representation learning: a review and new perspectives. IEEE Trans Pattern Anal Mach Intell 35:1798-1828
[6]
Chen SA (2019) Active deep Q-learning with demonstration. Mach Learn 109:1-27
[7]
Ecoffet A (2021) First return, then explore. Nature 590:580-586
[8]
García J (2015) A comprehensive survey on safe reinforcement learning. J Mach Learn Res 16:1437-1480
[9]
Katsikopoulos KV (2003) Markov decision processes with delays and asynchronous cost collection. IEEE Trans Autom Control 48:568-574
[10]
Keller B (2020) Optical coherence tomography-guided robotic ophthalmic microsurgery via reinforcement learning from demonstration. IEEE Trans Rob 36:1207-1218