39 entries in total
[1]
Achiam Joshua, 2017, Constrained Policy Optimization
[2]
Alshiekh M., 2017, Safe reinforcement learning via shielding
[5]
Berkenkamp Felix, 2017, SAFE MODEL BASED REI
[6]
Chen Kuo, 2018, 2018 IEEE RSJ INT C
[7]
Chua Kurtland, 2018, Deep reinforcement learning in a handful of trials using probabilistic dynamics models
[8]
Coumans E., 2016, PYBULLET PYTHON MODU
[10]
Finn C, 2017, PR MACH LEARN RES, V70