共 36 条
- [21] Lawrence ND, 2004, ADV NEUR IN, V16, P329
- [22] Marco A, 2016, IEEE INT CONF ROBOT, P270, DOI 10.1109/ICRA.2016.7487144
- [23] Mukadam M, 2016, IEEE INT CONF ROBOT, P9, DOI 10.1109/ICRA.2016.7487091
- [24] Okadome Y, 2014, IEEE INT C INT ROBOT, P661, DOI 10.1109/IROS.2014.6942629
- [25] Okadome Y, 2013, LECT NOTES COMPUT SC, V8131, P17, DOI 10.1007/978-3-642-40728-4_3
- [26] Reinforcement learning of motor skills with policy gradients [J]. NEURAL NETWORKS, 2008, 21 (04) : 682 - 697
- [28] Rasmussen CE, 2005, ADAPT COMPUT MACH LE, P1
- [29] Ross S, 2011, J MACH LEARN RES, V12, P1729
- [30] A point-based POMDP algorithm for robot planning [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1- 5, PROCEEDINGS, 2004, : 2399 - 2404