共 36 条
- [31] Sutton RS, 2000, ADV NEUR IN, V12, P1057
- [32] Reinforcement Learning of Motor Skills in High Dimensions: A Path Integral Approach [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2010, : 2397 - 2403
- [33] Thrun S, 1992, Technical Report
- [34] A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems [J]. ACC: PROCEEDINGS OF THE 2005 AMERICAN CONTROL CONFERENCE, VOLS 1-7, 2005, : 300 - 306