共 115 条
[71]
Reinforcement Learning for Partially Observable Dynamic Processes: Adaptive Dynamic Programming Using Measured Output Data
[J].
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS,
2011, 41 (01)
:14-25
[72]
Lewis FL, 2003, Robot manipulator control: theory and practice
[74]
Liu W, 2010, DES AUT TEST EUROPE, P602
[75]
Liu X, 2000, P AMER CONTR CONF, P1929, DOI 10.1109/ACC.2000.879538
[76]
Maei HamidReza., 2010, P 3 C ARTIFICIAL GEN, P1
[80]
Peters Jan., 2003, 3 IEEE RAS INT C HUM