42 entries in total
- [2] Cox C, 1999, INT J ROBUST NONLIN, V9, P1071, DOI 10.1002/(SICI)1099-1239(19991215)9:14<1071::AID-RNC453>3.0.CO;2-W
- [4] Reinforcement learning in continuous time and space [J]. NEURAL COMPUTATION, 2000, 12 (01) : 219 - 245
- [5] Hanbing Dan, 2021, CES Transactions on Electrical Machines and Systems, V5, P90, DOI 10.30941/CESTEMS.2021.00012
- [10] Policy Gradient Adaptive Critic Designs for Model-Free Optimal Tracking Control With Experience Replay [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (06) : 3692 - 3703