共 31 条
- [1] Abbasi-Yadkori Y., 2011, P 24 ANN C LEARNING, P1
- [2] Basei Matteo, 2021, Logarithmic regret for episodic continuoustime linear-quadratic reinforcement learning over a finite-time horizon
- [4] Bosworth J. T, 1992, Linearized aerodynamic and control law models of the X-29A airplane and comparison with flight data, V4356
- [7] Cassel A., 2020, PR MACH LEARN RES, P1328
- [8] Chen Xinyi, 2021, P MACHINE LEARNING R, V134
- [9] Reinforcement learning in continuous time and space [J]. NEURAL COMPUTATION, 2000, 12 (01) : 219 - 245