共 21 条
[1]
LEWIS F L, VRABIE D L, SYRMOS V L., Optimal Control, (2012)
[2]
BERTSEKAS D P., Reinforcement Learning and Optimal Control, (2019)
[3]
MODARES H, LEWIS F L., Linear quadratic tracking control of partially-unknown continuous-time systems using reinforcement learning, IEEE Transactions on Automatic Control, 59, 11, pp. 3051-3056, (2014)
[4]
KIUMARSI B, LEWIS F L, MODARES H, Et al., Reinforcement Q-learning for optimal tracking control of linear discrete-time systems with unknown dynamics, Automatica, 50, 4, pp. 1167-1175, (2014)
[5]
SUN W J, ZHAO G Y, PENG Y J., Adaptive optimal output feedback tracking control for unknown discrete-time linear systems using a combined reinforcement Q-learning and internal model method, IET Control Theory & Applications, 13, 18, pp. 3075-3086, (2019)
[6]
BIAN T, JIANG Z P., Reinforcement learning and adaptive optimal control for continuous-time nonlinear systems: A value iteration approach, IEEE Transactions on Neural Networks and Learning Systems, 33, 7, pp. 2781-2790, (2021)
[7]
RAJASEKARAN P K, SATYANARAYANA N, SRINATH M D., Optimum linear estimation of stochastic signals in the presence of multiplicative noise, IEEE Transactions on Aerospace and Electronic Systems, 7, 3, pp. 462-468, (1971)
[8]
DING D, WANG Z, WEI G, Et al., Event-based security control for discrete-time stochastic systems, IET Control Theory & Applications, 10, 15, pp. 1808-1815, (2016)
[9]
GUAN Z H, CHEN C Y, FENG G, Et al., Optimal tracking performance limitation of networked control systems with limited bandwidth and additive colored white gaussian noise, IEEE Transactions on Circuits & Systems I Regular Papers, 60, 1, pp. 189-198, (2013)
[10]
WANG T, ZHANG H G, LUO Y H., Stochastic linear quadratic optimal control for model-free discrete-time systems based on Q-learning algorithm, Neurocomputing, 312, 27, pp. 1-8, (2018)