共 34 条
[1]
Araar O, 2014, 2014 UKACC INTERNATIONAL CONFERENCE ON CONTROL (CONTROL), P133, DOI 10.1109/CONTROL.2014.6915128
[2]
Branch S. T., 2011, International Journal of Intelligent Information Processing, V2, P74
[3]
Bryson A., 1969, APPL OPTIMAL CONTROL
[4]
Deng XF, 2017, CHIN CONT DECIS CONF, P832, DOI 10.1109/CCDC.2017.7978635
[5]
Ghoreishi S. A., 2011, Int. J. of Intelligent Information Processing, P74
[6]
A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients
[J].
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS,
2012, 42 (06)
:1291-1307
[7]
Hespanha JP, 2009, LINEAR SYSTEMS THEORY, P204
[8]
Ioffe S, 2015, Arxiv, DOI arXiv:1502.03167
[9]
Jacknoon A, 2017, 2017 INTERNATIONAL CONFERENCE ON COMMUNICATION, CONTROL, COMPUTING AND ELECTRONICS ENGINEERING (ICCCCEE)
[10]
Kalman R.E., 1960, BOL SOC MAT MEX, V5, P102